EDITORIAL


The editor and his associates are reading 
more these days but are not always enjoying 
it more. This is not because manuscripts are 
uninteresting or lacking in merit. The prob- 
lem is quite the reverse. With more papers 
than ever of a quality to merit publication, 
the problem grows increasingly difficult of se- 
lecting the narrow number which our page al- 
lotment will tolerate. During 1962 we received 
over 700 manuscripts, an increase of approxi- 
mately 20% over the number received in 
1961. With this rise in manuscripts goes a 
heavy increase in papers falling into the in- 
termediate category of judgment so that more 
time has to be spent on deciding about a cut- 
ting point. Moreover, authors have been so 
cooperative that our problems of overload 
continue to mount. When papers are rejected 
because of improper statistical analysis or 
weak findings, authors do a highly commend- 
able job of reworking their data and of run- 
ning additional subjects. This is fine for ad- 
vancing our field as a science. It does not, 
however, solve the problem of how one small 
journal can effectively serve the three bur- 
geoning areas of personality dynamics, abnor- 
mal and clinical psychology, and social psy- 
chology. 

Our criteria for the selection of papers must
grow increasingly narrow if we are to handle
the problem within the existing context of a
single journal. We will have to place increasing
emphasis upon the amount of clear and
definitive information a paper adds to a subject
which has already seen considerable exploration.
A technically adequate study, the
results of which further muddy the already
muddied waters of an area, is not a good candidate
for acceptance. A hypothesis-testing
study should deal with predictions that bear
significantly on some theoretical issue. An
empirical study should go deep or wide and in
the latter case should have an adequate sampling
design. A theoretical contribution should
attempt some degree of formalization. Our
criteria, then, for selecting papers must go
beyond the assessment of technical competence.
This type of emphasis has drawbacks.
It is easier to be objective in evaluating technique
than theory. Nonetheless, there is no
choice and the editor can do no more than 
seek expert advice, exercise his own judgment, 
and hope that his mistakes will not result in 
the loss of outstanding contributions to the 
JASP. 

Psychologists know about cultural lag but 
do little about it. We are still operating the 
editorial side of our journals within the frame- 
work of an earlier era. In that more leisurely 
period a professor could be asked to take on 
the additional role of editing a small journal 
in an area in which he was very knowledge- 
able because of its relative lack of specializa- 
tion and elaboration. Today the editorial role 
is a full-time task which we assign to an over- 
committed colleague to do on marginal time. 
The obvious solution is for the editor to share
his burdens with as many members of the
profession as possible, though he must carry
the responsibility for final decisions. In fol- 
lowing this pattern I have been impressed by 
the high level of professional commitment of 
psychologists to the advancement of their 
discipline as a science. The response to re- 
quests to evaluate manuscripts and the care 
and objectivity of the reviews show a re- 
markable dedication to the cooperative ven- 
ture of psychological research. We are all in- 
debted to the following unofficial consultants 
who have served as referees for a number of 
papers: 

Elliot Aronson
Richard Atkinson
Kurt W. Back
Theodore X. Barber
Dana Bramel
John Brownfain
Harry Burdick
Eugene Burnstein
Al Burstein
Arnold Buss
Earl Carlson
Irwin L. Child
Arthur R. Cohen
Philip Converse
Clyde Coombs
Lee Cronbach
Anthony Davids
William Dember
Elizabeth Douvan
Don E. Dulany, Jr.
Norman T. Feather
Martin Fishbein
Bert Forrin
Basil Georgopoulos
Harold B. Gerard
David C. Glass
Lewis Goldberg
Michael J. Goldstein
Jesse Gordon
John Harding
Jerry Hirsch
Jack E. Hokanson
Edwin P. Hollander
James J. Jenkins
Jerome Kagan
Solis L. Kates
George Levinger
Maurice Lorr
Charles McClintock
Eleanor Maccoby
Melvin Manis
Gerald A. Mendelsohn
Charles Metzner
John Milholland
Thomas Natsoulas
Theodore Newcomb
Warren Norman
David Pearl
Donald Pelz
Sid Perloe
Marc Pilisuk
Walter R. Reitman
Herbert Saltzstein
Irwin G. Sarason
Pauline Sears
Joseph Speisman
Mark Spivak
Ivan D. Steiner
Jack A. Vernon
Robert Watson
Andre M. Weitzenhoffer
Ben Willerman
C. Leland Winder
Julius Wishner
Herman A. Witkin
Philip G. Zimbardo


I would also like to record my deep gratitude
for the continuing help of the members
of the Editorial Board who carry a heavy
load of editorial work 12 months of the
year. A particularly rewarding experience for
the editor is the amount of patience, tolerance,
and cooperation which the overwhelming
majority of authors have demonstrated.

The APA office and the Managing Editor
have brought off the difficult accomplishment
of eliminating the production lag in JASP. It
now remains for the editor to reduce the increased
publication lag resulting from the
20% rise in manuscripts.


Daniel Katz
IMITATION OF FILM-MEDIATED AGGRESSIVE MODELS¹


ALBERT BANDURA, DOROTHEA ROSS,² AND SHEILA A. ROSS
Stanford University 


In a test of the hypothesis that exposure of children to film-mediated aggressive 
models would increase the probability of Ss’ aggression to subsequent frustra- 
tion, 1 group of experimental Ss observed real-life aggressive models, a 2nd 
observed these same models portraying aggression on film, while a 3rd group 
viewed a film depicting an aggressive cartoon character. Following the exposure 
treatment, Ss were mildly frustrated and tested for the amount of imitative
and nonimitative aggression in a different experimental setting. The overall
results provide evidence for both the facilitating and the modeling influence 
of film-mediated aggressive stimulation. In addition, the findings reveal that 
the effects of such exposure are to some extent a function of the sex of the 
model, sex of the child, and the reality cues of the model. 


Most of the research on the possible effects 
of film-mediated stimulation upon subsequent 
aggressive behavior has focused primarily on 
the drive reducing function of fantasy. While 
the experimental evidence for the catharsis 
or drive reduction theory is equivocal (Albert, 
1957; Berkowitz, 1962; Emery, 1959; Fesh- 
bach, 1955, 1958; Kenny, 1952; Lovaas, 
1961; Siegel, 1956), the modeling influence 
of pictorial stimuli has received little research 
attention. 

A recent incident (San Francisco Chronicle, 
1961) in which a boy was seriously knifed 
during a re-enactment of a switchblade knife 
fight the boys had seen the previous evening 
on a televised rerun of the James Dean movie, 
Rebel Without a Cause, is a dramatic illustra- 
tion of the possible imitative influence of 
film stimulation. Indeed, anecdotal data sug- 
gest that portrayal of aggression through 
pictorial media may be more influential in 
shaping the form aggression will take when a 
person is instigated on later occasions, than in 
altering the level of instigation to aggression. 

In an earlier experiment (Bandura &
Huston, 1961), it was shown that children
readily imitated aggressive behavior exhibited
by a model in the presence of the model. A
succeeding investigation (Bandura, Ross, &
Ross, 1961) demonstrated that children exposed
to aggressive models generalized aggressive
responses to a new setting in which the
model was absent. The present study sought
to determine the extent to which film-mediated
aggressive models may serve as an important
source of imitative behavior.

¹ This investigation was supported in part by Research
Grants M-4398 and M-5162 from the National
Institute of Health, United States Public Health
Service, and the Lewis S. Haas Child Development
Research Fund, Stanford University.

The authors are indebted to David J. Hicks for
his generous assistance with the photography and to
John Steinbruner who assisted with various phases
of this study.

² This research was carried out while the junior
author was the recipient of an American Association
of University Women International Fellowship for
postdoctoral research.

Aggressive models can be ordered on a reality- 
fictional stimulus dimension with real-life 
models located at the reality end of the con- 
tinuum, nonhuman cartoon characters at the 
fictional end, and films portraying human 
models occupying an intermediate position. It 
was predicted, on the basis of saliency and 
similarity of cues, that the more remote the 
model was from reality, the weaker would be 
the tendency for subjects to imitate the be- 
havior of the model. 

Of the various interpretations of imitative 
learning, the sensory feedback theory of 
imitation recently proposed by Mowrer (1960) 
is elaborated in greatest detail. According to 
this theory, if certain responses have been 
repeatedly positively reinforced, proprioceptive 
stimuli associated with these responses acquire 
secondary reinforcing properties and thus the 
individual is predisposed to perform the be- 
havior for the positive feedback. Similarly, 
if responses have been negatively reinforced,
response-correlated stimuli acquire the capacity
to arouse anxiety which, in turn, inhibits
the occurrence of the negatively valenced
behavior. On the basis of these considerations,
it was predicted that subjects who manifest high
aggression anxiety would perform significantly 
less imitative and nonimitative aggression than 
subjects who display little anxiety over ag- 
gression. Since aggression is generally con- 
sidered female inappropriate behavior, and 
therefore likely to be negatively reinforced in 
girls (Sears, Maccoby, & Levin, 1957), it was 
also predicted that male subjects would be 
more imitative of aggression than females. 

To the extent that observation of adults 
displaying aggression conveys a certain degree 
of permissiveness for aggressive behavior, it 
may be assumed that such exposure not only 
facilitates the learning of new aggressive re- 
sponses but also weakens competing inhibitory 
responses in subjects and thereby increases the 
probability of occurrence of previously learned 
patterns of aggression. It was predicted, there- 
fore, that subjects who observed aggressive 
models would display significantly more ag- 
gression when subsequently frustrated than 
subjects who were equally frustrated but who 
had no prior exposure to models exhibiting 
aggression. 


METHOD 


Subjects 


The subjects were 48 boys and 48 girls enrolled in 
the Stanford University Nursery School. They ranged 
in age from 35 to 69 months, with a mean age of 
52 months. 

Two adults, a male and a female, served in the 
role of models both in the real-life and the human 
film-aggression condition, and one female experi- 
menter conducted the study for all 96 children. 


General Procedure 


Subjects were divided into three experimental 
groups and one control group of 24 subjects each. 
One group of experimental subjects observed real- 
life aggressive models, a second group observed these 
same models portraying aggression on film, while a 
third group viewed a film depicting an aggressive 
cartoon character. The experimental groups were fur- 
ther subdivided into male and female subjects so 
that half the subjects in the two conditions involv- 
ing human models were exposed to same-sex models, 
while the remaining subjects viewed models of the 
opposite sex. 

Following the exposure experience, subjects were 
tested for the amount of imitative and nonimita- 
tive aggression in a different experimental setting 
in the absence of the models. 

The control group subjects had no exposure to
the aggressive models and were tested only in the
generalization situation.

Subjects in the experimental and control groups 
were matched individually on the basis of ratings 
of their aggressive behavior in social interactions in 
the nursery school. The experimenter and a nursery 
school teacher rated the subjects on four five-point 
rating scales which measured the extent to which 
subjects displayed physical aggression, verbal ag- 
gression, aggression toward inanimate objects, and 
aggression inhibition. The latter scale, which dealt with 
the subjects’ tendency to inhibit aggressive reactions 
in the face of high instigation, provided the measure 
of aggression anxiety. Seventy-one percent of the
subjects were rated independently by both judges
so as to permit an assessment of interrater agreement.
The reliability of the composite aggression score, 
estimated by means of the Pearson product-moment 
correlation, was .80. 

Data for subjects in the real-life aggression con- 
dition and in the control group were collected as 
part of a previous experiment (Bandura et al., 1961).
Since the procedure is described in detail in the 
earlier report, only a brief description of it will be 
presented here. 


Experimental Conditions 


Subjects in the Real-Life Aggressive condition were 
brought individually by the experimenter to the 
experimental room and the model, who was in the 
hallway outside the room, was invited by the ex- 
perimenter to come and join in the game. The subject 
was then escorted to one corner of the room and 
seated at a small table which contained potato 
prints, multicolor picture stickers, and colored paper. 
After demonstrating how the subject could design 
pictures with the materials provided, the experi- 
menter escorted the model to the opposite corner 
of the room which contained a small table and 
chair, a tinker toy set, a mallet, and a 5-foot 
inflated Bobo doll. The experimenter explained that 
this was the model’s play area and after the model
was seated, the experimenter left the experimental
room.

The model began the session by assembling the
tinker toys but after approximately a minute had
elapsed, the model turned to the Bobo doll and
spent the remainder of the period aggressing toward
it with highly novel responses which are unlikely
to be performed by children independently of the
observation of the model’s behavior. Thus, in
addition to punching the Bobo doll, the model
exhibited the following distinctive aggressive acts
which were to be scored as imitative responses:

The model sat on the Bobo doll and punched it
repeatedly in the nose.

The model then raised the Bobo doll and pommeled
it on the head with a mallet.

Following the mallet aggression, the model tossed
the doll up in the air aggressively and kicked it
about the room. This sequence of physically aggressive
acts was repeated approximately three times
interspersed with verbally aggressive responses such
as, “Sock him in the nose . . . ,” “Hit him down . . . ,”
“Throw him in the air . . . ,” “Kick him . . . ,” and
“Pow.”


Subjects in the Human Film-Aggression condition 
were brought by the experimenter to the semi- 
darkened experimental room, introduced to the 
picture materials, and informed that while the sub- 
jects worked on potato prints, a movie would be 
shown on a screen, positioned approximately 6 
feet from the subject’s table. The movie projector 
was located in a distant corner of the room and 
was screened from the subject’s view by large wooden 
panels.

The color movie and a tape recording of the
sound track were started by a male projectionist as
soon as the experimenter left the experimental room
and were shown for a duration of 10 minutes. The
models in the film presentations were the same adult
males and females who participated in the Real-Life
condition of the experiment. Similarly, the aggressive
behavior they portrayed in the film was identical
with their real-life performances.

For subjects in the Cartoon Film-Aggression con- 
dition, after seating the subject at the table with 
the picture construction material, the experimenter 
walked over to a television console approximately 
3 feet in front of the subject’s table, remarked, 
“I guess I'll turn on the color TV,” and ostensibly 
tuned in a cartoon program. The experimenter then 
left the experimental room. The cartoon was shown 
on a glass lens screen in the television set by means 
of a rear projection arrangement screened from the 
subject's view by large panels. 

The sequence of aggressive acts in the cartoon was 
performed by the female model costumed as a black 
cat similar to the many cartoon cats. In order to 
heighten the level of irreality of the cartoon, the 
floor area was covered with artificial grass and the 
walls forming the backdrop were adorned with 
brightly colored trees, birds, and butterflies creating 
a fantasyland setting. The cartoon began with a 
close-up of a stage on which the curtains were 
slowly drawn revealing a picture of a cartoon cat 
along with the title, Herman the Cat. The remainder 
of the film showed the cat pommeling the Bobo doll 
on the head with a mallet, sitting on the doll and 
punching it in the nose, tossing the doll in the air, 
and kicking it about the room in a manner identical 
with the performance in the other experimental conditions
except that the cat's movements were characteristically
feline. To induce further a cartoon set,
the program was introduced and concluded with
appropriate cartoon music, and the cat's verbal aggression
was repeated in a high-pitched, animated
voice.

In both film conditions, at the conclusion of the 
movie the experimenter entered the room and then 
escorted the subject to the test room. 


Aggression Instigation 


In order to differentiate clearly the exposure and
test situations, subjects were tested for the amount
of imitative learning in a different experimental
room which was set off from the main nursery
school building.

The degree to which a child has learned aggres- 
sive patterns of behavior through imitation be- 
comes most evident when the child is instigated to 
aggression on later occasions. Thus, for example, the 
effects of viewing the movie, Rebel Without a Cause, 
were not evident until the boys were instigated to 
aggression the following day, at which time they 
re-enacted the televised switchblade knife fight in 
considerable detail. For this reason, the children in 
the experiment, both those in the control group, 
and those who were exposed to the aggressive models, 
were mildly frustrated before they were brought 
to the test room. 

Following the exposure experience, the experimenter 
brought the subject to an anteroom which contained 
a varied array of highly attractive toys. The ex- 
perimenter explained that the toys were for the sub- 
ject to play with, but, as soon as the subject became 
sufficiently involved with the play material, the 
experimenter remarked that these were her very best 
toys, that she did not let just anyone play with 
them, and that she had decided to reserve these toys 
for some other children. However, the subject could 
play with any of the toys in the next room. The 
experimenter and the subject then entered the ad- 
joining experimental room. 

It was necessary for the experimenter to remain 
in the room during the experimental session; other- 
wise, a number of the children would either refuse 
to remain alone or would leave before the termination
of the session. In order to minimize any influence
her presence might have on the subject’s behavior,
the experimenter remained as inconspicuous
as possible by busying herself with paper work at a 
desk in the far corner of the room and avoiding 
any interaction with the child. 


Test for Delayed Imitation 


The experimental room contained a variety of 
toys, some of which could be used in imitative 
or nonimitative aggression, and others which tended 
to elicit predominantly nonaggressive forms of be- 
havior. The aggressive toys included a 3-foot Bobo 
doll, a mallet and peg board, two dart guns, and 
a tether ball with a face painted on it which hung 
from the ceiling. The nonaggressive toys, on the 
other hand, included a tea set, crayons and coloring 
paper, a ball, two dolls, three bears, cars and trucks, 
and plastic farm animals.

In order to eliminate any variation in behavior 
due to mere placement of the toys in the room, the 
play material was arranged in a fixed order for each 
of the sessions. 

The subject spent 20 minutes in the experimental
room during which time his behavior was rated in
terms of predetermined response categories by judges
who observed the session through a one-way mirror
in an adjoining observation room. The 20-minute
session was divided into 5-second intervals by means
TABLE 1
MEAN AGGRESSION SCORES FOR SUBGROUPS OF EXPERIMENTAL AND CONTROL SUBJECTS

                                          Experimental groups
Response category          Real-life aggressive    Human film-aggressive    Cartoon film-    Control
                           F Model     M Model     F Model     M Model      aggressive       group

Total aggression
  Girls                    65.8        57.3        87.0        79.5         80.9             36.4
  Boys                     76.8        131.8       114.5       85.0         117.2            72.2
Imitative aggression
  Girls                    19.2        9.2         10.0        8.0          7.8              1.8
  Boys                     18.4        38.4        34.3        13.3         16.2             3.9
Mallet aggression
  Girls                    17.2        18.7        49.2        19.5         36.8             13.1
  Boys                     15.5        28.8        20.5        16.3         12.5             13.5
Sits on Bobo dollᵃ
  Girls                    10.4        5.6         10.3        4.5          15.3             3.3
  Boys                     1.3         0.7         7.7         0.0          5.6              0.6
Nonimitative aggression
  Girls                    27.6        24.9        24.0        34.3         21.5             17.8
  Boys                     35.5        48.6        46.8        31.8         71.8             40.4
Aggressive gun play
  Girls                    1.8         4.5         3.8         17.6         8.8              3.7
  Boys                     7.3         15.9        12.8        23.7         16.6             14.3

ᵃ This response category was not included in the total aggression score.



of an electric interval timer, thus yielding a total 
number of 240 response units for each subject. 
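
The figure of 240 response units follows directly from the session length and the interval length. A minimal sketch of that arithmetic is given here; the function name and its parameters are illustrative only and do not come from the original report.

```python
def response_units(session_minutes: int, interval_seconds: int) -> int:
    """Number of equal scoring intervals in one observation session."""
    total_seconds = session_minutes * 60
    if total_seconds % interval_seconds != 0:
        raise ValueError("interval does not divide the session evenly")
    return total_seconds // interval_seconds


# A 20-minute session scored in 5-second intervals, as described in the text.
units = response_units(session_minutes=20, interval_seconds=5)
print(units)
```

Each of these units is one opportunity to record a response category, so a subject's score on any measure cannot exceed this count.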

The male model scored the experimental sessions
for all subjects. In order to provide an estimate
of interjudge agreement, the performances of 40% 
of the subjects were scored independently by a 
second observer. The responses scored involved highly 
specific concrete classes of behavior, and yielded high 
interscorer reliabilities, the product-moment coeffi- 
cients being in the .90s. 


Response Measures 


The following response measures were obtained: 

Imitative aggression. This category included acts of 
striking the Bobo doll with the mallet, sitting on 
the doll and punching it in the nose, kicking the 
doll, tossing it in the air, and the verbally aggressive 
responses, “Sock him,” “Hit him down,” “Kick him,” 
“Throw him in the air,” and “Pow.” 

Partially imitative responses. A number of subjects 
imitated the essential components of the model’s 
behavior but did not perform the complete act, or 
they directed the imitative aggressive response to 
some object other than the Bobo doll. Two responses 
of this type were scored and were interpreted as 
partially imitative behavior: 

Mallet aggression. The subject strikes objects 
other than the Bobo doll aggressively with the mallet. 

Sits on Bobo doll. The subject lays the Bobo doll 
on its side and sits on it, but does not aggress 
toward it.

Nonimitative aggression. This category included 
acts of punching, slapping, or pushing the doll, 


physically aggressive acts directed toward objects 
other than the Bobo doll, and any hostile remarks 
except for those in the verbal imitation category; 
for example, “Shoot the Bobo,” “Cut him,” “Stupid 
ball,” “Knock over people,” “Horses fighting, biting.”

Aggressive gun play. The subject shoots darts or
aims the guns and fires imaginary shots at objects in 
the room. 

Ratings were also made of the number of be- 
havior units in which subjects played nonaggressively 
or sat quietly and did not play with any of the 
material at all.


RESULTS 


The mean imitative and nonimitative ag- 
gression scores for subjects in the various 
experimental and control groups are presented 
in Table 1. 

Since the distributions of scores departed 
from normality and the assumption of homo- 
geneity of variance could not be made for 
most of the measures, the Friedman two-way
analysis of variance by ranks was employed 
for testing the significance of the obtained 
differences. 
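
For readers unfamiliar with the procedure, the Friedman statistic can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' computation: the function names are hypothetical, the scores are invented, each inner list stands for one matched set of subjects scored under the four treatment conditions, and no correction for ties is applied beyond assigning average ranks.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..k, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for pos in range(i, j + 1):
            ranks[order[pos]] = mean_rank
        i = j + 1
    return ranks


def friedman_chi_square(blocks: Sequence[Sequence[float]]) -> float:
    """Friedman statistic: 12 / (n k (k+1)) * sum(R_j^2) - 3 n (k+1)."""
    n = len(blocks)
    k = len(blocks[0])
    rank_sums = [0.0] * k
    for row in blocks:
        if len(row) != k:
            raise ValueError("every block needs one score per condition")
        for j, rank in enumerate(average_ranks(row)):
            rank_sums[j] += rank
    return 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)


# Hypothetical matched sets: real-life, human film, cartoon film, control.
hypothetical_sets = [
    [80, 95, 90, 50],
    [70, 85, 100, 60],
    [90, 88, 97, 40],
]
print(friedman_chi_square(hypothetical_sets))
```

The statistic is referred to the chi-square distribution with k - 1 degrees of freedom, which is 3 for the four conditions compared in this study.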


Total Aggression


The mean total aggression scores for subjects
in the real-life, human film, cartoon film,
and the control groups are 83, 92, 99, and 54,
respectively. The results of the analysis of
variance performed on these scores reveal that
the main effect of treatment conditions is
significant (χᵣ² = 9.06, p < .05), confirming
the prediction that exposure of subjects to 
aggressive models increases the probability 
that subjects will respond aggressively when 
instigated on later occasions. Further analyses 
of pairs of scores by means of the Wilcoxon 
matched-pairs signed-ranks test show that 
subjects who viewed the real-life models and 
the film-mediated models do not differ from 
each other in total aggressiveness but all three 
experimental groups expressed significantly 
more aggressive behavior than the control 
subjects (Table 2). 
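
The four group means quoted above can be recovered from the subgroup means for total aggression in Table 1. The sketch below assumes that the subgroups within each condition are of equal size, so that the condition mean is the simple average of its subgroup means; that assumption is the editor's, not a statement from the report.

```python
from statistics import mean

# Subgroup means for total aggression, copied from Table 1
# (girls then boys; female model then male model where applicable).
total_aggression = {
    "real-life": [65.8, 57.3, 76.8, 131.8],
    "human film": [87.0, 79.5, 114.5, 85.0],
    "cartoon film": [80.9, 117.2],
    "control": [36.4, 72.2],
}

# Assumption: equal subgroup sizes, so an unweighted average is appropriate.
group_means = {name: round(mean(cells)) for name, cells in total_aggression.items()}
print(group_means)
```

Under that assumption the rounded values agree with the means of 83, 92, 99, and 54 reported in the text.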


Imitative Aggressive Responses 


The Friedman analysis reveals that exposure
of subjects to aggressive models is also
a highly effective method for shaping subjects’
aggressive responses (χᵣ² = 23.88, p < .001).
Comparisons of treatment conditions by the 
Wilcoxon test reveal that subjects who ob- 
served the real-life models and the film- 
mediated models, relative to subjects in the 
control group, performed considerably more 
imitative physical and verbal aggression 
(Table 2). 
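
The pairwise comparisons rest on the Wilcoxon matched-pairs signed-ranks statistic, which can be sketched as follows. The function name and the scores are hypothetical, pairs with a zero difference are dropped (a common convention, assumed here), and the returned T must still be referred to a table of critical values for the number of nonzero pairs.

```python
from typing import List, Sequence, Tuple


def signed_rank_statistic(first: Sequence[float],
                          second: Sequence[float]) -> Tuple[float, int]:
    """Return (T, n): the smaller signed-rank sum and the count of nonzero pairs."""
    if len(first) != len(second):
        raise ValueError("matched pairs require equal-length samples")
    # Differences within each matched pair; zero differences are discarded.
    diffs = [a - b for a, b in zip(first, second) if a != b]
    diffs.sort(key=abs)
    ranks: List[float] = [0.0] * len(diffs)
    i = 0
    while i < len(diffs):
        j = i
        # Tied absolute differences share the mean of the ranks they span.
        while j + 1 < len(diffs) and abs(diffs[j + 1]) == abs(diffs[i]):
            j += 1
        for pos in range(i, j + 1):
            ranks[pos] = (i + j) / 2 + 1
        i = j + 1
    w_plus = sum(r for d, r in zip(diffs, ranks) if d > 0)
    w_minus = sum(r for d, r in zip(diffs, ranks) if d < 0)
    return min(w_plus, w_minus), len(diffs)


# Hypothetical imitative aggression scores for matched subjects.
hypothetical_film = [10, 5, 9, 12]
hypothetical_control = [9, 7, 6, 8]
print(signed_rank_statistic(hypothetical_film, hypothetical_control))
```

A small T relative to the number of pairs indicates that the differences run predominantly in one direction.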

Illustrations of the extent to which some 
of the subjects became virtually "carbon 
copies" of their models in aggressive behavior 
are presented in Figure 1. The top frame 
shows the female model performing the four 
novel aggressive responses; the lower frames 
depict a male and a female subject reproducing
the behavior of the female model they
had observed earlier on film.

The prediction that imitation is positively 
related to the reality cues of the model was 
only partially supported. While subjects who 
observed the real-life aggressive models ex- 
hibited significantly more imitative aggression 
than subjects who viewed the cartoon model, 
no significant differences were found between 
the live and film, and the film and cartoon 
conditions, nor did the three experimental 
groups differ significantly in total aggression 
or in the performances of partially imitative 
behavior (Table 2). Indeed, the available 
data suggest that, of the three experimental 
conditions, exposure to humans on film por- 
traying aggression was the most influential in 
eliciting and shaping aggressive behavior. 
Subjects in this condition, in relation to the 
control subjects, exhibited more total aggres- 
sion, more imitative aggression, more partially 
imitative behavior, such as sitting on the Bobo 
doll and mallet aggression, and they engaged 
in significantly more aggressive gun play. In 
addition, they performed significantly more 
aggressive gun play than did subjects who 
were exposed to the real-life aggressive models 
(Table 2). 


Influence of Sex of Model and Sex of Child


In order to determine the influence of sex
of model and sex of child on the expression of
imitative and nonimitative aggression, the
data from the experimental groups were combined
and the significance of the differences
between groups was assessed by t tests for


TABLE 2
SIGNIFICANCE OF THE DIFFERENCES BETWEEN EXPERIMENTAL AND CONTROL GROUPS
IN THE EXPRESSION OF AGGRESSION

                                                   Comparison of treatment conditionsᵃ
Response category          χᵣ²      p              Live vs.  Live vs.  Film vs.  Live vs.  Film vs.  Cartoon vs.
                                                   Film      Cartoon   Cartoon   Control   Control   Control

Total aggression           9.06     <.05           ns        ns        ns        <.01      <.01      <.005
Imitative aggression       23.88    <.001          ns        <.05      ns        <.001     <.001     <.005
Partial imitation
  Mallet aggression        7.36     .10 > p > .05
  Sits on Bobo doll        8.05     <.05           ns        ns        ns        ns        <.05      <.005
Nonimitative aggression    7.28     .10 > p > .05
Aggressive gun play        8.06     <.05           <.01ᵇ     ns        ns        ns        <.05      ns

ᵃ The probability values are based on the Wilcoxon test.
ᵇ This probability value is based on a two-tailed test of significance.

Fig. 1. Photographs from the film, Social Learning of Aggression through Imitation
of Aggressive Models.


uncorrelated means. In statistical comparisons 
involving relatively skewed distributions of 
scores the Mann-Whitney U test was em- 
ployed. 

Sex of subjects had a highly significant 
effect on both the learning and the perform- 
ance of aggression. Boys, in relation to girls, 
exhibited significantly more total aggression
(t = 2.69, p < .01), more imitative aggression
(t = 2.82, p < .005), more aggressive gun
play (z = 3.38, p < .001), and more nonimitative
aggressive behavior (t = 2.98, p < .005).
Girls, on the other hand, were more inclined 
than boys to sit on the Bobo doll but refrained 
from punching it (z = 3.47, p < .001). 

The analyses also disclosed some influences 
of the sex of the model. Subjects exposed to 
the male model, as compared to the female 
model, expressed significantly more aggressive 
gun play (z = 2.83, p < .005). The most
marked differences in aggressive gun play
(U = 9.5, p < .001), however, were found
between girls exposed to the female model
(M = 2.9) and males who observed the male
model (M = 19.8). Although the overall
model difference in partially imitative behavior,
Sits on Bobo, was not significant,
Sex × Model subgroup comparisons yielded
some interesting results. Boys who observed
the aggressive female model, for example,
were more likely to sit on the Bobo doll
without punching it than boys who viewed
the male model (U = 33, p < .05). Girls
reproduced the nonaggressive component of
the male model’s aggressive pattern of behavior
(i.e., sat on the doll without punching
it) with considerably higher frequency than
did boys who observed the same model (U =
21.5, p < .02). The highest incidence of
partially imitative responses was yielded by
the group of girls who viewed the aggressive
female model (M = 10.4), and the lowest
values by the boys who were exposed to the
male model (M = 0.3). This difference was
significant beyond the .05 significance level.
These findings, along with the sex of child and
sex of model differences reported in the
preceding sections, provide further support for
the view that the influence of models in
promoting social learning is determined, in
part, by the sex appropriateness of the model’s
behavior (Bandura et al., 1961).

Aggressive Predisposition and Imitation


Since the correlations between ratings of 
aggression and the measures of imitative and 
total aggressive behavior, calculated separately 
for boys and girls in each of the experimental 
conditions, did not differ significantly, the 
data were combined. The correlational anal- 
yses performed on these pooled data failed to 
yield any significant relationships between 
ratings of aggression anxiety, frequency of 
aggressive behavior, and the experimental 
aggression measures. In fact, the array means 
suggested nonlinear regressions although the 
departures from linearity were not of sufficient 
magnitude to be statistically significant. 


Discussion 


The results of the present study provide 
strong evidence that exposure to filmed ag- 
gression heightens aggressive reactions in 
children. Subjects who viewed the aggressive 
human and cartoon models on film exhibited 
nearly twice as much aggression as did
subjects in the control group who were not 
exposed to the aggressive film content. 

In the experimental design typically em- 
ployed for testing the possible cathartic func- 
tion of vicarious aggression, subjects are first 
frustrated, then provided with an opportunity 
to view an aggressive film following which 
their overt or fantasy aggression is measured. 
While this procedure yields some information 
on the immediate influence of film-mediated 
aggression, the full effects of such exposure 
may not be revealed until subjects are in- 
stigated to aggression on a later occasion. 
Thus, the present study, and one recently 
reported by Lovaas (1961), both utilizing a 
design in which subjects first observed filmed 
aggression and then were frustrated, clearly 
reveal that observation of models portraying 
aggression on film substantially increases 
rather than decreases the probability of ag- 
gressive reactions to subsequent frustrations. 

Filmed aggression not only facilitated the
expression of aggression, but also effectively 
shaped the form of the subjects’ aggressive 
behavior. The finding that children modeled 
their behavior to some extent after the film 
characters suggests that pictorial mass media, 
particularly television, may serve as an important
source of social behavior. In fact, a
possible generalization of responses originally 
learned in the television situation to the ex- 
perimental film may account for the signifi- 
cantly greater amount of aggressive gun play 
displayed by subjects in the film condition as 
compared to subjects in the real-life and con- 
trol groups. It is unfortunate that the qualita- 
tive features of the gun behavior were not 
scored since subjects in the film condition, 
unlike those in the other two groups, devel- 
oped interesting elaborations in gun play (for 
example, stalking the imaginary opponent, 
quick drawing, and rapid firing), character- 
istic of the Western gun fighter. 

The view that the social learning of ag- 
gression through exposure to aggressive film 
content is confined to deviant children 
(Schramm, Lyle, & Parker, 1961) finds little
support in our data. The children who participated
in the experiment are by no means
a deviant sample; nevertheless, 88% of the
subjects in the Real-Life and in the Human 
Film condition, and 79% of the subjects in 
the Cartoon Film condition, exhibited varying 
degrees of imitative aggression. In assessing 
the possible influence of televised stimulation 
on viewers’ behavior, however, it is important 
to distinguish between learning and overt
performance. Although the results of the 
present experiment demonstrate that the vast 
majority of children learn patterns of social 
behavior through pictorial stimulation, never- 
theless, informal observation suggests that 
children do not, as a rule, perform indis- 
criminately the behavior of televised char- 
acters, even those they regard as highly at- 
tractive models. The replies of parents whose 
children participated in the present study to 
an open-end questionnaire item concerning 
their handling of imitative behavior suggest 
that this may be in part a function of negative 
reinforcement, as most parents were quick to 
discourage their children’s overt imitation of
television characters by prohibiting certain 
programs or by labeling the imitative behavior 
in a disapproving manner. From our knowl- 
edge of the effects of punishment on behavior, 
the responses in question would be expected 
to retain their original strength and could 
reappear on later occasions in the presence of
appropriate eliciting stimuli, particularly if
instigation is high, the instruments for aggression
are available, and the threat of noxious
consequences is reduced.

The absence of any relationships between 
ratings of the children's predisposition to ag- 
gression and their aggressive behavior in the 
experimental setting may simply reflect the 
inadequacy of the predictor measures. It may 
be pointed out, however, that the reliability 
of the ratings was relatively high. While this 
does not assure validity of the measures, it 
does at least indicate there was consistency in 
the raters’ estimates of the children’s aggres- 
sive tendencies. 

A second, and perhaps more probable, ex- 
planation is that proprioceptive feedback 
alone is not sufficient to account for response 
inhibition or facilitation. For example, the 
proprioceptive cues arising from hitting re- 
sponses directed toward parents and toward 
peers may differ little, if any; nevertheless, 
tendencies to aggress toward parents are apt 
to be strongly inhibited while peer aggression 
may be readily expressed (Bandura, 1960; 
Bandura & Walters, 1959). In most social 
interaction sequences, proprioceptive cues 
make up only a small part of the total stimulus 
complex and, therefore, it is necessary to take 
into consideration additional stimulus com- 
ponents, for the most part external, which 
probably serve as important discriminative 
cues for the expression of aggression. Con- 
sequently, prediction of the occurrence or in- 
hibition of specific classes of responses would 
be expected to depend upon the presence of a 
certain pattern of proprioceptive or introcep- 
tive stimulation together with relevant dis- 
criminative external stimuli. 

According to this line of reasoning, failure 
to obtain the expected positive relationships 
between the measures of aggression may be 
due primarily to the fact that permissiveness 
for aggression, conveyed by situational cues 
in the form of aggressive film content and 
play material, was sufficient to override the 
influence of internal stimuli generated by the 
commission of aggressive responses. If, in fact, 
the behavior of young children, as compared 
to that of adults, is less likely to be under 
internal stimulus control, one might expect
environmental cues to play a relatively important
role in eliciting or inhibiting aggressive
behavior.

A question may be raised as to whether the
aggressive acts studied in the present experiment
constitute "genuine" aggressive responses.
Aggression is typically defined as
behavior, the goal or intent of which is injury
to a person, or destruction of an object 
(Bandura & Walters, 1959; Dollard, Doob, 
Miller, Mowrer, & Sears, 1939; Sears, Mac- 
coby, & Levin, 1957). Since intentionality is 
not a property of behavior but primarily an 
inference concerning antecedent events, the 
categorization of an act as “aggressive” in- 
volves a consideration of both stimulus and 
mediating or terminal response events. 
According to a social learning theory of 
aggression recently proposed by Bandura and 
Walters (in press), most of the responses 
utilized to hurt or to injure others (for ex- 
ample, striking, kicking, and other responses 
of high magnitude), are probably learned for 
prosocial purposes under nonfrustration conditions.
Since frustration generally elicits
responses of high magnitude, the latter classes 
of responses, once acquired, may be called out 
in social interactions for the purpose of injur- 
ing others. On the basis of this theory it would 
be predicted that the aggressive responses 
acquired imitatively, while not necessarily 
mediating aggressive goals in the experimental 
situation, would be utilized to serve such 
purposes in other social settings with higher 
frequency by children in the experimental 
conditions than by children in the control 
group.
The present study involved primarily vicar- 
ious or empathic learning (Mowrer, 1960) 
in that subjects acquired a relatively complex 
repertoire of aggressive responses by the mere 
sight of a model’s behavior. It has been generally
assumed that the necessary condition
for the occurrence of such learning is that the
model perform certain responses followed by 
positive reinforcement to the model (Hill, 
1960; Mowrer, 1960). According to this 
theory, to the extent that the observer experi- 
ences the model’s reinforcement vicariously, 
the observer will be prone to reproduce the 
model’s behavior. While there is some evidence 
from experiments involving both human 
(Lewis & Duncan, 1958; McBrearty, Marston,
& Kanfer, 1961; Sechrest, 1961) and animal 
subjects (Darby & Riopelle, 1959; Warden, 
Fjeld, & Koch, 1940), that vicarious rein- 
forcement may in fact increase the probability 
of the behavior in question, it is apparent 
from the results of the experiment reported 
in this paper that a good deal of human imita- 
tive learning can occur without any reinforcers 
delivered either to the model or to the ob- 
server. In order to test systematically the 
influence of vicarious reinforcement on imita- 
tion, however, a study is planned in which the 
degree of imitative learning will be compared 
in situations in which the model’s behavior is 
paired with reinforcement with those in which 
the model’s responses go unrewarded.

MEDIATED TRANSFER OF ATTITUDES

Attitudes can not only be acquired, but may also be transferred like verbal
conditioned responses. This was shown by associating a nonsense syllable with 
the name of a tribe, then developing a favorable or unfavorable attitude 
towards the nonsense syllable and subsequently testing whether such an at- 
titude had been transferred to the tribe. 2 nonsense syllables and 2 names 
of tribes were chosen as stimuli in a counterbalanced, sensory preconditioning 
design, requiring 4 groups of 20 Ss. A notable feature was the use of probability 
learning during the conditioning period; the probability of occurrence of Good 
or Bad after a nonsense syllable being .85. Transfer, which was successful, was 
tested in 2 ways: by requiring Ss to choose 10 out of a mixed list of 20 
favorable and unfavorable attributes for each tribe and by rating each tribe on 
a semantic differential scale of Good-Bad. 


Attitudes are evaluative verbal responses 
and, as such, should obey the general laws of 
verbal conditioned responses (CRs). Staats 
and Staats (1957) have shown how meaning 
can be acquired as a classical CR. Since at- 
titudes are one aspect of meaning, they can 
be acquired in the same manner as Staats 
and Staats (1958) showed in a later study. 
It should be possible to demonstrate that 
attitudes can be transferred just as any other 
verbal CR, and the same mediational process 
which explains acquisition can also account 
for the transfer of attitudes. In order to 
investigate the transfer of attitudes, the ap- 
propriate experimental model should be that 
used in sensory preconditioning (SPC) or 
in mediated stimulus generalization (MSG). 
The present authors regard SPC and MSG 
to be essentially similar and assume that 
these come under the broad principles of 
generalization of classical CRs. Such an assumption
is already implicit in the
studies described below. Shichko (1959) was
able to develop conditioned connections be- 
tween two or four indifferent stimuli with- 
out reinforcement and maintained that the 
“mechanism of forming conditioned reflex is, 
in the main, the same for both reinforced 
and nonreinforced, functionally connected 
stimuli.” Wyrwicka (1955) reports the fol- 
lowing phenomena which resemble the gen- 
eralization of classical CR, but could be 
viewed as SPC phenomena. Two CRs, 


S₁ → r₁ and S₂ → r₂ were established in two different
experimental situations, Σ₁ and Σ₂ respectively.
Then in the situation Σ₁ a new stimulus
was introduced and a reflex Sₓ → r₁ was formed.
When this stimulus was applied in the situation Σ₂
it evoked the response r₂ instead of the response
r₁ ... (p. 34).


From Wyrwicka’s experiment, it is perhaps 
evident that mediated transfer of attitudes 
should correspond to generalization of CRs. 
The present study is based on such a hypothesis.
It is designed to test whether or
not an attitude relevant to stimulus S can be
transferred to S₁ when S and S₁ had occurred
together for some time previous to the development
of the attitude to S.


METHOD 


General Design and Materials 


An SPC paradigm was adopted. The experiment
was divided into the following three stages: Preconditioning,
in which the stimuli S₁ and S₂ are presented
in close succession in that order; Conditioning,
in which response R₁ is developed for S₁; and Testing for
transfer of R₁ to S₂.

Two S₁ and two S₂ stimuli were chosen. The S₁
stimuli, JAGAPA and KACHADA, were nonsense syllables
and the S₂ stimuli, HO and MUNDA, were the names of
two aboriginal tribes. S₁ and S₂ were paired off in
the four possible combinations of a counterbalanced
design. Subjects had too little knowledge of the two
tribes to form any pre-experimental preference for
any of them. But all subjects knew that such tribes
were inhabitants of some jungle areas in the state.¹
In the conditioning stage, the nonsense syllables
were paired with adjectives Good and Bad, where
the probability of a Good following JAGAPA and a


1 All subjects lived in the city of Cuttack and were 
far away from tribal settlements in other parts of 
the State of Orissa.

Bad following KACHADA was .85. A counterbalanced
design permitted the presentation of Bad following
JAGAPA and Good following KACHADA with the same
probability of .85.

Each experimental group received 60 trials in
Stage 1 and 200 trials in Stage 2. The two S₁-S₂
pairs in Stage 1 were exposed, each 30 times,
according to a predetermined random schedule.
Likewise, each S₁-R₁ combination in Stage 2 received
100 trials. The third stage of the experiment
was concerned with a search for some evidence of 
transfer. Subjects were required to choose 10 at- 
tributes for each of the tribes from a randomly 
presented list of 10 favorable and 10 unfavorable 
adjectives. They were subsequently asked to rate the 
tribes on a five-point scale of good-bad. The scale 
was similar to the Semantic Differential scales 
recommended by Osgood (Osgood, Suci, & Tannen- 
baum, 1957). At the end, subjects were required to 
rate the nonsense syllables on the same scale. 

The 80 subjects were selected from Class X of 
local high schools. The average age for the class 
was 15 years. 


Procedure 


The subjects were run in groups of 20 by two 
research assistants one of whom was the junior 
author. In the first stage, the nonsense syllable-tribe 
pairs (S₁-S₂) were read out by one of the experimenters,
one by one, and the subjects were asked
to repeat after the experimenter. The two S₁-S₂
pairs were thus read out 60 times in a predetermined
random order with a probability of .5
for the occurrence of each pair. After a convenient
short interval of 5 minutes, the subjects took part
in the second stage of the experiment. The procedure,
described elsewhere (Das & Panda, in press),
was simple. Association between S₁ (JAGAPA or
KACHADA) and R₁ (Good or Bad) was learned in a
probability learning situation. Subjects found in 
front of them five decks of cards. Each deck had 
40 cards, adding up to a total of 200 cards. The 
procedure for arranging stimulus cards may be illus- 
trated by describing the cards for Experimental 
Group I. Each of the five decks for this group had 
20 cards where JAGAPA was the S₁ stimulus and another 20 where
KACHADA was the S₁ stimulus. Out of the 20 JAGAPA
cards, 17 (85%) had the combination JAGAPA-Good
and 3 (15%) had JAGAPA-Bad. This ensured the .85
probability of occurrence that was required by the
experimental design. The same procedure was followed
for cards with KACHADA; the combination of
Bad with it was found in 17 cards and Good in the
remaining 3 cards. These four types of cards in each
deck were arranged in a random order. Subjects
were asked to guess whether Good or Bad would
follow, immediately after the experimenter announced
the nonsense syllable written on the top
card. After this, the experimenter exposed and read
out the adjective on the card, thus allowing the
subjects to know whether their guess was correct.
Each subject was provided with five strips of


TABLE 1
ADJECTIVE AND RATING SCORES

                                  HO              MUNDA
Category                      Good    Bad     Good    Bad
A. Favorable adjectives
   M                          7.42    5.07    4.30    1.85
   SD                         2.5     3.92    3.84    1.81
B. Rating tribes
   M                          3.97    3.15    3.20    2.17
   SD                         .94     1.42    1.36    1.09

                                JAGAPA           KACHADA
                              Good    Bad     Good    Bad
C. Rating nonsense syllables
   M                          3.75    2.65    3.52    2.68
   SD                         1.22    1.17    1.14    1.45


paper with 20 places in each in order to record
his guesses, and was prevented from consulting a
completed strip.² At the end, the strips were collected.
After a 5-minute rest, subjects were supplied
mimeographed forms in which 20 adjectives and 
the names of the two tribes were written with the 
instruction that they choose 10 adjectives for each 
tribe. This was followed by filling out the position 
of the tribe in a five-point Semantic Differential 
scale. For rating, subjects found five headings: ex- 
tremely bad, bad, neither bad nor good, good, very 
good, under which they could decide to put the 
tribe. The same scale written on the other side of 
the form was provided to rate the two nonsense 
syllables, JAGAPA and KACHADA. 
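
The deck construction described in the procedure can be sketched in a few lines of Python. This is only an illustration of the arrangement reported for Experimental Group I (five decks of 40 cards, 17 majority and 3 minority pairings per syllable, shuffled); the function names, the seed, and the use of a pseudo-random shuffle are assumptions, since the original decks were arranged by hand.

```python
import random


def build_deck(good_syllable="JAGAPA", bad_syllable="KACHADA",
               per_syllable=20, majority=17, rng=None):
    """Return one shuffled deck of (syllable, adjective) cards.

    Hypothetical helper: 17 of the 20 cards for each syllable carry the
    majority adjective (probability .85) and 3 carry the other adjective.
    """
    rng = rng or random.Random()
    minority = per_syllable - majority
    deck = (
        [(good_syllable, "Good")] * majority
        + [(good_syllable, "Bad")] * minority
        + [(bad_syllable, "Bad")] * majority
        + [(bad_syllable, "Good")] * minority
    )
    rng.shuffle(deck)  # random order within the deck
    return deck


def build_decks(n_decks=5, seed=0):
    """Return the five decks (200 cards in all) for one group."""
    rng = random.Random(seed)
    return [build_deck(rng=rng) for _ in range(n_decks)]


decks = build_decks()
total_cards = sum(len(deck) for deck in decks)
jagapa_good = sum(deck.count(("JAGAPA", "Good")) for deck in decks)
print(total_cards, jagapa_good / 100)
```

Swapping the two syllable arguments gives the counterbalanced arrangement in which Bad follows JAGAPA and Good follows KACHADA with the same probability.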

The whole session lasted for about 2 hours at 
the end of which all subjects were rewarded with 
toffees promised to them at the start of the experi- 
ment. The subjects took the test in their mother 
tongue which was Oriya. 


RESULTS 
Transfer of Attitudes 


Table 1 gives the means and SDs of favor- 
able adjectives and the rating scores. In that 
table, HO-Good indicates that HO is expected
to be rated good since the nonsense syllable 
associated with it was later conditioned to 
Good. The other expressions, such as MUNDA- 
Bad, are identically defined. In Table 1 the 
higher scores imply a more favorable rating. 


2 Although the method of recording guesses is 
different from the usual American method where 
subjects are prevented from consulting a previous
guess, it has been found to work as well (Panda, 
in press).

TABLE 2
ANALYSIS OF VARIANCE OF ADJECTIVE SCORES

Source         SS          df     F
Good/Bad       202.500     1      19.255*
HO/MUNDA       366.025     1      34.803*
Interaction    1.600       1      <1
Residual       1640.650    156
Total          2210.775    159

Residual MSV = 10.517

* p < .001.


Figures in Portion A stand for the number of 
favorable attributes chosen by the subjects. 
Three separate analyses of variance were per- 
formed corresponding to data in Portions A, 
B, and C. Their results are presented in 
Tables 2, 3, and 4. 
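
As a check on how the F ratios in these tables are formed, the following sketch recomputes them from the sums of squares and degrees of freedom printed in Table 2. It assumes that each effect is tested against the residual mean square, which is the usual arrangement for a design of this kind; the function and variable names are illustrative only.

```python
def f_ratio(ss_effect, df_effect, ss_residual, df_residual):
    """Return F = (SS_effect / df_effect) / (SS_residual / df_residual)."""
    return (ss_effect / df_effect) / (ss_residual / df_residual)


# Sums of squares and degrees of freedom as printed in Table 2.
EFFECTS = {
    "Good/Bad": (202.500, 1),
    "HO/MUNDA": (366.025, 1),
    "Interaction": (1.600, 1),
}
RESIDUAL_SS, RESIDUAL_DF = 1640.650, 156

print(f"Residual mean square = {RESIDUAL_SS / RESIDUAL_DF:.3f}")
for name, (ss, df) in EFFECTS.items():
    print(f"{name}: F = {f_ratio(ss, df, RESIDUAL_SS, RESIDUAL_DF):.3f}")
```

Under this assumption the two main-effect ratios agree with the printed values to three decimals, and the interaction ratio falls well below 1.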

The analysis of adjective scores (Table 2) 
reveals two significant main effects, Good/Bad 
and HO/MUNDA. Good indicates that a greater
number of favorable adjectives was expected 
for the tribe which was paired with the non- 
sense syllable associated with Good. Bad is 
similarly defined. Thus the highly significant 
Good/Bad main effect (F = 19.25, df =
1/156, p < .001) implies that a tribe was
ascribed favorable or unfavorable attributes 
in accordance with its nonsense syllable. This 
is a clear proof of a transfer of attitudes from 
nonsense syllables to the tribes. The second 
main effect indicates pre-experimental prefer- 
ence for one of the tribes, Ho. Perhaps the 
word HO is more pleasant than the word
MUNDA. Significant interaction between the 
two main effects was not expected; therefore 
the negligible interaction obtained is in order. 
Table 3 presents the analysis of scores on the 
five-point good-bad rating scale. 


TABLE 4
ANALYSIS OF VARIANCE OF RATING SCORES FOR
THE NONSENSE SYLLABLES

Source            SS        df     F
Good/Bad          38.02     1      23.63*
JAGAPA/KACHADA    0.40      1      <1
Interaction       0.63      1      <1
Residual          251.35    156
Total             290.40    159

Residual MSV = 1.611

* p < .001.


Consistent with the analysis of adjective
scores, the same two highly significant (p <
.001) main effects are obtained here. The
interaction is again not significant. The least
indirect transfer would be the rating of the
two nonsense syllables according to their association
with Good and Bad. Table 4 shows,
as expected, a highly significant main effect
(p < .001) for Good/Bad. The other main
effect is very small and shows that no extra-experimental
preference existed for any
of the nonsense syllables.


Two-Choice Learning 


The mean percentages of prediction of the
more frequently occurring event are shown
in Table 5. Scores are presented in five successive
20-trial blocks. An inspection of Table
5 reveals that only one of the four terminal
predictions reaches the stimulus probability
level of .85. Clearly, the learning curves that
may be drawn from these scores will not be
asymptotic. More trials would be required to
obtain the asymptotic probability level as has
been done in comparable studies (Grant,
Hake, & Hornseth, 1951; Popper & Atkinson,


TABLE 3
ANALYSIS OF VARIANCE OF RATING SCORES FOR TRIBES

Source         SS             df     F
Good/Bad       34.25          1      22.41*
HO/MUNDA       (illegible)    1      20.056*
Interaction    (illegible)    1      <1
Residual       (illegible)
Total          (illegible)

Residual MSV = 1.527

* p < .001.


TABLE 5
MEAN PERCENTAGE OF PREDICTION PER 20-TRIAL BLOCKS

                           Trial blocks
Stimuli            1        2        3        4        5
JAGAPA-Good      59.25    73.50    72.12    77.50    82.8
KACHADA-Bad      57.25    71.50    76.75    76.37    84.75
JAGAPA-Bad       56.75    69.25    73.87    78.25    80.37
KACHADA-Good     54.00    67.87    78.37    78.00    85.00




TABLE 6
SUCCESSIVE RUNS OF E₁ PREDICTIONS IN PERCENTAGES

20-trial blocks    0E₁      1E₁      2E₁      3E₁      4E₁      5E₁      6E₁
1                 60.60    63.15    65.65    65.25    63.75    71.25    51.85
2                 66.85    80.65    78.10    75.60    78.10    76.25    69.85
3                 68.10    78.75    78.10    78.10    76.85    78.10    80.00
4                 81.25    81.85    89.35    88.75    86.25    86.25    80.00
5                 81.25    81.85    90.00    88.10    89.35    86.25    80.65


1958). An overall view of Table 5 shows that
learning proceeds steadily. This is also apparent
from the analysis of runs shown in
Table 6. In order to prepare the run scores,
the more frequently occurring event (E₁,
π₁ = .85) was scheduled in such a way during
stimulus presentation that in each trial
block, it had an uninterrupted run of at least
seven occurrences. Let Xᵢ stand for the prediction
of E₁ on the ith run where i = 0, 1, 2,
3, 4, 5, 6. Xᵢ for each 20-trial block was obtained
and converted into a percentage, which is presented
in Table 6. Thus 0E₁ is an E₁ trial
which is preceded by E₂; 1E₁ is an E₁ trial
preceded by one E₁; Trial 2E₁ likewise has
been preceded by two successive E₁ trials.
Besides indicating a general increment in E₁
predictions with increasing trial blocks, the
data also show some negative recency effect
as obtained elsewhere (cf. Anderson &
Whalen, 1960).
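
The run scores defined above can be illustrated with a short Python sketch. It classifies each E₁ trial by the number of E₁ trials that immediately precede it (0 after an E₂, 1 after one E₁, and so on) and reports the percentage of E₁ predictions at each run position. The function name, the event labels, and the rule of skipping trials that occur before the first E₂ are assumptions made for the sketch; the toy sequences are invented and are not data from the study.

```python
def run_percentages(events, predictions, max_run=6):
    """Percentage of E1 predictions on E1 trials, by run position 0..max_run.

    events and predictions are equal-length lists of "E1"/"E2" labels.
    Run position i means the E1 trial was preceded by i successive E1 trials.
    Positions with no trials are reported as None.
    """
    hits = [0] * (max_run + 1)
    totals = [0] * (max_run + 1)
    preceding = None  # unknown until the first E2 has been seen
    for event, predicted in zip(events, predictions):
        if event == "E1":
            if preceding is not None and preceding <= max_run:
                totals[preceding] += 1
                hits[preceding] += predicted == "E1"
            if preceding is not None:
                preceding += 1
        else:
            preceding = 0  # an E2 trial resets the run
    return [100 * h / t if t else None for h, t in zip(hits, totals)]


# Invented toy sequence, for illustration only.
events = ["E2", "E1", "E1", "E1", "E2", "E1"]
predictions = ["E1", "E1", "E2", "E1", "E1", "E2"]
print(run_percentages(events, predictions))
```

A negative recency effect would appear in such output as a drop in the percentage of E₁ predictions at the longest run positions.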


DISCUSSION 


The results indicate that word meaning 
will condition to contiguously presented verbal 
stimuli as found by Staats and Staats (1958) 
and, once conditioned, it can be generalized 
or transferred. The extent of this transfer is
quite great, since not only the tribes received 
consistent ratings of good or bad, but favor- 
able and unfavorable adjectives were selected 
for the tribes because their corresponding 
nonsense syllables had acquired goodness or 
badness. Since subjects were asked to choose 
adjectives for the tribes prior to rating the 
tribes on the good-bad continuum, their pre- 
dicted choice of adjectives is an unequivocal 
proof of mediated transfer of attitudes. The 
results are also significant as perhaps one of 


the first instances where sensory precondition- 
ing is successfully demonstrated using socially 
meaningful verbal stimuli; and this has 
been possible even with a partial 85% re- 
inforcement. In usual practice, the two initial 
stimuli (cf. Light and Buzzer in animal 
experiments) are presented simultaneously in 
sensory preconditioning paradigms. Here, one 
stimulus followed the other, thus showing 
that simultaneity of stimulus occurrence is 
not a necessary condition for sensory pre- 
conditioning. Incidentally, the present study 
shows that responses acquired through sen- 
sory preconditioning are similar to condi- 
tioned responses, to the extent that these can 
be generalized or transferred. This is addi- 
tional evidence in support of a basic similar- 
ity between conditioning and sensory pre- 
conditioning, an issue which has engaged 
the attention of some researchers (Hoffeld, 
Kendall, Thompson, & Brogden, 1960).

ALCOHOLISM, BIRTH ORDER, AND FAMILY SIZE¹


REGINALD G. SMART²
Alcoholism and Drug Addiction Research Foundation, Toronto 


On the basis of Schachter's findings relating affiliation during anxiety to birth 
order it was predicted that later borns and persons from large families should 
be overrepresented among alcoholics and that the number of alcoholics within 
each family size should increase with increasing birth order. First borns were 
expected to have significantly more therapeutic contacts than later borns. 
It was pointed out that Schachter's analyses, based on Bakan's data, lacked, 
among other things, a correction for family size. When this correction was 
employed with birth order and family size data from a sample of 242 treated 
alcoholics, the only hypothesis supported was that relating to an over- 
representation of persons from large families. 


A recent volume by Schachter (1959), The 
Psychology of Affiliation, reports a series of 
investigations of the relationships between 
birth order and family size, and behavior 
during anxiety. Under conditions of experi- 
mentally induced anxiety, Schachter found 
that first born and only children are con- 
siderably more likely to want to be with other 
people than are later born. He also found 
that a significantly smaller number of first 
and later borns from large families chose to 
affiliate during anxiety than did those from 
smaller families (number of children—1, 2, 
or 3). For every family size he found a con- 
stant and sizable decrease in the proportion 
of subjects choosing to be together in anxiety 
situations. 

Schachter (1959) also attempted to deter- 
mine whether the relationships between affili- 
ative tendencies and ordinal position held for 
“real life situations" (p. 62). The finding 
that later born subjects did not affiliate 
during anxiety suggested to Schachter that
among alcoholics, later borns should be overrepresented
and early borns underrepresented,
since an alcoholic is “a troubled person who 
is not strongly inclined to handle his difficul- 
ties by social means and seeks to dispose of 
his problems by drinking” (p. 62). In an 
empirical test of this hypothesis Schachter 
has reanalyzed the data on the ordinal posi- 


1 A modified version of this paper was presented
at the June 1961 meeting of the Canadian Psychological
Association.

2 The author wishes to thank R. J. Gibbins for
his many suggestions and for his comments on the
manuscript.

tion of 1,493 alcoholics reported by Bakan
(1949). The conclusion was reached that 
“later borns are disproportionately repre- 
sented among chronic alcoholics” (Schachter, 
1959, p. 63). However, several considera- 
tions make this conclusion suspect. Although 
Bakan identified his sample as “alcoholic”
males this group actually consisted of per- 
sons who had been convicted of crimes in- 
volving the misuse of alcohol, chiefly “public 
intoxication.” The extent to which their 
drinking fitted the usual definitions of alcohol- 
ism had not been determined. Furthermore, 
Bakan reported birth orders without family 
sizes and Schachter has assumed that children 
born in Birth Orders 1, 2, and 3 are “early” 
born and that those in Birth Orders 4 and up 
are “later” born. Without reference to the 
size of his sibship it would be impossible to 
say whether a person born second or third 
was “early” or “later” born. 

Finally, Schachter’s (1959) analyses do
not contain a correction for family size such 
as the Greenwood-Yule correction proposed 
by Gregory (1958). The necessity of this 
correction depends upon the fact that the 
probability of meeting an alcoholic from a 
particular sibship size is proportionate to the 
size of the sibship. 

The purpose of the present study was to 
retest Schachter’s (1959) hypothesis about 
alcoholism and birth order with corrected 
birth order and family size data from a well 
defined group of alcoholics. Efforts were also 
made to test several other hypotheses derived 
from Schachter’s experimental findings. For
example, if a smaller number of first and
later borns from large families than from
smaller chose to affiliate during anxiety, it 
might be expected that large families con- 
tribute disproportionately to the alcoholic 
population. Since for every family size there 
was a decrease, with increasing birth rank, 
in the proportion of subjects choosing to 
affiliate during anxiety it might also be 
expected that the number of alcoholics, in 
each family size, would increase with increas- 
ing birth rank. Schachter also found some 
support for the hypothesis that when exposed 
to psychotherapy, first borns are more likely 
to become involved in therapy than are later 
borns. 

The following specific hypotheses were, 
therefore, formulated. A sample of alcoholics 
exposed to treatment for alcoholism will: 

1. contain a significantly larger number of 
later borns (i.e., persons not first born), and
a significantly smaller number of first borns 
than expected 

2. show, for each family size, an increasing 
number of alcoholics with increasing ordinal 
position 

3. contain a significantly larger number of 
persons from large families than expected 

4. contain a larger number of first borns 
and a smaller number of later borns than 
expected who maintain frequent psychothera- 
peutic contacts. 


METHOD 


Sample 


A sample of 242 alcoholics was obtained from 
three alcoholism clinics in Ontario. This sample com- 
prised three subgroups of 118, 72, and 52 alcoholics. 
From two of the alcoholism clinics consecutive first 
admissions during a 6-month period were obtained. 
In the case of the third clinic this method could not 
be used. However, a group of 118 alcoholics was 
obtained by drawing every tenth name in an alpha- 
betical listing of all former patients. Each of the 
alcoholics exhibited symptoms of alcoholism such 
as loss of control of drinking or withdrawal states 
when drinking was terminated. The ratio of males 
to females was 10:1 which approximates the 8:1 
ratio found in a survey of all alcoholics in an 
Ontario County (Gibbins, 1954). 


Hypothesis 1 


For each of the 242 alcoholics data on birth 
order and family size were obtained. In determining 
the expected number of alcoholics for each birth
rank (Table 1) it was assumed that if there were 


no relationship between birth rank and alcoholism 
then the observed number of alcoholics for each 
family size would be evenly distributed over the 
birth ranks for that family size. 

The necessity of correcting sibship data for family 
size arises from the fact that the probability of 
encountering a member from any particular sibship 
depends upon the size of that sibship. As stated 
by Gregory (1958), 


if there were equal numbers of sibships contain- 
ing X members, and of sibships containing only 
one member, the chances of meeting individuals 
from sibships containing X members would be 
X times the chances of meeting only children 
(provided the sample is small in comparison with 
the total population). In order to obtain the 
approximately unbiased frequencies of family size 
that would have been obtained by taking a sample 
of "mothers," it is, therefore, necessary to divide 
the biased frequencies obtained through members 
of the sibships ($F_x$), by the number of individuals 
in each sibship ($X$) (p. 45). 


Table 2 shows the Greenwood-Yule transformation 
of the birth order and family size data; Columns 
1 and 2 show $X$ and $F_x$, respectively. The biased 
mean size of family is given by $\sum (X F_x) / \sum F_x$, and 
the corrected mean family size is obtained by 
substituting $F_x/X$ for $F_x$ in this formula. The 
distribution $F_x/X$ must be used for making comparisons 
between sizes of families from which the alcoholics 
came with the sizes of families in the general 
population. The $F_x/X$ distribution must also be used in 
determining the corrected birth order data and this 
is given under Column 4 in Table 2 as $E_x$ (obtained 
by summing $F_x/X$ from the bottom upwards). 
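
The transformation just described can be retraced numerically. The sketch below is a minimal illustration and not the authors' own computation: it assumes the sibship frequencies implied by the column totals of Table 1 (21, 32, 40, 47, 29, 23, 20, 11, 10, 3, 3, 3 for sibship sizes 1 to 12), and the names `greenwood_yule`, `corrected`, and `expected` are hypothetical.

```python
# Sibship size X -> number of sampled alcoholics F_x from sibships of that size.
# Assumption: values taken from the column totals of Table 1.
F = {1: 21, 2: 32, 3: 40, 4: 47, 5: 29, 6: 23,
     7: 20, 8: 11, 9: 10, 10: 3, 11: 3, 12: 3}


def greenwood_yule(freq):
    """Return (F_x / X for each size, cumulative E_x for each birth rank)."""
    sizes = sorted(freq)
    # Dividing by X removes the bias toward large sibships.
    corrected = {x: freq[x] / x for x in sizes}
    # E_x is obtained by summing F_x / X from the bottom upwards.
    expected = {}
    running = 0.0
    for x in reversed(sizes):
        running += corrected[x]
        expected[x] = running
    return corrected, expected


corrected, expected = greenwood_yule(F)
n = sum(F.values())

# Biased mean family size: sum(X * F_x) / sum(F_x).
biased_mean = sum(x * f for x, f in F.items()) / n
# Corrected mean: the same formula with F_x / X substituted for F_x.
corrected_mean = (sum(x * c for x, c in corrected.items())
                  / sum(corrected.values()))

for rank in sorted(expected):
    print(rank, round(corrected[rank], 2), round(expected[rank], 2))
print(round(biased_mean, 2), round(corrected_mean, 2))
```

Under these assumptions the cumulative value for Birth Rank 1 comes to about 77.88, which agrees with the total of the corrected frequencies reported in Table 2.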


Hypothesis 2 


The observed number of alcoholics in Birth 
Rank 1 was taken as a base and for each observed 
frequency which was larger than that in the next 
lowest birth rank (for that family size) a plus was 
assigned (Table 1). For each observed frequency 
which was smaller than that in the next lowest birth 
rank a minus was assigned; where there was no 
difference a zero was assigned. If the number of 
alcoholics tended (within each family) to increase 
with increasing ordinal position then the number 
of pluses should be significantly greater than the 
number of minuses. 
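
The sign assignment can be illustrated on a single column. The sketch below is only an illustration under stated assumptions: the function name `sign_sequence` is hypothetical, and the input is the observed column for family size 7 as read from Table 1 (5, 2, 3, 4, 1, 3, 2 for Birth Ranks 1 to 7).

```python
def sign_sequence(observed):
    """Compare each birth rank with the rank just below it within one family size."""
    signs = []
    for lower_rank, higher_rank in zip(observed, observed[1:]):
        if higher_rank > lower_rank:
            signs.append("+")   # more alcoholics than in the preceding rank
        elif higher_rank < lower_rank:
            signs.append("-")   # fewer alcoholics than in the preceding rank
        else:
            signs.append("0")   # no difference
    return signs


# Observed frequencies for family size 7 (assumed reading of Table 1).
family_size_7 = [5, 2, 3, 4, 1, 3, 2]
signs = sign_sequence(family_size_7)
pluses = signs.count("+")
minuses = signs.count("-")
print(signs, pluses, minuses)
```

Repeating this for every family size and pooling the counts gives the totals of pluses and minuses that the hypothesis compares.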


Hypothesis 3 


The expected percentages of alcoholics from 
various family sizes were obtained in the following 
manner. In order to test the hypothesis that an 
alcoholic population would contain a disproportion- 
ately large number of persons from large families 
it was necessary to determine the distribution of 
family sizes for the general population. Moreover, 
it was necessary to determine this distribution for 
the time at which the alcoholics were living, as 
children, with their families of origin. It was therefore 
decided to base the expected frequencies on the 
1931 census for Ontario² (Dominion Bureau of 
Statistics, 1931), which reports the number of children 
in each family (Table 96, p. 1314). From these data 
the expected frequencies shown in Table 4 were 
computed. 

It was thought that factors related to nationality 
and religious affiliation might, independently of 
childhood experiences, affect both family size and 
susceptibility to alcoholism. There was some 
expectation that Roman Catholics and persons born in 
Quebec might be from larger families and also 
more susceptible to alcoholism. The family size data 
were reorganized as to religious affiliation and 
birthplace, and comparisons were made with the general 
population of Ontario (Dominion Bureau of 
Statistics, 1951). 


² The 1931 census is perhaps the best choice 
among all possible census years. Since the mean age 
of the alcoholic sample is 43.00 (SD = 11.95), the 
distribution of family sizes in 1931 should give the 
best approximation to the population from which 
the alcoholic family sizes were drawn. 


TABLE 1 


DISTRIBUTIONS OF BIRTH RANK AND FAMILY SIZE FOR THE SAMPLE OF 242 ALCOHOLICS AND THE EXPECTED 
FREQUENCIES OF ALCOHOLICS WITHIN EACH FAMILY SIZE 


Birth                                                  Family size 
rank | 1  | 2      | 3        | 4         | 5      | 6       | 7       | 8       | 9       | 10     | 11     | 12     | Total 
1    | 21 | 22(16) | 17(13.3) | 11(11.75) | 7(5.8) | 5(3.83) | 5(2.86) | 1(1.38) | 2(1.11) | 0(.30) | 0(.27) | 0(.25) | 91(70)ᵃ 
2    |    | 10(16) | 14(13.3) | 10(11.75) | 6(5.8) | 2(3.83) | 2(2.86) | 2(1.38) | 2(1.11) | 0(.30) | 0(.27) | 0(.25) | 48 
3    |    |        | 9(13.3)  | 13(11.75) | 3(5.8) | 6(3.83) | 3(2.86) | 2(1.38) | 0(1.11) | 1(.30) | 1(.27) | 0(.25) | 38 
4    |    |        |          | 13(11.75) | 9(5.8) | 3(3.83) | 4(2.86) | 2(1.38) | 0(1.11) | 0(.30) | 0(.27) | 1(.25) | 32 
5    |    |        |          |           | 4(5.8) | 2(3.83) | 1(2.86) | 1(1.38) | 2(1.11) | 0(.30) | 0(.27) | 0(.25) | 10 
6    |    |        |          |           |        | 5(3.83) | 3(2.86) | 0(1.38) | 2(1.11) | 1(.30) | 1(.27) | 0(.25) | 12 
7    |    |        |          |           |        |         | 2(2.86) | 1(1.38) | 1(1.11) | 0(.30) | 0(.27) | 0(.25) | 4 
8    |    |        |          |           |        |         |         | 2(1.38) | 0(1.11) | 1(.30) | 1(.27) | 0(.25) | 
9    |    |        |          |           |        |         |         |         | 1(1.11) | 0(.30) | 0(.27) | 0(.25) | 
10   |    |        |          |           |        |         |         |         |         | 0(.30) | 0(.27) | 2(.25) | 7ᵇ 
11   |    |        |          |           |        |         |         |         |         |        | 0(.27) | 0(.25) | 
12   |    |        |          |           |        |         |         |         |         |        |        | 0(.25) | 


ᵃ Only children (for whom no expected values are available) were excluded from the chi square test. 
ᵇ Birth Ranks 8-12 were combined to yield expected values greater than 5. 


Hypothesis 4 


Data relating to therapeutic contacts and length 
of therapy could not be obtained for all alcoholics 
in the sample. However, it was possible to obtain 
the data on the frequency of their psychotherapeutic 
contacts for each of the 72 patients obtained from 
one clinic. It should be noted here that all these 
patients were offered psychotherapeutic help in refer- 
ence to their problems of addiction. However, no 
effort was made to coerce patients into therapy nor 


TABLE 2 


GREENWOOD-YULE RECONSTRUCTION OF SIBSHIPS OF 242 ALCOHOLICS AND THE CHI SQUARE TEST OF THE 
SIGNIFICANCE OF THE DIFFERENCES BETWEEN THE OBSERVED AND EXPECTED (CORRECTED) 
SIBSHIP FREQUENCIES 


              | Frequency |         | Expected frequency     | Observed frequency     | 
X             | (F_x)     | F_x/X   | each birth order (E_x) | each birth order (O_x) | (O_x - E_x)²/E_x 
1             | 21        | 21.00   | 77.88                  | 91                     | 2.21 
2             | 32        | 16.00   | 56.88                  | 48                     | 1.39 
3             | 40        | 13.33   | 40.88                  | 38                     | 0.20 
4             | 47        | 11.75   | 27.55                  | 32                     | 0.72 
5             | 29        | 5.80    | 15.80                  | 10                     | 2.13 
6             | 23        | 3.83    | 10.00                  | 12                     | 0.40 
7             | 20        | 2.86    | 6.17                   | 4                      | 0.76 
8             | 11        | 1.38    | 3.31                   |                        | 
9             | 10        | 1.11    | 1.93                   |                        | 
10            | 3         | 0.30    | 0.82                   |                        | 
11            | 3         | 0.27    | 0.52                   |                        | 
12            | 3         | 0.25    | 0.25                   |                        | 
8-12 combined |           |         | 6.83                   | 7                      | 0.0042 
Total         | 242       | 77.88   | 241.99                 | 242                    | 7.81 


Note. Goodness of fit between expected and observed frequencies of birth order: χ² = 7.81, df = 7, .30 < p < .50. 
TABLE 3 


OBSERVED AND EXPECTED NUMBERS OF PLUSES AND 
MINUSES ASSIGNED TO THE CELLS IN TABLE 1 


                      Pluses     Minuses 
Observed (Expected)   19 (21)    23 (21) 


to discourage those who wished to become involved 
in psychotherapy. 

In accordance with Schachter’s (1959, p. 69) 
analysis those alcoholics who had only one psycho- 
therapeutic contact were termed “nonaccepters,” 
those who had two or three contacts “token accept- 
ers,” and those with more than three “accepters.” 


RESULTS 
Hypothesis 1 


The observed and expected numbers of 
alcoholics in each birth rank, for each family 
size, are shown in Table 2. 

It can be seen from this table that the chi 
square test of goodness of fit was not significant 
(χ² = 7.81, df = 7, .30 < p < .50) and that there 
are no significant differences between the 
observed and expected frequencies in any of 
the birth ranks.* The hypotheses relating to 
an overrepresentation of later borns and an 
underrepresentation of early borns therefore 
received no support. 
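
The goodness-of-fit statistic can be recomputed from the birth-order frequencies. The sketch below is an illustration under stated assumptions, not the authors' procedure verbatim: the observed and expected values are read from Table 2, two expected values (27.55 for Birth Rank 4 and 6.17 for Birth Rank 7) are inferred from the bottom-up summation rule because the printed cells are hard to read, and the function name `chi_square` is hypothetical.

```python
# Observed and expected (corrected) frequencies for Birth Ranks 1-7 and the
# combined category 8-12, as read from Table 2 (see assumptions in the text).
observed = [91, 48, 38, 32, 10, 12, 4, 7]
expected = [77.88, 56.88, 40.88, 27.55, 15.80, 10.00, 6.17, 6.83]


def chi_square(obs, exp):
    """Pearson chi square: sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(obs, exp))


chi2 = chi_square(observed, expected)
df = len(observed) - 1  # eight categories, one constraint on the total

print(round(chi2, 2), df)
```

With these inputs the statistic agrees with the reported value of 7.81 on 7 degrees of freedom; the largest single contribution comes from Birth Rank 1.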


Hypothesis 2 


Table 3 shows the total number of pluses 
and minuses assigned to the birth rank cells 
in Table 1. 

If there were a tendency for the num- 
ber of alcoholics within each family size to 
increase with increasing birth order the 
number of pluses should exceed the number 
of minuses. Since there are slightly more 
minuses the hypothesis of increasing numbers 
of alcoholics with increasing birth order can 
be rejected without a statistical test. 


* An article by Keeping (1952) has indicated that 
the chi square test might, in some cases, underestimate 
the significance of the differences between 
observed and expected birth order frequencies. For 
this reason an alternative method, suggested by 
Keeping (p. 113) and employing the normal deviate, 
was also computed. This test gave a normal deviate 
of 1.79 which failed to achieve significance (p > .08) 
and thereby corroborated the test using the 
Greenwood-Yule correction. 


Hypothesis 3 


The data relating to the observed and ex- 
pected frequencies of alcoholics from each 
sibship are shown in Table 4. If we accept 
the distribution of sibship sizes in 1931 as 
the population from which the alcoholic sib- 
ship sizes were drawn we would expect, under 
the null hypothesis, to find no significant 
differences between the observed and expected 
frequencies of alcoholics in the various sib- 
ship sizes. The differences were tested by 
means of chi square tests and Sibship Sizes 5 
and over were combined to provide an ex- 
pected frequency greater than 5. It can be 
seen from Table 4 that the observed percent- 
age of alcoholics is greater than expected 
only for the combined category of Sibship 
Sizes 5 and over (χ² = 4.54, p < .05). The 
percentage of alcoholics who come from large 
families is significantly greater than expected. 

Data on nationality and religious affilia- 
tion are shown in Tables 5 and 6, respec- 
tively. 

It can be seen from Table 5 that the sample 
studied is representative of the Ontario 
population with regard to nationality. The 
sole discrepancies appear to be the under- 
representation of persons from Europe and 
overrepresentation of persons from the United 
States but the frequencies are too small to 
warrant further analysis. Nationality, then, 


TABLE 4 


THE OBSERVED AND EXPECTEDᵃ FREQUENCIES OF 
ALCOHOLICS COMING FROM SIBSHIPS OF VARIOUS 
SIZES AND THE CHI SQUARE TEST OF THE 
DIFFERENCES BETWEEN THE OBSERVED AND EXPECTED VALUES 


Size of sibship (i.e.,     | Observed (corrected) | Expected  | 
total number of children)  | frequency            | frequency | (O - E)²/E 
1                          | 21.00                | 27.6      | 1.58 
2                          | 16.00                | 20.7      | 1.07 
3                          | 13.33                | 12.7      | .03 
4                          | 11.75                | 7.5       | 2.41 
5 and overᵇ                | 15.80                | 9.3       | 4.54* 
Total                      | 77.88                | 77.80     | 9.63 


Note. Goodness of fit between observed and expected 
frequencies of family size: χ² = 9.63, df = 4, p < .05. 
ᵃ Based on the frequency distribution of sibship sizes in 1931, Ontario. 
ᵇ Sibship Sizes 5-12 were combined to yield expected values 
greater than 5. 
TABLE 5 
BIRTH ORDER AND FAMILY SIZE IN RELATION TO PLACE OF BIRTH 
Family size Percentage 
Birthplace Percentage| in Ontario 
1|]2]1s[|4|s|6]|7]|8]|9 |10]|11 | 12 | Total Auc 
Canada 14 | 23 |28 | 31 | 20 | 17 | 13] 10| 5 | 3 | 2 | 2 | 168 78.5 78.2 
(except Quebec) 
Quebec 2 3 1 — 7 3.3 3.4 
British Isles oye sy) 2| 1| 2|—| 1 |—|—|—| 19 8.9 94 
United States 1|j4]12|3|2|—|—| 1] —|—1—|—| 8 6.1 1.6 
Europe E A E eel Pali "| ETA ea rch ba ta 1.9 7.0 
Asia 1 — 1 As) 2 
Other -— 1 —|1 = 2 9 2 
Total 17 | 30 | 37 | 42 | 25] 20]}17|11| 8 | 3 | 2 | 2 | 214 | 100.1 
Not recorded 28 


appears to be an unimportant variable with 
regard to alcoholism and family size. 

The breakdown for religion (Table 6) 
shows that Roman Catholics are slightly 
underrepresented and Protestants slightly 
overrepresented in the sample. However, an 
inspection of the table indicates that 27.4% 
of the alcoholics from larger families (5 and 
over) but only 16.3% of those from smaller 
families (1-4) are Roman Catholic. It may 
be that the finding relating to large families 
is explainable partly as an artifact of the 
high proportion of Catholics from large 
families. Unfortunately it has proven impos- 
sible to obtain birth order and family size 
data for the Ontario population. If some 
such data were obtainable it would be pos- 
sible to determine whether Catholics from 
large families are, in fact, overrepresented 


among alcoholics. Since the sample contains 
a high proportion of Catholics from large 
families the overrepresentation of persons 
from large families may result merely from 
alcoholics being drawn mainly from the lower 
classes (Gibbins, 1954 unpublished socio- 
economic data). 


Hypothesis 4 


Table 7 shows the percentages and fre- 
quencies of nonaccepters, token accepters, 
and accepters among only children, first 
born, and later born alcoholics. Following 
Schachter's (1959) procedure the data were 
analyzed in a 2 × 2 table (Table 8). Chi 
square for this table is 1.93 (for 1 df, 
p < .20), and it is concluded that first and 
later borns do not differ significantly in their 
likelihood of becoming involved in psychotherapy. 
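
The 2 × 2 analysis can be retraced from the cell counts. The sketch below is a minimal illustration, assuming the frequencies printed in Table 8 (9 and 25 in the first row, 16 and 22 in the second) and an uncorrected Pearson chi square; the function names are hypothetical, and the source does not say whether a continuity correction was applied.

```python
# Rows: the two involvement categories of Table 8.
# Columns: first born and only children, later born.
table = [[9, 25],
         [16, 22]]


def expected_counts(cells):
    """Expected cell counts from the row and column totals."""
    row_totals = [sum(row) for row in cells]
    col_totals = [sum(col) for col in zip(*cells)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_2x2(cells):
    """Uncorrected Pearson chi square for a contingency table."""
    exp = expected_counts(cells)
    return sum((o - e) ** 2 / e
               for obs_row, exp_row in zip(cells, exp)
               for o, e in zip(obs_row, exp_row))


exp = expected_counts(table)
chi2 = chi_square_2x2(table)
print([[round(e, 1) for e in row] for row in exp], round(chi2, 3))
```

The expected counts round to 11.8, 22.2, 13.2, and 24.8, as printed in Table 8, and the statistic falls within rounding of the reported 1.93, well short of the 3.84 needed for significance at the .05 level with 1 df.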


TABLE 6 
BIRTH ORDER AND FAMILY SIZE IN RELATION TO RELIGIOUS AFFILIATION 
Family size 
Religion EHE LI UL 
i|b2las bals hefa |s| o |10 |11 |12] Tota none 

Catholic K ee 75H40. ada $5 72.171 EE S E 47 20.9 24.8 
Protestant | 12 | 25 | 30] 36| 22 | 16 | 12| 5| 7 | 2 | 1 1 169 75.5 71.3 
Jewish —|1 —|— 1 —L.2 4 9 d 
Other gister 2 =| 6 12 2.7 3.0 

Total 17 | 30 | 36 |46 | 27 | 22 |18| 11) 9 | 3 | 2 | 11 | 232 

. 

Not recorded 20 e 
TABLE 7 


PERCENTAGES OF ONLY CHILDREN, FIRST, AND LATER 
BORN ALCOHOLICS (N = 72) WHO WERE 
NONACCEPTERS, TOKEN ACCEPTERS, OR 
ACCEPTERS OF PSYCHOTHERAPY 


Category        | Only children | First born | Later born 
Nonaccepters    | 16.7 (1)      | 11.1 (2)   | 31.9 (15) 
Token accepters | 50.0 (3)      | 16.7 (3)   | 21.2 (10) 
Accepters       | 33.3 (2)      | 72.2 (14)  | 46.9 (22) 
Total           | 100 (6)       | 100 (19)   | 100 (47) 


DISCUSSION 


Only one of the hypotheses derived from 
Schachter’s (1959) work received empirical 
support in the present study. Later borns 
appear not to be overrepresented among alco- 
holics, and the hypothesized increase in 
representation with increased birth order was 
not found. However, the number of alcoholics 
from large families was significantly greater 
than expected. Lastly, first born and only 
children alcoholics do not become involved in 
psychotherapy more frequently than later 
borns. 

The finding that later borns are not over- 
represented among alcoholics is somewhat 
surprising in view of earlier studies. Aside 
from Schachter's (1959) work, studies by 
Navratil (1959), Deshaies (1951), and 
Martensen-Larsen (1957) have found an 
overrepresentation of later borns in alcoholic 
groups. An explanation for this discrepancy 
may lie in the differences between the alco- 
holic populations studied; but the earlier 
authors present so little data descriptive of 
these populations that specific biases cannot 


TABLE 8 


CHI SQUAREᵃ TABLE FOR THE SIGNIFICANCE OF THE 
DIFFERENCES BETWEEN THE FREQUENCIES OF 
EARLY AND LATER BORN ALCOHOLICS 
(N = 72) INVOLVED IN AND NOT 
INVOLVED IN PSYCHOTHERAPY 


               | First born and |            | 
Category       | only children  | Later born | Total 
Involved       | 9 (11.8)       | 25 (22.2)  | 34 
Not involved   | 16 (13.2)      | 22 (24.8)  | 38 
Total          | 25             | 47         | 72 


ᵃ The chi square of 1.93 is significant (for 1 df) at < .20 level. 


be identified. A further shortcoming in all of 
these studies is the lack of corrected birth 
order data. The fact that the present findings 
deviate from those derived by Schachter 
(1959), from Bakan's (1949) data, is not so 
surprising. Bakan's uncorrected data were 
for persons whose alcoholic status was never 
really determined whereas the present sample 
had been intensively studied and the mem- 
bers clearly identified as alcoholic. 

The finding that the number of alcoholics 
from large families is significantly greater 
than expected is difficult to explain. In the 
absence of specific studies of early and later 
children a theoretical explanation is impos- 
sible. Some suggestive evidence relating to 
differential patterns of child rearing for early 
and later borns has been reported by Sears, 
Maccoby, and Levin (1957). This study 
shows that later born children are less wel- 
come in the family and less often breast fed. 
Large families of course contain larger pro- 
portions of later borns than do small fami- 
lies and hence it may be possible to relate 
the high frequency of children from large 
families in the alcoholic sample to childhood 
deprivations or to the frustration of depend- 
ency needs. 

Contrary to Schachter's (1959) findings 
for (nonalcoholic) patients at a Veterans 
Administration clinic first born and only 
children alcoholics do not become involved in 
psychotherapy more frequently than later 
borns. It would be interesting to determine 
whether first born and only children alco- 
holics more frequently become involved in 
Alcoholics Anonymous. Schachter found that 
his subjects all wished, under anxiety situa- 
tions, to affiliate with persons in the same 
anxiety situation rather than with those in 
dissimilar situations (p. 40). Given this 
finding it would be expected that first born 
and only children alcoholics would more 
often become involved in Alcoholics Anony- 
mous, since this therapy constitutes an ideal 
opportunity to affiliate with persons in 
roughly the same anxiety situations. 
RESPONSES TO COGNITIVE AND PERCEPTUAL AMBIGUITY 
IN CHRONIC AND ACUTE SCHIZOPHRENICS ¹ 


JURIS G. DRAGUNS ² 
Rochester State Hospital 


Responses of 25 acute and 34 chronic schizophrenics to perceptual and cog- 
nitive ambiguity were compared with those of 36 normal control Ss by means 
of: several “decision location” series consisting of blurred photographs presented 
in the order of increasing clarity; a series of pictures which could be matched, 
at Ss’ discretion, with a number of statements allegedly made by the persons 
pictured; a series of pictures to be similarly matched with nationality names. 
The chronic Ss compared to normals offered earlier recognition responses on 
the perceptual task and matched more items on the cognitive measures. The 
scores of the acute group, however, yielded uniformly negative results and were 
found to be intermediate in relation to those of the chronic and normal Ss. 


Within the last decade, a number of con- 
ceptualizations of schizophrenia have come 
to stress what hitherto had been a neglected 
aspect of schizophrenic decompensation: the 
shift toward a more liberal and frequently 
idiosyncratic utilization of scant perceptual 
or cognitive evidence in forming judgments 
of objects, situations, and people. Thus, 
Cameron (1951) speaks of a movement from 
diffuseness toward greater stability and clarity 
which results in a subjectively distorted per- 
ceptual reorganization. Schwartz and Wolfe 
(1959) describe schizophrenia as a “quest for 
certainty in unreality.” McReynolds (1960) 
refers to a facilitated assimilation of percepts 
in schizophrenics brought about by a lowering 
of standards for fitting incoming data into 
pre-existing categories. A number of theo- 
rists, notably Conrad (1958) and Matussek 
(1952, 1953), have presented formulations 
of schizophrenia which are closely akin to 
those mentioned above; they emphasize the 
prevalence of physiognomic qualities in 
meaningful structuring of sensory input and 


¹ An abbreviated version of this paper was pre- 
sented at the American Psychological Association 
meetings in New York, September 1961. 

² The author wishes to express sincere appreciation 
to A. Donald Smith, Victor Atyas, and Dotsie 
Earle for many helpful suggestions in the formative 
stages of this experiment; to Victor Atyas, Mildred 
Brody, and Louis Izzo for assistance in gathering 
data; and to J. Schwartz and T. Bocka for their 
efforts in enlisting the participation of control 
subjects. The helpful encouragement of C. F. Terrence, 
Director, Rochester State Hospital, is gratefully 
acknowledged. 


the resulting certainty in making inferences 
on a slim factual basis. 

These conceptualizations, however, refer 
only to the crystallized stages of the schizo- 
phrenic process. The phases which precede 
such a crystallization are characterized, ac- 
cording to the above authors, by a cautious, 
suspicious, and avoidant attitude in the face 
of inconclusive or ambiguous evidence. 
Conrad (1958) is perhaps the most articulate 
on this point as he postulates a specific stage 
of early schizophrenia in which confusion, 
perplexity, and lack of certainty are the 
salient features. 

Given information that is inconclusive or 
incomplete, one would expect acute and 
chronic schizophrenics to deviate from the 
normal population in opposite directions: 
toward overevaluation of sparse sensory input 
among chronic patients and toward overly 
conservative, cautious sifting of such equivo- 
cal data by acute individuals. The available 
experimental evidence on this point is mostly 
concerned with the attainment of the correct 
recognition under conditions of stimulus im- 
poverishment and predictably points to a lag 
of veridical identification of the stimuli pre- 
sented by schizophrenics (Brengelmann, 
1953; Cohen, Senf, & Huston, 1956; Davis 
& Cullen, 1958; McGinnies & Adornetto, 
1952; Pinillos & Brengelmann, 1953; Rost, 
1956). The point of emergence of the first 
meaningful response in such series has been 
explored with nonpsychotic populations 
(Binder, 1958; Levitt, 1953; Muuss, 1960; 
Smock, 1955, 1957), mostly as one of the 
operational indices of “intolerance of ambiguity” 
(Frenkel-Brunswik, 1948). In the 
present study it was decided to apply a tech- 
nique of a similar format for the comparison 
of the performance of acute and chronic 
schizophrenic groups with normal controls 
and to supplement the series of gradually 
increasing perceptual clarity with two tasks 
which consist of clearly perceptible stimuli 
but leave subjects the option to go to vary- 
ing degrees beyond the information made 
available to them. These materials were in- 
cluded to ascertain any tendencies toward 
stylistic consistency in coping with ambigu- 
ous or incomplete information, regardless of 
the nature of the ambiguity and the format 
of the task. 


Hypotheses 
The following predictions were made: 


1. On a perceptual task of increasing 
clarity, chronic schizophrenic subjects would 
name the object shown earlier than either 
acute schizophrenics or normal controls. 

2. Compared with normal or chronic sub- 
jects, acute schizophrenics would structure 
these stimuli at a later point of the series. 

3. On the two cognitive tasks, the 
chronic patients would perform a greater 
number of item matches on the basis of in- 
sufficient information for such judgments 
than would normal or acute subjects. 

4. Acute subjects would complete fewer 
matches than either normal or chronic sub- 
jects. 


METHOD 


Subjects. A group of 34 schizophrenics, hospital- 
ized continuously over a year, constituted the 
chronic group. The acute group consisted of 25 
patients, diagnosed schizophrenic, who were within 
the first month of their first psychiatric hospitaliza- 
tion. Thirty-six hospital attendants served as con- 
trols. All groups consisted of male subjects between 
the ages of 17-55. Criteria for exclusion from the 
experiment included staff disagreement over diag- 
nosis, presence or question of organicity or mental 
defect, completion of less than four grades of 
schooling, inability to understand or to speak 
English, and refusal or marked unwillingness to 
cooperate. 

Means and standard deviations of the ages and 
educational levels of the three groups were computed 
and are presented in Table 1. Intelligence 
level was roughly assessed by means of the 
Shipley-Hartford test; measures gleaned from it are 
summarized in the same table. While the three groups 
were rather closely equated in amount of 
schooling and verbal intelligence, there was one 
deviation from this pattern: a sizable age difference 
between acute subjects on the one hand, and chronic 
and normal subjects on the other hand. This 
discrepancy, however, appeared to have a negligible 
effect upon the results. 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF AGES, YEARS 
OF SCHOOLING, AND SHIPLEY-HARTFORD INTELLIGENCE, 
VOCABULARY, AND ABSTRACTION 
SCORES IN THE EXPERIMENTAL GROUPS 


Group    |     | Age   | Years of schooling | Intelligence | Vocabulary | Abstraction 
Chronic  | M   | 35.67 | 9.50               | 44.58        | 27.62      | 16.96 
         | SD  | 9.45  | 3.30               | 16.69        | 9.41       | 11.58 
Acute    | M   | 28.40 | 10.40              | 42.28        | 25.36      | 16.92 
         | SD  | 6.65  | 2.62               | 14.45        | 7.77       | 9.29 
Control  | M   | 38.37 | 10.06              | 45.33        | 26.89      | 18.44 
         | SD  | 13.24 | 2.47               | 14.10        | 7.51       | 7.71 

Materials. A perceptual ambiguity task was con- 
structed by photographically blurring to varying 
degrees clear, realistic drawings of simple scenes or 
objects, e.g., a group of trees, a telephone, a violin.³ 
The technique of constructing series of blurred pic- 
tures was reported by Voigt (1956) and its 
modifications used in the construction of the present 
stimuli were described in detail by Draguns and 
Multari (1961). A total of nine series consisting of 
12 pictures each was shown to the subjects indi- 
vidually, beginning with the least clear representa- 
tion. Subjects were instructed to respond at any 
point in a series, provided they “had an idea” of 
what the picture represented. Once this response was 
obtained, further presentation of a given series 
was discontinued. Thus, one response for each series 
was recorded and scored for: recognition level from 
1 at the unclear, to 12 at the clear end of the 
series; correct or erroneous naming of the object 
shown, with the terminal, clear representation as 
the criterion. An individual's recognition level score 
was found by averaging his scores for all of the 
nine series and the number of errors from 0 to 9 
was also tallied. As an estimate of reliability of both 
recognition level and error scores, Pearson r’s were 
computed between the alphabetically first five and 
last four series and are presented in Table 2. 
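
The scoring and the split-half reliability estimate can be sketched as follows. This is only an illustration: the recognition levels below are hypothetical data invented for the example, the nine series are assumed to be already in alphabetical order, and the names `split_half_means` and `pearson_r` are not from the original study.

```python
from math import sqrt


def split_half_means(levels):
    """Mean recognition level on the first five and the last four series."""
    first, last = levels[:5], levels[5:]
    return sum(first) / len(first), sum(last) / len(last)


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical recognition levels (1 = least clear, 12 = clearest picture)
# for four subjects on the nine series.
subjects = [
    [3, 4, 5, 4, 3, 5, 6, 4, 5],
    [6, 7, 6, 8, 7, 7, 8, 6, 7],
    [9, 8, 10, 9, 9, 10, 9, 8, 9],
    [5, 5, 6, 4, 5, 6, 5, 5, 4],
]

# Each subject's overall score is the mean over all nine series.
recognition_scores = [sum(s) / len(s) for s in subjects]

halves = [split_half_means(s) for s in subjects]
reliability = pearson_r([h[0] for h in halves], [h[1] for h in halves])
print(recognition_scores, reliability)
```

The same correlation applied to per-half error counts would give the reliability estimate for the error score.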


³ The writer is indebted to Virginia Jeffrey Smith, 
Rochester artist, for drawing the pictures and to 
Victor Atyas for extensive assistance in photographing 
test materials. 
TABLE 2 


SPLIT-HALF PEARSON r CORRELATION COEFFICIENTS 
FOR RECOGNITION LEVEL AND ERROR SCORES 


Group     | Recognition | Error 
Chronic   | .883        | .660 
Acute     | .746        | .717 
Control   | .856        | .180 


A slightly modified version of the Siegel (1954) 
Tolerance-Intolerance of Cognitive Ambiguity Test 
was used. It is made up of 16 randomly selected 
pictures of people from old popular magazines, to 
be matched with 16 statements, also picked at ran- 
dom from the same sources. Subjects were in- 
structed to match any number of statements and 
pictures or to leave them unmatched, at their dis- 
cretion. It was stressed that the subjects could 
choose to return the sheets to the experimenter 
at any time regardless of whether they were blank, 
partially, or entirely completed. The measure on this 
task was the number of pictures which were 
matched with statements, from 0 to 16. 

A similar task, also devised by Siegel (1954) and 
given with analogous instructions, consisted of 20 
pictures which were to be matched with 20 nation- 
ality names. The measure here was the number of 
matchings completed from 0 to 20. Neither of the 
cognitive tasks lent itself to assessments of split-half 
reliability nor was the computation of a separate 
error score possible. In view of the random selection 
of their materials, all items matched on both tasks 
might, in a sense, be considered errors. 

Procedure. The above three tasks were given 
individually in a random order. In addition to 
these measures, the booklet form of the MMPI and 
the Shipley-Hartford Scale of Intelligence were 
administered a few days before the individual ses- 
sions. Failure to comply with instructions on these 
two tests served to exclude a subject from further 
participation in the experiment. 


RESULTS 
On all four measures obtained on the three 
experimental tasks, distribution of scores was 


TABLE 3 


KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE 
BY RANKS ON EXPERIMENTAL MEASURES 


Measure                          | df | H 
Recognition level                | 2  | 8.77** 
Number of errors                 | 2  | 14.73*** 
Number of statements matched     | 2  | 5.96* 
Number of nationalities matched  | 2  | 7.82** 


* Significant at .10 level. 
** Significant at .05 level. 
*** Significant at .01 level. 


markedly skewed. Hence, Kruskal-Wallis H 
tests and Mann-Whitney U tests were used 
for group comparisons. As Table 3 shows, 
significant intergroup differences were ob- 
tained on three out of four measures while 
on the remaining one, i.e., statement-picture 
matchings, they reached a borderline level of 
significance. 
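
For readers unfamiliar with the Kruskal-Wallis procedure, the H statistic can be sketched as below. This is a minimal illustration under stated assumptions: the three score lists are hypothetical and are not the study's data, tied scores receive average ranks, and no tie correction is applied to H; the function names are invented for the example.

```python
def average_ranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """H = 12 / (N(N+1)) * sum(R_i^2 / n_i) - 3(N+1), without tie correction."""
    pooled = [v for g in groups for v in g]
    ranks = average_ranks(pooled)
    n_total = len(pooled)
    weighted = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        weighted += rank_sum ** 2 / len(g)
        start += len(g)
    return 12.0 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)


# Hypothetical recognition-level scores for three small groups.
chronic = [3.1, 4.0, 4.4]
acute = [4.9, 5.6, 6.2]
control = [6.8, 7.3, 8.0]
h = kruskal_wallis_h([chronic, acute, control])
print(round(h, 2))
```

H is referred to the chi square distribution with k - 1 degrees of freedom (here 2), which is how the values in Table 3 are evaluated.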

These group effects were in a large measure 
based on the significant differences between 
the chronic and the control subjects (see 
Table 4). As predicted, chronic schizophrenics 
made earlier perceptual judgments as well 
as more perceptual recognition errors and 
matched more statements with pictures and 


TABLE 4 


STANDARD SCORES OF MANN-WHITNEY TESTS BETWEEN 
SCHIZOPHRENIC AND CONTROL GROUPS ON 
EXPERIMENTAL MEASURES 


                                  |                          | Direction of difference | 
Task                              | Measure                  | between groups          | z 
Perceptual decision location      | Object naming            | Control > Chronic       | 2.473** 
                                  |                          | Control > Acute         | 1.697 
                                  |                          | Chronic < Acute         | 1.369 
Perceptual decision location      | Number of errors         | Control < Chronic       | 2.947*** 
                                  |                          | Control < Acute         | 1.339 
                                  |                          | Chronic > Acute         | 0.577 
Siegel Tolerance-Intolerance      | Number of statements     | Control < Chronic       | 3.460*** 
of Cognitive Ambiguity            | and pictures matched     | Control < Acute         | 1.803 
                                  |                          | Chronic > Acute         | 0.548 
Siegel Tolerance-Intolerance      | Number of nationalities  | Control < Chronic       | 2.091** 
of Cognitive Ambiguity            | and pictures matched     | Control < Acute         | 1.480 
                                  |                          | Chronic > Acute         | 1.550 


** Significant at .05 level. 
*** Significant at .01 level. 


more pictures with nationalities. The per- 
formance of acute subjects on all tasks was 
intermediate with respect to the other two 
groups without differing significantly from 
either. Thus, hypotheses on the performance 
of acute schizophrenics failed to be supported. 

Rank-order correlations among the various 
experimental scores presented in Table 5, 
though not uniformly significant, hint at a 
pattern of consistency in evaluating ambiguous 
information on the three tasks employed. 
Further correlational analyses given 
in Table 6 concerned the relationship between 
the experimental scores and chronological 
age. As these yielded, with one exception, 
nonsignificant results, age was thought 
to have little contaminating effect upon the 
group differences. The one significant 
correlation obtained would tend to decrease 
rather than increase the differences among 
the three groups compared. As Table 7 
shows, significant positive rank-order cor- 
relations were obtained in the chronic group 
between intelligence and vocabulary on the 
one hand, and early picture recognition, large 
number of errors, and of statement-picture 
matchings on the other hand. Among the 
acute subjects, both abstraction scores and 
years of schooling were positively associated 
with the number of statements matched. Cor- 
relations between length of hospital stay and 
experimental measures, computed only in the 
chronic group, were negligible; Spearman 


TABLE 5 


SPEARMAN RHO’S AMONG EXPERIMENTAL MEASURES IN 
Groups OF SCHIZOPHRENICS AND CONTROLS 


Mouro ana N | Errors | Statements | Nationali- 
Recognition 
Chronic |34| —.619*** | —.426** | —.363** 
Acute 25| —.674*** | —.230 —475 
Employee |36| —.743*** | —.334** | —.478*** 
Errors 
Chronic |34 103706 48s 
Acute 25 .346** 154 
Employee | 36 Kiho .213 
Statements 
Chronic |34 8411+ 
Acute 25 :279 
Employee |36 .452*** 


** Significant beyond .05 level. 
*** Significant beyond .01 level. 


rho's between length of hospitalization and 
picture naming, recognition errors, state- 
ments, and nationalities matched were 
—.051, .172, .096, and .098, respectively. 
Spearman rho’s computed between various
MMPI scales and the cognitive and per-
ceptual ambiguity scores constitute the con-
tents of Tables 8 and 9. The result that stands
out in this matrix of correlations is the virtu-
ally complete absence of significant results
in the control group, whereas there are sev-
eral significant correlation coefficients in the
two pathological groups. Furthermore, the
consistent and high correlations in the acute
group between the Welsh R scale and the
four experimental variables, absent in both of
the other groups, seem to show that conserva-
tive information utilization and error avoid-
ance may be linked in the early stages of
schizophrenia with repressive tendencies as
assessed on the above scale.


TABLE 6

KENDALL TAU’S BETWEEN EXPERIMENTAL
MEASURES AND AGE

Group    | Recognition | Errors | Statements | Nationalities
Chronic  | .275**      | −.162  | −.154      | −.108
Acute    | .042        | −18    | .041       | −.109
Employee | −.026       | .156   | .149       | −.027

Note. Kendall tau's were computed instead of Spearman
rho's to permit the eventual computation of partial correlations
which, however, did not have to be ascertained because of the
nonsignificance of correlations with age.

** Significant at .05 level.
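
The note to Table 6 gives the option of partial correlations as the reason for choosing Kendall's tau. The sketch below is only an illustration of that idea, not the authors' computation: it computes tau-a (no correction for ties) and Kendall's partial rank correlation from three pairwise taus. The function names are hypothetical, and the source reports no partial coefficients.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
        # Tied pairs (product == 0) count toward neither total.
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


def partial_tau(tau_xy, tau_xz, tau_yz):
    """Kendall's partial tau between x and y with z (e.g., age) held constant."""
    denominator = sqrt((1 - tau_xz ** 2) * (1 - tau_yz ** 2))
    return (tau_xy - tau_xz * tau_yz) / denominator
```

When the correlations with the third variable are near zero, as the note reports for age, the partial coefficient stays close to the simple one, which is consistent with the decision not to compute partials.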


DISCUSSION 


The process of percept or concept forma- 
tion—reduced in this experiment to the task 
of matching two items or of emitting a mean- 
ingful response—is conceived here as an act 
of balancing affective, personality dominated
impressions against more reality determined 


TABLE 7

SPEARMAN RHO'S BETWEEN INTELLECTUAL
AND EDUCATIONAL VARIABLES AND
EXPERIMENTAL MEASURES

Measure and group               |  N | Recognition | Errors | Statements | Nationalities
Intelligence (Shipley-Hartford) |    |             |        |            |
  Chronic                       | 34 | −.305       | .306   | .441**     | .138
  Acute                         | 25 | −.033       | .091   | .251       | .143
  Control                       | 36 | .205        | −.226  | .149       | .086
Abstractions (Shipley-Hartford) |    |             |        |            |
  Chronic                       | 34 | −.028       | .264   | .281       | .025
  Acute                         | 25 | .054        | .045   | .360**     | .135
  Control                       | 36 | .091        | −.232  | .132       | .065
Vocabulary (Shipley-Hartford)   |    |             |        |            |
  Chronic                       | 34 | −.326**     | .382** | .434**     | .176
  Acute                         | 25 | −.048       | .058   | .207       | .320
  Control                       | 36 | .142        | .215   | .018       | .021
Years of schooling              |    |             |        |            |
  Chronic                       | 34 | .144        | .238   | .188       | −.087
  Acute                         | 25 | .211        | −.133  | .387**     | .124
  Control                       | 36 | −.210       | .141   | .219       | −.211

** Significant beyond .05 level.


characteristics of the stimuli at hand (Fla-
vell & Draguns, 1957; Graumann, 1959;
Kragh, 1960; Linschotten, 1959; Ombredane, 
1955; Smith, 1957). In this quest, chronic 
schizophrenics tended to arrive at early 
and/or erroneous judgments of external evi- 
dence. This finding lends support to the 
theoretical formulations by Cameron (1951), 
Conrad (1958), McReynolds (1960), Matus- 
sek (1952, 1953), and Schwartz and Wolfe 


TABLE 8 


SPEARMAN RHO'S BETWEEN MMPI DIAGNOSTIC AND 
OTHER SCALES AND EXPERIMENTAL MEASURES 
IN SCHIZOPHRENIC AND NORMAL GROUPS 


MMPI scale and group | Recognition | Errors | Statements | Nationalities
Hs 
Chronic 106 101 .080 
Acute —.508***| — 464***| — 054 
ED ERES —.143 —.143 —.414 
Chronic | —.286 .305 .242 085 
Acute —.215 —.425** 074 —.210 
Control | —.047 —.060 .085 —.121 
Hy 
Chronic | —.371** .336** 242 085 
Acute .686***|  ,543***|  624***) _ 309 
Control | —.010 —.044 —.041 121 
Chronic | —.108 244 .330** | —.062 
Acute — AT4***) — 455***| — 348** | — 321 
i5 Control 021 019 =.115 077 
a 
Chronic | —.204 .415** | —.040 = 35/4" 
Acute —.082 033 337 174 
Control | —.220 213 .067 192 
t 
Chronic .120 .149 —.134 —.249 
Acute —.571***] —.505***| — 750*99| —.g12*+* 
2 Control | —.229 .259 .069 —192 
c 
Chronic -040 172 221 —.164 
Acute —.323 —.189 —.017 —.137 
Control .104 431 189 —.150 
a 
Chronic 271 .180 .285 .084 
Acute —.249 .061 .089 .027 
rana .039 .028 —.105 1253 
deter —.011 -155 041 211 
Acute .303 —.196 —.242 RESI be 
Control | —.005 .059 E125 —.005 
Welsh R 
Chronic | —.028 .092 —.167 —.344** 
Acute -769***| — 700***| — 7g7***| — 628*** 
Control | —.150 VES —.014 —.110 
Welsh A 
Chronic | —.299 244 -308** | .412** 
Acute -021 —.086 =130 —.018 
Control | —.092 467 .236 —476


** Significant beyond .05 level.
*** Significant beyond .01 level.


TABLE 9


SPEARMAN RHO'S BETWEEN MMPI VALIDITY


MMPI scales and group | Recognition | Errors | Statements | Nationalities
L 

Chronic | —.099 .159 .071 

Acute —.131 322 —452** ae 
Control 096 .039 .420*** — 069 
F 


Chronic | —.373** .369** .310** | 142 
Acute .921***| — .209 673***) 


Control | —.013 .098 =.111 
Chronic 114 016 017 
Acute -450** .286 —.339 


Control | —.222 .201 213 


** Significant beyond .05 level. 
*** Significant beyond .01 level. 


(1959) and suggests, in the chronic patho-
logical group, a more liberal fitting in of
responses into conceptual or perceptual
schemata and a preponderance of endogenous
factors in such judgments over exogenous,
object determined ones. Thus, the present
results supplement those obtained in a large
group of perceptual studies cited in the
introductory portion of the article, all of


structuring of ambiguous materials by schizo-
phrenics. The parallel between premature
structuring of vaguely perceived stimuli by
schizophrenics, by children (Draguns &
Multari, 1961), and by experimentally
aroused or chronically anxious subjects
(Smock, 1955, 1957) alludes to the hypo-


ture of such behavior which is discussed in
more detail elsewhere (Draguns, 1961).
The question may be asked, however, whether
the results discussed above reflect any dif-
ferences in perceptual or cognitive structur-
ing or only variations in readiness to report
incomplete or unverified impressions. It may
be contended that when a schizophrenic sub-
ject identifies a blurred spot as “a caske


tribute the statement “We hear about such
incidents right away” to a middle-aged man
seen on a photograph, to cite two actual
responses, he may simply be more willing
to share, because of submissiveness or bore-
dom, guesses which both normal as well as
psychotic subjects covertly make. However,
the qualitative observations of some experi- 
menters concerned with percept formation 
under conditions of stimulus impoverish- 
ment seem to point to the reified and over- 
elaborate nature of schizophrenic responses 
in the early portions of these sequences 
(Rost, 1956) and to the difficulties experi- 
enced by these subjects in assimilating addi- 
tional information once a hypothesis about 
ambiguous evidence is formed (Davis & 
Cullen, 1958). It would appear on the basis 
of these findings that it may be plausible 
to consider the results of the present experi- 
ment as a function of perceptual and con- 
ceptual tendencies toward overinvestment of 
meaning rather than as an artifact of differ- 
ences in verbal report. 

The negative results in the acute group 
which fall consistently between the perform- 
ance of the other two groups appear to be 
contrary to the theoretical expectations on 
which this study is based. At first glance, 
these findings seem to point to a continuous 
progression from a relatively cautious to a 
more brazen cue evaluation in schizophrenia, 
instead of the two-stage process postulated 
above. However, the presence of a marked 
overlap between the chronic and acute groups 
on the experimental measures and the ab- 
sence of significant correlations in the chronic 
group between length of hospitalization and 
performance on any of the present tasks 
suggest that duration of the schizophrenic
process, as assessed in this investigation, may 
not be relevant to the structuring of ambigu- 
ous stimuli in schizophrenia. In that case, 
the positive results in the chronic group
would simply be a function of its more
typical schizophrenic composition, as its
members had exhibited schizophrenic be-
havior over a substantial period of time. A
study of a large sample is needed in which
all stages of schizophrenia, from incipient to
terminal, would be adequately represented in
order to determine conclusively the presence
and nature of any relationships, linear or
curvilinear, between duration of pathology
and response to ambiguous materials. Infor-
mation as to the known length of the schizo-
phrenic process (which may be affected by
variations in its social recognition) may well
be supplemented by data on the slow or
sudden onset of this decompensation. Thus,
the process-reactive dichotomy of schizo- 
phrenia (Becker, 1956) may be worth explor- 
ing in future studies of percept or concept 
formation. 

As consistent response tendencies have 
emerged over a limited range of the experi- 
mental situations investigated, one might 
speak with some justification of a stylistic 
“responsiveness to information input” which 
may be akin to the variable of “organization 
time” proposed by Klein (1949) but some- 
what broader in its implications. The stylistic 
consistency, though observable in both patho- 
logical and normal groups, may possibly stem 
from different sources. The fact that normal 
subjects’ behavior in this experiment was un- 
related to the personality variables assessed 
by the MMPI, whereas a number of such 
relationships were revealed in the two clin- 
ical samples, seems to point, albeit tentatively, 
to the “deneutralization” of perception, 
cognition, and presumably other ego func- 
tions in schizophrenia, as postulated by 
Hartmann (1953). However, because of the 
greater variability of MMPI scores in the 
clinical groups and of the small number of 
significant correlations, this speculative no- 
tion is applied to the data at hand with 
considerable caution.


THE STRUCTURE AND DYNAMICS OF THE ETHNIC
ATTITUDES OF JEWISH ADOLESCENTS * 


MOSHE ANISFELD, STANLEY R. MUNOZ, AND WALLACE E. LAMBERT
McGill University 


A factorial analysis was carried out on a variety of measures obtained from 
100 Jewish high school students in Montreal. A general factor composed of 
negative attitudes to Jews and to gentiles, a hostile disposition to people, 
and verbal aggression against nonconformists was isolated. This factor, re- 
flecting a generally negative social orientation, is independent of 2 other 
factors representing attitudes to parents and attitudes to self. The factorial 
structure suggests that manifest hostility is a possible mediating variable linking
all 3 factors.


In recent years, investigations have strongly 
supported the conclusion made by Hartley 
(1946) that “the degree of tolerance expressed 
by individuals is a generalized function of the 
individual and is not completely determined 
by the specific group towards which the atti- 
tude is directed” (p. 25). The nature of this 
function has been explored and various at- 
tributes were suggested as characteristic of 
the prejudiced personality. A link has also 
been made between an individual’s self-image 
and his attitudes to ethnic groups. It is 
believed that a person burdened with feelings 
of inadequacy is more likely to be intolerant 
of outgroups than a person with a more com- 
fortable self-image. 

The present study, carried out with Jewish 
adolescents in Montreal, was instigated by 
this general theme and was designed to 
investigate whether the reported patterns of 
ethnic attitudes concerning mainly majority 
group members also hold for members of the 
Jewish minority group. Some findings of pre-
vious studies suggest partial answers to this
question. For example, Trent (1953) found
that those Negro children “who were most
self-accepting expressed significantly more
positive attitudes towards both Negroes and
whites than children who were least self-
accepting” (p. 137). Steckler (1957) ob-
tained significant intercorrelations among anti-
Negro, anti-white, ethnocentric, and author-
itarian attitudes for Negro college students.
Two studies of Jewish college students 
(Adelson, 1953; Radke-Yarrow & Lande, 
1953) established that Jewish self-hatred was 
related to authoritarianism, presumed to be an 
index of generalized prejudice. However, no 
studies have appeared which included Jewish 
subjects’ attitudes towards gentiles among 
their variables. The present study was de- 
signed to fill this gap and to bring together 
a series of variables which have been individ- 
ually related to ethnic attitudes. 

The study included five groups of variables: 
attitudes to ethnic groups (Jews and gentiles), 
measures of attitudinal orientations toward 
people in general, variables relating to the 
feelings about parents, measures of self-evalua- 
tion, and measures of scholastic aptitude and 
achievement. 


METHOD 


Subjects 


One hundred tenth grade Jewish students (63 males 
and 37 females) from four classes in one Montreal 
public high school, under the jurisdiction of the 
Protestant School Board, served as subjects. The 
classes tested were almost completely populated by 
Jewish students. The school is located in a lower-
or lower-middle class district and the majority of
students live in this area. An analysis of the occupa-
tions of the students’ parents, using the scheme of
Warner, Meeker, and Eells (1949, pp. 140-141, 165), 
clearly places the vast majority of the subjects in 
the upper-lower or the lower-middle social class. 
The proportions of first, second, and third generation 
Canadians in this sample were: .37, .24, and .39, 
respectively. Forty percent of the subjects’ parents 
speak in Yiddish to their children, and 28% of 
the subjects use this language in addressing their 
parents.


Materials 


Booklets including a personal information sheet 
and questionnaires described below were completed 
by the students during a regular school hour. 

Measures 1-4 below were answered on standard 
six-point scales, ranging from complete agreement 
(+3) to complete disagreement (—3). 
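
As a rough illustration of how an item mean on such a scale can be obtained, the sketch below averages all responses over subjects and items. It assumes the six response values are +3, +2, +1, −1, −2, and −3 with no neutral point, which the source implies but does not spell out; the function name and the sample responses are hypothetical.

```python
ALLOWED_RESPONSES = {-3, -2, -1, 1, 2, 3}  # assumed six-point scale, no zero


def item_mean(responses):
    """Mean response per item, pooled over all subjects and all items.

    `responses` is a list with one list of item scores per subject.
    """
    scores = [score for subject in responses for score in subject]
    if any(score not in ALLOWED_RESPONSES for score in scores):
        raise ValueError("responses must be in -3..-1 or +1..+3")
    return sum(scores) / len(scores)


# Two hypothetical subjects answering two items each.
example = item_mean([[-3, -1], [1, -1]])
```

A negative item mean indicates average disagreement with the statements and a positive one average agreement, which is how the item means reported for the scales below are read.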


Measures of Attitudes to Ethnic Groups 


1. Anti-Semitism (A-S) scale. Six items were
selected from the California Anti-Semitism scale 
(Adorno, Frenkel-Brunswik, Levinson, & Sanford, 
1950) on the basis of their suitability for Jewish 
high school students. This scale was employed to 
measure the extent of the subjects’ agreement with 
derogatory or anti-Semitic attributes assigned to 
Jews. The A-S scale can be viewed as a measure 
of Jewish self-hatred, a notion developed by 
Lewin (1948), Sarnoff (1951), and others. The item
mean for the A-S scale was —1.27, indicating that 
the subjects, on the average, expressed slight dis- 
agreement with anti-Semitic statements. 

2. Jewish Authoritarianism (JA) scale. The 13 
most discriminatory items were drawn from Adelson’s 
(1953) Jewish Authoritarianism scale, apparently 
measuring negative attitudes to lower-class Jews.
The items imply a distinction between “good” and 
"bad" Jews. Undesirable characteristics are at- 
tributed to bad Jews such as: obnoxious, immoral, 
and “try to intrude themselves into circles where 
they are not wanted.” The bad Jews are also blamed 
for invoking anti-Semitism. This scale was em-
ployed as a more specific index of Jewish self-
hatred. The item mean for the JA scale was SAL
indicating very slight opposition to the statements 
in this scale, that is, the subjects as a group did 
not express feelings of self-hatred. 

3. Anti-gentilism (A-G) scale. The scale consists of 
18 positively worded items. The items were con- 
structed from the California Anti-Semitism scale 
(Adorno et al., 1950) by interchanging the words
“Jews” and “gentiles” in each item as well as re- 
wording certain items completely. A sample item is: 
Gentiles may have moral standards that they apply 
in their dealings with each other, but with Jews they 
are unscrupulous, ruthless, and undependable. The 
item mean for the A-G scale was —1.46, showing 
that the subjects tended to disagree with anti-gentile 
statements. 


Measures of Orientation to People in General


4. F Scale. Twelve items were chosen from the
original F Scale, Form 40-45 (Adorno et al., 1950).
The criteria for selection were discriminatory power
and appropriateness for Canadian high school
students. The scale as here composed is mainly con-
cerned with relations to authority and should there-
fore be expected to be salient for adolescents. Four
items deal with the importance of respect for, and
uncritical obedience of parents and other authorities
(authoritarian submission, in the California group's
terminology). Four other items strongly condemn peo-
ple who violate conventional values and disobey au-
thorities (authoritarian aggression). One item ex-
presses both authoritarian submission and aggression.
The item mean for the authoritarianism scale was
+.62.

5. Manifest Hostility (MH) scale. Fourteen items
were taken from Siegel’s (1956) original MH scale.
The items describe, in the first person, hostile and
aggressive behavioral and emotional dispositions
towards people, distrust of their motives, and
suspiciousness. The subjects were asked to check
the statements as either true (scored +1) or false
(scored −1).


Measures of Self-Evaluation 


6. Feelings of Social Inadequacy (FSI) scale. Nine
items which could be answered in one particular
form (i.e., very often, fairly often, sometimes, once
in a great while, practically never) were chosen
from the FSI scale of Hovland and Janis (1959).
The items ask the subjects “to give self-ratings on
anxiety in social situations, self-consciousness, and
feelings of personal worthlessness.”

7. Evaluation of Self. Subjects rated the concept
“me” on 22 semantic scales. The bipolar adjectives


unprejudiced, brave-cowardly, handsome-ugly, color-
ful-colorless, friendly-unfriendly, honest-dishonest,
stupid-smart, kind-cruel, pleasant-unpleasant, suc-
cessful-unsuccessful, secure-insecure, dependable-un-
dependable, leader-follower, mature-immature, stable-
unstable, happy-sad, popular-unpopular, hard work-
ing-lazy, ambitious-not ambitious. These adjectives
sample mainly the dominant evaluative factor of
the semantic differential (see Osgood, Suci, &
Tannenbaum, 1957, particularly pp. 179-185). The
favorable end of each bipolar scale was assigned a
score of 7 and the unfavorable end a score of 1.
The 22 ratings were then summed to yield a score
on Evaluation of Self. Thus, a high score indicates a
favorable evaluation of self.
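
The scoring rule just described can be sketched as follows. This is an illustration only: it assumes each rating is recorded as a position from 1 (left-hand adjective) to 7 (right-hand adjective) and that the unfavorable end is scored 1, as on a conventional seven-step scale. The function name and the left/right flags are hypothetical.

```python
def evaluation_score(positions, favorable_on_left):
    """Sum of favorability scores over the bipolar scales.

    positions[i] is the checked position (1..7, counted from the left pole);
    favorable_on_left[i] is True when the left adjective is the favorable one
    (e.g., brave-cowardly) and False otherwise (e.g., stupid-smart).
    """
    if len(positions) != len(favorable_on_left):
        raise ValueError("one left/right flag is needed per scale")
    total = 0
    for position, left_is_favorable in zip(positions, favorable_on_left):
        if not 1 <= position <= 7:
            raise ValueError("positions must lie between 1 and 7")
        # Reverse the scale when the favorable pole is printed on the left.
        total += (8 - position) if left_is_favorable else position
    return total
```

Under these assumptions a subject who checks the most favorable step on all 22 scales would obtain 154 and one who checks the least favorable step throughout would obtain 22.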


Measures of Relation to Parents 


8. Evaluation of Father. 9. Evaluation of Mother.
Subjects rated each of the concepts “father” and
“mother” on the same scales of the semantic differ-
ential. Variables 8-9 were scored in the same way as
Variable 7.


TABLE 1

CORRELATION MATRIX

Measure                                  | 1 | 2     | 3     | 4    | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13
 1. Anti-Semitism (A-S)                  | - | .34** | .50** | .24* |   |   |   |   |   |    |    |    |
 2. Jewish Authoritarianism              |   | -     | 40%   | .14  |   |   |   |   |   |    |    |    |
 3. Anti-gentilism (A-G)                 |   |       | -     | .224 |   |   |   |   |   |    |    |    |
 4. F Scale                              |   |       |       |      |   |   |   |   |   |    |    |    |
 5. Manifest Hostility (MH)              |   |       |       |      |   |   |   |   |   |    |    |    |
 6. Feelings of Social Inadequacy (FSI)  |   |       |       |      |   |   |   |   |   |    |    |    |
 7. Evaluation of Self a                 |   |       |       |      |   |   |   |   |   |    |    |    |
 8. Evaluation of Father a               |   |       |       |      |   |   |   |   |   |    |    |    |
 9. Evaluation of Mother a               |   |       |       |      |   |   |   |   |   |    |    |    |
10. Identification with Father b         |   |       |       |      |   |   |   |   |   |    |    |    |
11. Identification with Mother b         |   |       |       |      |   |   |   |   |   |    |    |    |
12. IQ scores                            |   |       |       |      |   |   |   |   |   |    |    |    |
13. School averages                      |   |       |       |      |   |   |   |   |   |    |    |    |

Note. N varies from 77 to 100 due to incomplete data.
a High score indicates favorable evaluation.
b High score indicates low identification.
* Significant at or beyond the .05 level.
** Significant at or beyond the .01 level.


10. Identification with Father. 11. Identification 
with Mother. Osgood Ds (Osgood et al., 1957) be-
tween the concepts “me” and “father” and between
“me” and “mother” rated on the 22 scales listed 
under Variable 7 and on “permissive-strict” con- 
stitute scores on these variables. The higher the score, 
the less the identification. 

Our use of the semantic differential as a measure 
of identification is based on Lazowick’s (1955) 
study which demonstrated the power of this method. 
Identification was defined by Lazowick as a similarity 
in semantic “meaning” structures of a child and his 
parents, where “meaning” is defined as representa- 
tional mediational responses presumed to be indexed 
by the semantic differential scales. 
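
The identification scores can be illustrated with a short sketch of the D statistic, taken here as the square root of the summed squared differences between two rating profiles over the same scales. The function name and the sample profiles are hypothetical, and the sketch is not the authors' program.

```python
from math import sqrt


def osgood_d(profile_a, profile_b):
    """Distance between two semantic differential profiles on the same scales."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must be rated on the same scales")
    return sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


# Hypothetical ratings of "me" and "father" on two scales.
distance = osgood_d([7, 1], [4, 5])
```

Identical profiles give a D of zero, so larger values mean less similarity between the two concepts, in line with the statement that a higher score indicates less identification.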


Measures of Scholastic Aptitude and Achieve- 
ment 

Scores on the following two variables were taken 
from the school records.

12. IQ scores (based on the Otis Beta test). These 
scores were available for 89 subjects only. 

13. School averages (for the 1959-1960 scholastic 
year). Averages were not available for those 18 
students who had not attended the school the 
previous year. 


RESULTS AND DISCUSSION 


Pearson product-moment intercorrelations 
for the 13 measures described were calculated 
for the whole group of male and female 
subjects³ and are presented in Table 1.
The correlation matrix was factor analyzed 
using Thurstone's (1947) centroid method. 


3 Only minor differences, not directly relevant to
the present study, were found in a separate analysis
for male and female subjects. Therefore the two
groups were pooled for the major statistical analyses
presented here.


Since Identification with Father (Variable 
10) and Identification with Mother (Variable 
11) were derived from Variables 7-9 (Evalua- 
tion of Self, Evaluation of Father, and Evalua- 
tion of Mother), there may exist statistically 
built-in correlations, void of psychological
meaning, between the following pairs of vari- 
ables: 7 and 10, 7 and 11, 8 and 10, 9 and 11. 
Variables 7, 9, and 10 were therefore excluded 
from the factor analysis. This choice of vari- 
ables to be excluded reduces the likelihood of 
spurious factors emerging. Since most of the 
residual correlations after the extraction of 
five factors (presented in Table 2) were of 
the magnitude of .01 and none exceeded .04, 
no more factors were sought. The five factors 
were rotated by Kaiser's normal varimax 
criterion method (Harman, 1960), which 
yielded the matrix shown in Table 3. These 
operations were performed by the IBM 650 
digital computer with programs developed at 
McGill. 
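
For readers who wish to see what a normal varimax rotation involves, the following is a minimal sketch and not the McGill program, which is not described in the source. It assumes the pairwise (planar) rotation form of Kaiser's criterion: rows are first scaled to unit length, each pair of factors is rotated by the angle that maximizes the criterion for that pair, and the row scaling is undone at the end. The function name, tolerance, and sweep limit are hypothetical choices.

```python
from math import atan2, cos, sin, sqrt


def varimax(loadings, tol=1e-8, max_sweeps=100):
    """Rotate a loading matrix (list of rows) toward the normal varimax criterion."""
    n_vars = len(loadings)
    n_factors = len(loadings[0])
    # Kaiser normalization: divide each row by the square root of its communality.
    norms = [sqrt(sum(a * a for a in row)) or 1.0 for row in loadings]
    rows = [[a / h for a in row] for row, h in zip(loadings, norms)]

    for _ in range(max_sweeps):
        largest_angle = 0.0
        for p in range(n_factors - 1):
            for q in range(p + 1, n_factors):
                u = [r[p] ** 2 - r[q] ** 2 for r in rows]
                v = [2 * r[p] * r[q] for r in rows]
                a, b = sum(u), sum(v)
                c = sum(ui * ui - vi * vi for ui, vi in zip(u, v))
                d = 2 * sum(ui * vi for ui, vi in zip(u, v))
                # Angle maximizing the criterion for this pair of factors.
                angle = atan2(d - 2 * a * b / n_vars,
                              c - (a * a - b * b) / n_vars) / 4
                cs, sn = cos(angle), sin(angle)
                for r in rows:
                    r[p], r[q] = r[p] * cs + r[q] * sn, -r[p] * sn + r[q] * cs
                largest_angle = max(largest_angle, abs(angle))
        if largest_angle < tol:  # no pair needed a further rotation
            break

    # Restore the original row lengths (communalities are unchanged by rotation).
    return [[a * h for a in row] for row, h in zip(rows, norms)]
```

Because the rotation is orthogonal, each variable's communality is the same before and after rotation; only the distribution of its loadings across factors changes.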

The composition of Factor I seems to in- 
dicate that such diverse variables as Jewish 
self-hatred (A-S and JA scales), anti- 
gentilism, manifest hostility, and authoritari- 
anism (F Scale) have some feature in com- 
mon. For all these variables, the objects of 
evaluation are people other than oneself and 
the evaluations tend to be similar. Factor I 
can thus be defined as a measure of a gen- 
eral social orientation. The clustering of the 
A-S, JA, and A-G scales suggests that a 
Jewish adolescent who holds negative attitudes 
towards gentiles may be expected to react


TABLE 2

CENTROID FACTOR MATRIX

Measure                                       | I    | II   | III  | IV   | V
 1. Anti-Semitism (A-S) scale                 | AT   | -25  | −.46 | −.10 | −.14
 2. Jewish Authoritarianism (JA) scale        | .49  | −.46 | −.31 | .26  | .12
 3. Anti-gentilism (A-G) scale                | 1    | −.49 | −.26 | −.22 | A0
 4. F Scale                                   | −.14 | −.55 | −.29 | .07  | −.22
 5. Manifest Hostility (MH) scale             | .65  | .07  | −.17 | −.08 | −.16
 6. Feelings of Social Inadequacy (FSI) scale | 1    | .30  | .48  | .07  | −.14
 8. Evaluation of Father a                    | −.53 | −.22 | .24  | −.14 | −.22
11. Identification with Mother b              | .58  | .40  | −.07 | −.26 | .08
12. IQ scores                                 | −.18 | .40  | −.42 | .06  | = At
13. School averages                           | −.26 | .44  | −.54 | .16  | −.04

a High score indicates favorable evaluation.
b High score indicates low identification.


similarly to his own group. The contribution 
of manifest hostility to Factor I lends support 
to the idea that the social orientation to 
different ethnic groups is a reflection of a 
pervasive tendency of an individual to react 
to his social environment in either a hostile 
or friendly manner. 

Although part of the common variance
in Factor I may be due to an acquiescence set
(see, for example, Couch & Keniston, 1960;
Peabody, 1961), it seems that this factor
reflects more than acquiescence. First, the F
Scale has its highest loading on Factor II
and a relatively low loading on Factor I,
which would not be in keeping with an
acquiescence interpretation of the first factor.
Second, the averages for the ethnic attitude
scales are negative, which indicates that the
subjects tended to disagree with
the generalized statements about Jews and
gentiles and it is mainly variation in the
extent of their disagreement that accounts
for the correlations obtained. A recent study
(Peabody, 1961, p. 10) has shown that
“Disagreement could usually be attributed to
attitudes opposing authoritarianism, dogma-
tism, or anti-Semitism” and not to acquies-
cence set. Third, the same anti-Semitism and
anti-gentilism scales used in the present study
were also employed in another study (Bogo,
1961) which included evaluations of “Jews”
and “gentiles” on the semantic differential.
In that study the attitude scales correlated
with scores derived from the semantic dif-
ferential ratings of Jews and gentiles. Since
the semantic differential is not an “agree-
disagree” type measure, the correlations ob-
tained seem to be indicative of content validity
of the attitude scale. Finally, the F Scale
shows a zero correlation with the MH scale.
This would not likely have happened if our
subjects had exhibited an acquiescence set to
any appreciable degree.


TABLE 3

ROTATED FACTOR MATRIX
(Orthogonal)

Measure                                       | I    | II   | III  | IV   | V
 1. Anti-Semitism (A-S) scale                 | -67  | .01  | A3   | .20  | .41
 2. Jewish Authoritarianism (JA) scale        | -36  | −.02 | Al   | .55  | .02
 3. Anti-gentilism (A-G) scale                | -12  | .00  | −.09 | .22  | −.25
 4. F Scale                                   | .34  | .58  | −.08 | −.02 | .06
 5. Manifest Hostility (MH) scale             | .46  | −.31 | .38  | .20  | −.02
 6. Feelings of Social Inadequacy (FSI) scale | .02  | −.38 | .48  | AS   | −.13
 8. Evaluation of Father a                    | −.21 | .38  | −.12 | −.49 | SUI
11. Identification with Mother b              | 225  | −.67 | .21  | Al   | .02
12. IQ scores                                 | −.06 | −.10 | -03  | −.02 | .61
13. School averages                           | .02  | .15  | −.12 | .09  | .60

a High score indicates favorable evaluation.
b High score indicates low identification.

Factor II appears to reflect an attitude 
towards parents. Identification with Mother
has the highest loading on this factor, and 
Evaluation of Father is also represented on it. 
The interpretation of Factor II as a parental 
attitude factor is further supported by the 
high loading of the F Scale. Thus, it will 
be noted that the relations of the F Scale with 
the MH scale, Evaluation of Father, and 
Identification with Mother in Factor II are 
the reverse of those found in Factor I. That 
is, where the F Scale was positively related 
to one of these variables in Factor I it is 
negatively related to it in Factor II and vice 
versa. As already mentioned, the items of the 
F Scale fall into two logically distinctive 
categories, suggesting that the scale measures 
authoritarian aggression on one hand and 
authoritarian submission on the other. These 
two components have been found by Bordua 
(1961) to be independent. The argument 
here is that the aggression component shares 
common variance with those other variables 
which measure a negative social orientation 
(Factor I), whereas the submission com- 
ponent, concerned mainly with submission to 
parental authority, contributes to Factor II. 

Viewing the constitution of Factor II in 
accordance with the foregoing observations 
about the F Scale, it seems that a Jewish 
adolescent who evaluates his parents favor- 
ably, identifies with them, and submits to 
their authority is not likely to be burdened 
by feelings of social inadequacy and can be 
expected to be relatively free of social hos- 
tility. As can be seen from the correlation 
matrix (Table 1), he also enjoys a relatively 
favorable self-evaluation. Factor II thus seems 
to paint a different picture of the high scorer 
on the F Scale than that developed by
Adorno et al. (1950). Previous research (sum- 
marized by Titus & Hollander, 1957) has 
already refuted the California group's descrip- 
tion of the authoritarian individual as neurotic 
and maladjusted. In the present study, the
composition of Factor II further suggests that
for the adolescents studied, authoritarian sub- 
mission may constitute a component of 
“healthy” adjustment to their social environ- 
ment. It is of course possible that the “favor- 
able” characteristics ascribed to the author- 
itarian adolescent by Factor II are rather 
superficial and cover up underlying alienation 
from parents and feelings of social insecurity, 
as Frenkel-Brunswik (1948) would apparently 
argue. 

The relation between feelings of social in- 
adequacy and manifest hostility indicated in 
Factor II becomes more obvious in Factor III 
and can probably be explained as hostility 
directed against people in general who are 
perceived as agents of frustration in that they 
contribute to one's own feelings of inadequacy 
in social situations. 

Factor IV reveals a relation between the 
JA scale and Evaluation of Father. It seems 
likely that some students would perceive 
their parents as representative members of the 
lower social class Jews in the community and 
hold toward them the same negative feelings 
they do toward all “bad” Jews.

The loadings of IQ and school averages
on Factor V define it as a scholastic aptitude 
and achievement factor. This factor, per se, 
is of no particular interest in the present 
context, but the failure of intelligence to cor- 
relate with any of the attitude variables is 
important to note because it is inconsistent 
with certain previous studies. Gough (1951), 
working with 271 high school seniors in a 
small midwestern community, found that Otis 
IQ scores (the same estimate of IQ as used 
in the present study) correlated negatively 
with the California Anti-Semitism (A-S) scale 
(r = —.37) and with an E-F scale, composed 
of the California Ethnocentrism and Author- 
itarianism scales (r = —.43). He also found 
that high school honor point ratios (compara- 
ble to our school averages) correlated nega- 
tively with the A-S and E-F scales in the 
magnitude of —.38 and —.34, respectively. 
Cohn (1952) interpreted these (and other)
findings as indicating that intelligent people
know how to fake and answer “properly,” 
that is, they know how to obtain low scores 
on prejudice scales. From this point of view,
the absence of correlation between intelligence
and attitude variables in the present study
may indicate that the subjects expressed their 
feelings frankly, and therefore increases con- 
fidence in the findings. 

Viewing the factor pattern with reference 
to the structure and dynamics of ethnic at-
titudes, the following picture emerges. At-
titudes to Jews and to gentiles, a hostile disposition
towards people, and verbal aggression directed 
against nonconformists form a single factor 
(Factor I) which seems to reflect a generalized 
negative orientation towards people. The sub- 
jects’ attitude towards their parents and their 
feelings about themselves (represented in 
Factors II and III) are independent of the 
disposition towards ethnic groups: That is, a 
negative view of one’s parents, weak identifi- 
cation with them, and an inferior evaluation 
of oneself are not directly associated with 
negative attitudes to Jews or gentiles. How- 
ever, the presence of the MH scale in Factors 
I, II, and III suggests that manifest hostility 
is a possible mediator between attitudes 
towards self and parents on one hand and 
attitudes to ethnic groups on the other. 

This study, taken in the context of others 
concerned with the same relationships, sug- 
gests that problems in ethnic ideology could 
be formulated in a more comprehensive 
manner in terms of a person’s disposition to 
people in general rather than in essentially 
sociological terms of an ingroup-outgroup 
dichotomy. Such a formulation would apply 
to both majority and minority group members 
and to the relatively unprejudiced individuals 
as well.


THE EFFECTS OF THREAT IN BARGAINING:
CRITICAL AND EXPERIMENTAL ANALYSIS 1


LEE A. BORAH, JR. 
Wagner College 


A 1st experiment replicated the main result of the Deutsch and Krauss ex- 
periment which showed that the presence of barriers in the bargaining situa- 
tion led to poorer outcomes for the players. It also indicated, however, that 
the main results obtained by Deutsch and Krauss and the similar ones found 
in the present experiment are probably due to the spurious effect of the 
length of the longer pathway and the lack of comparability between conditions. 
In a 2nd experiment, the introduction of electric shock—whether given no 
interpretation, interpreted as a means of coercing the other to yield, or 
interpreted as a means of threatening future unpleasantness—did not sig- 
nificantly change the outcomes for the bargainers. 


This paper reports two investigations of 
the effect of threat in a bargaining situation 
previously studied by Deutsch and Krauss 
(1960). The present work results from a sharp 
disagreement with their interpretation of their 
experimental results. These investigators con- 
ducted an experiment in which each of two 
subjects, moving in opposite directions, wanted 
to be first through a one-way path in order 
to reach their respective goals and make 
money. The bargaining concerned which one 
would use the path first. The threats con- 
sisted of barriers which could be placed on 
the one-way pathway to prevent the other 
subject from using it. A player could also
reach his goal by using a longer, more costly 
pathway which could not be blocked. The 
subjects were given an individualistic orientation,
i.e., they were told to attempt to make
as much money as possible for themselves 
and to have no interest in whether the other 
made or lost money. Success in bargaining was 
evaluated by the joint payoff (the sum of 
their respective profits) gained by the pair on 
each trial. 

Deutsch and Krauss reasoned that greater 
difficulty would occur in bargaining in the 
Bilateral Threat condition (both subjects 


1 This research was conducted in the Laboratory 
for Research in Social Relations at the University 
of Minnesota and is based upon a thesis submitted
in partial fulfillment of the requirements for the
PhD. The research was supported, in part, by
Grant G-5553 from the National Science Founda- 
tion to H. H. Kelley, whose guidance and help 
are gratefully acknowledged. 
controlling barriers) than in the No Threat
condition (no barriers available) because (a) 
where threats are available and there is a 
conflict of interest, the threats will be used, 
and (b) where threats have been used, the 
person threatened, if he believes himself of 
equal or superior status to the threatener, will 
feel hostility and will tend to use counterthreat 
and/or show resistance to yielding to the 
threat. The use of threat should strengthen 
the competitive behavior of the bargainers 
and where the competitive interests are 
heightened there is a decreased probability 
of reaching an agreement. Their data con- 
firmed the hypothesis: The presence of the 
barriers led to poorer outcomes for the sub- 
jects as indicated by lower total joint payoff. 

The principal criticism of the Deutsch and 
Krauss experiment concerns the correctness 
of their interpretation that the lower joint 
outcomes in the Bilateral Threat condition 
indicate greater competitiveness. From a con- 
sideration of the subject's alternatives in 
the No Threat condition, it is apparent that 
the only way in which a player can assure 
himself of making an amount equal to or
greater than the other player is by taking the 
shorter pathway and attempting to go through 
it. In the Bilateral Threat condition the situa- 
tion is much more complex and the alter- 
natives which the subjects have are rather 
different. If the other player will not permit 
the subject to be first through the short 
pathway (and no relatively competitively-minded
player will), there is still a chance
to make more profit than the other by more
efficient use of one’s barrier and long alter- 
native route to the goal. 

Thus, the two conditions are not compa- 
rable in the manner in which the use of the 
long route is affected by increasing degrees 
of competitiveness. Under conditions of co- 
operation the short path is presumably the 
route generally indicated, so with increasing 
competitiveness there will be no change in 
use of the routes for the No Threat subjects 
but there will be an increase in use of the 
longer paths for the Bilateral Threat sub- 
jects. Therefore, with equally increasing com- 
petitiveness the No Threat and Threat con- 
ditions will be increasingly different in the 
joint payoff measure. 

If this argument is correct, it would seem 
that the only way to compare the Threat and 
No Threat conditions with respect to the 
amount of competitiveness they generate is 
in terms of some behavior that does not 
reflect the use or nonuse of the longer path. 
If a measure of such behavior is greater for 
the Threat condition, then it would seem that, 
as Deutsch and Krauss suggest, the players 
there are more competitive. If it is equal for 
the two conditions, the interpretation that the 
barriers heighten competitiveness would be 
cast in doubt. 

The first purpose of the present experiment 
was to replicate the Deutsch and Krauss 
No Threat and Bilateral Threat conditions 
in order to obtain data on which to base the 
analysis suggested above. It was hypothesized 
that these conditions generate competitiveness 
to equal degrees and that insofar as an index 
of competitiveness comparable for the two 
conditions is used, the barriers have no ef- 
fect upon the outcomes of the bargaining. 
The basic one-way path situation is one in 
which the bargainers are likely to develop
feelings of competitiveness (cf. Thibaut &
Kelley, 1959, pp. 227-229, on competition
and rivalry), and the barriers have the effect 
merely of changing the nature of the interac- 
tion rather than adding to the competitive 
conflict. 

The second purpose of the present research 
was to study the effects of a type of threat 
the use of which does not directly affect the 
dependent variable (the measure of efficiency 


in reaching agreement). Electric shock would
appear to be an easily manipulable means of
affecting the other subject without changing
other aspects of the mutually interdependent
situation. The shock, which was given
different interpretations in the experiments, is
considered a threat since its use is detrimental
to the welfare of the other and is always done
intentionally. It was hypothesized that the
presence of the shock will lead to poorer
outcomes for the bargainers and that this will
be especially marked when it is explicitly
portrayed as a means of coercion or threat.

A final problem concerns the attitudes which
bargainers have toward each other at the
outset of their interaction. These would seem
to be basic to the development of competitiveness
and hence to the outcomes of the
bargaining. Where there are unacceptable
claims of superiority, great difficulty would be
expected in bargaining, particularly, as in the
Deutsch and Krauss situation, where equality
of outcomes is not easily obtainable. It was
hypothesized that where unacceptable claims
of superiority are believed to exist, the bargaining
outcomes will be poorer than where
no such claims are believed to exist.

Both of the experiments reported here made
use of an experimental situation which was
patterned very closely after the Deutsch and
Krauss bargaining game.


EXPERIMENT I 
Method 2


A 2 X 2 design varied the threats the subjects
could use in the bargaining: presence versus absence
of barriers and presence versus absence of electric
shock. The four resulting conditions are No Threat;
Barrier condition, both subjects controlling barriers
(the Deutsch and Krauss Bilateral Threat condition);
Shock condition, both subjects having shock
available; and Shock and Barrier, where both types
of threat are available. In the two conditions where
electrical shock was present, no interpretation was
made of it nor was any rationale given for its use.

The subjects were 48 male and 48 female students
enrolled in various sections of introductory psychology
at the University of Minnesota. The students
were selected from those who had indicated in
classes their willingness to participate in experiments.
The mean ages were 20.4 for the males and 
for the females. Each bargaining pair consisted of


2 For a full description of the methods used in the
two experiments, see Borah (1961).
persons of the same sex. Six male and six female
groups were run in each condition. 

By having the subjects report to different rooms 
and by the arrangement of screens around the 
experimental room, the subjects did not know with 
whom they were playing. It was, however, possible 
for them to see and be seen by the experimenter 
at all times during the experiment. No talking was 
allowed in the experimental room by the subjects. 

As a substitute for the elaborate electrical equip- 
ment used by Deutsch and Krauss, the present ex- 
periments used gameboards and chess pawns which 
the subjects moved over the squares of the board. 
The “start” and “goal” for the subjects were at op- 
posite ends of the gameboard so they were moving 
in opposite directions. Two paths were available from 
each start position to the respective goals. The longer 
path consisted of 76 squares while the shorter path 
had 34 squares, the center 16 of which were com- 
mon to both subjects’ shorter path. The subjects 
could not pass in the common section. 

After both subjects were seated at opposite 
ends of a table with the two identical gameboards 
in front of them and, where appropriate, fastened 
into the shock electrodes, the instructions for play- 
ing the game were given. All subjects were given 
an individualistic orientation. The amount of money 
they pretended to make on each game was $.60 
minus the number of moves each had had the 
opportunity to make. Such opportunities were 
signaled by loud clicks presented 30 times per minute 
and recorded by a counter. The subjects were instructed
that each time the signal was given, they
could move one square forward, one square back- 
ward, or stay where they were. 
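
The payoff rule just described can be illustrated with a small sketch. It assumes that payoffs are expressed in cents and that each signalled move opportunity costs one cent; the function names are hypothetical and are not taken from the original procedure.

```python
# Sketch of the per-game payoff rule, under the assumption that payoffs are in
# cents and every signalled move opportunity costs one cent.
START_AMOUNT = 60   # the $.60 from which each player's earnings are computed
SHORT_PATH = 34     # squares on the shorter path (from the text)
LONG_PATH = 76      # squares on the longer path (from the text)


def payoff(move_opportunities: int) -> int:
    """Pretended earnings (in cents) for one player on one game."""
    return START_AMOUNT - move_opportunities


def joint_payoff(moves_a: int, moves_b: int) -> int:
    """Joint payoff for a pair on one game: the sum of both players' payoffs."""
    return payoff(moves_a) + payoff(moves_b)


# A player who goes straight through the shorter path without delay:
print(payoff(SHORT_PATH))                   # 26
# Both players take the longer path at once and never meet:
print(joint_payoff(LONG_PATH, LONG_PATH))   # -32
```

Under these assumptions a pair that always avoids the common section loses money on every game, which is why heavy use of the longer path depresses the joint payoff regardless of how competitive the players are.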

In the conditions where barriers were available, 
each subject was shown that he could place one at 
the end of the common path nearest his start posi- 
tion. He could do so only when he was himself on 
the shorter pathway, though not necessarily in the 
common section. Once placed, a barrier did not
limit the moves of the subject controlling it and 
could be removed by him at any later time, regard- 
less of the path he subsequently used. 

In the conditions where shocks were available, 
the subjects were told that they could give a 
shock to the other subject at any time during the 
game. They could give as many as they wished, 
whenever they wished. They indicated their desire 

to do so by placing their hand fastened into the
electrodes on a paper which had “shock” printed on 
it, and the experimenter immediately and obviously 
delivered the shock. The voltage of the alternating 
current generator was set at 25 volts and the 
equipment fused so that the maximum current 
flow possible was 1/200 ampere. 

Because they could not see each other’s boards
and moves, it was mandatory for the subjects to
have some way of knowing when they could not
move forward in the shorter pathway because
the other’s piece was blocking them and when the
other subject had placed the barrier on their path 
so that the shorter path was blocked for them. 


TABLE 1

MEAN TOTAL JOINT PAYOFF FOR EACH CONDITION BY
FIRST AND SECOND HALVES

(Experiment I)

Condition             First half    Second half
No Threat                  31.08         -44.17
Barriers                 -545.25        -550.75
Shock                     -11.92          81.67
Shock and Barriers       -515.17        -391.00


Two signal lights in front of the players indicated 
when these events occurred. 

After the procedures of the game had been ex- 
plained, all groups were given five practice trials 
on which they took turns backing up to allow the 
other to go first through the shorter pathway, took 
turns waiting for the other subject to go through 
the common section, and both took the longer 
pathway. In the conditions where the barriers were
present, the subjects had three additional practice 
trials: both placing the barriers up and then taking 
the longer pathway and alternating in allowing 
the other to go through the common section after 
both had placed the barriers on the pathway. In 
the conditions where shocks were available, the 
subjects ended the practice trials with two trials 
using the shock—exchanging shocks and alternating 
backing up to allow the other subject to use the 
common section first. 

After the subjects had completed the practice 
trials without errors, the individualistic orientation 
was re-emphasized and 20 games were played. 
The subjects did not know in advance, however, how 
many games would be played. After the twentieth
trial, they answered a questionnaire and were told
about the experiment, all of their questions were 
answered, and they were pledged to secrecy concerning
it. From comments made by subjects during
this final period, it appeared that subjects from 
earlier sessions had not discussed the experiment 
with those who participated subsequently. 


Results 


Kolmogorov-Smirnov tests (Siegel, 1956) 
showed no sex differences within any of the 
experimental conditions for either the total 
joint payoff (the amount earned by both
subjects summed over all trials) achieved by 
each group or the median joint payoffs (the 
median value for each group over the 20 
trials). The two sexes were, therefore, com- 
bined for an analysis of variance. The analysis 
was made of the total joint payoff scores, 
including a breakdown between the first and 
second sets of 10 games in order to test for
trends. The means for the eight cells in the 
analysis of variance are presented in Table 1.
The only variable which is significant is the 
presence versus absence of the barriers. The 
F value of 15.23 is significant beyond the 1% 
level (df = 1/44). The Kolmogorov-Smirnov 
test was used to compare distributions of 
median joint payoffs. The results are in agree- 
ment with those from the analysis of variance 
in showing that the barriers lead to signifi- 
cantly poorer joint payoffs for the bargainers. 

Because the results of the present adapta- 
tion of Deutsch and Krauss’ procedure con- 
firm their findings regarding the deleterious 
effect of the barriers, it seems reasonable to 
proceed with further analysis of the present 
data with some confidence that such analysis 
has relevance for interpreting their experi- 
ment. The question to be answered is the one 
raised earlier: Are the conditions with and 
without barriers different when compared in 
terms of a measure of “difficulty in reaching 
agreement” that does not reflect whether or 
not subjects used the longer pathway? 

There are many ways in which time can 
be lost by a player in the game: not moving 
forward as soon as possible; waiting by a
closed barrier; and, blocking each other on 
the shorter path, symbolically “arguing” 
about which is going to go through first. 
Some of the ways in which time may be lost 
may occur both when barriers are present 
and when they are absent, but others are 
possible only when they are present. An 
example of the latter is time spent withdrawing 
from the shorter pathway without having met 
the other player when it is learned by the 
signal light that the barrier has been placed
on it. Since this cannot happen in the con- 
ditions without barriers, it does not appear 
legitimate to count this as wasted time when 
comparing the two conditions. Similarly, 
waiting by a closed barrier is not possible in 
the one case. Such waiting may imply an 
insistence on taking the shorter pathway, or
it may reflect stupidity, or it may be a 
realistic solution since, if the other subject 
removes the barrier when he reaches his goal, 
less time will be lost by waiting than by 
going back to the longer pathway. It would 
seem that waiting by a barrier should, also, 
be excluded when comparing the conditions 
with and without barriers. 


The only measure that reflects events that
can occur equally well in both conditions is
the amount of time they spend together
attempting to go through the shorter pathway
and consequently blocking each other. This
time is termed “time lost in standoffs.” From
the data available, it is possible to estimate
it very closely.


The calculation was made in the following
manner. For each trial it was first determined which
player reached the goal first and, in the cases
where barriers were present, had not waited by a
closed barrier while the other was progressing to
his goal. If the first subject to reach the goal had
waited by a barrier, then the other subject was
used to determine the time lost in standoffs. In
no case had both players lost time by waiting by
a closed barrier. From the time this identified
subject spent in reaching his goal, all essential
travel time was subtracted. This time was 34 units
if he reached the goal on the shorter pathway and
110 units if he met the other subject and then took
the longer path to the goal (17 moves until the
subjects met in the center of the common section,
17 returning to the start position, and 76 for the
longer path). Zero time lost was given to those
cases where the subjects did not meet but either
took the longer path immediately or started on the
shorter one and then changed to the longer path
before meeting the other subject. (The latter only
occurred after the signal light had indicated that a
barrier was blocking the pathway.) The time lost by
the one subject was then doubled in order to arrive
at the total time lost in standoffs by the two
subjects. The resulting index, then, is basically the
time spent by both subjects actively insisting by
their presence upon going first through the common
section of the shorter pathway.
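
The calculation can be sketched as follows. This is a minimal illustration for a single trial, assuming the identified subject has already been selected by the rule above; the function and argument names are hypothetical, and only the constants 34, 17, and 76 come from the text.

```python
# Sketch of the "time lost in standoffs" index for one trial.
SHORT_PATH = 34   # moves needed on the shorter path
LONG_PATH = 76    # moves needed on the longer path
MEET_POINT = 17   # moves until the subjects meet in the center of the common section


def essential_travel_time(finished_on_long_path: bool) -> int:
    """Unavoidable travel time for a subject who met the other player."""
    if finished_on_long_path:
        # 17 moves in, 17 moves back to the start, then 76 on the longer path.
        return MEET_POINT + MEET_POINT + LONG_PATH   # 110 units
    return SHORT_PATH                                # 34 units


def time_lost_in_standoffs(time_to_goal: int, met_other: bool,
                           finished_on_long_path: bool) -> int:
    """Total time lost in standoffs by both subjects on one trial."""
    if not met_other:
        return 0  # the subjects never blocked each other
    lost_by_one = time_to_goal - essential_travel_time(finished_on_long_path)
    return 2 * lost_by_one  # doubled to cover both subjects


print(time_lost_in_standoffs(50, True, False))   # 32
print(time_lost_in_standoffs(130, True, True))   # 40
print(time_lost_in_standoffs(76, False, True))   # 0
```

Because waiting by a closed barrier and withdrawing after a signal light never enter the computation, the index is defined in the same way for the conditions with and without barriers.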


Separate distributions by sex were made
of the total time lost in standoffs in each


TABLE 2

MEAN TIME LOST IN STANDOFFS FOR EACH CONDITION
BY SEX AND BY FIRST AND SECOND HALVES

(Experiment I)

Condition and sex                First half
No Threat, males                     452.33
No Threat, females                   137.67
Barriers, males                      308.67
Barriers, females                    129.67
Shock, males                         364.67
Shock, females                       301.33
Shock and Barriers, males            427.00
Shock and Barriers, females          140.00
condition for the first 10 trials, the second 10,
and the total 20. Kolmogorov-Smirnov tests 
showed no significant sex differences for any of 
the four experimental conditions on the three 
measures. When the conditions were com- 
bined, however, it was found that the females 
had less total time lost in standoffs when the
barriers were present than did the males. The 
difference between the distributions was sig- 
nificant at the 1% level. The females also had 
less time lost in standoffs for the barriers 
conditions than when no barriers were present 
(significant at the 1% level). There were no
significant sex differences related to the shock
variable.

An analysis of variance, including a trend 
analysis, was then made for the “time lost in 
standoffs” measure. Table 2 presents the 
means of the 16 cells in the analysis. From 
the analysis of variance it appeared that 
there were no significant differences due to 
the barriers, shock, sex, or any of the interactions.
Neither was there a significant difference
between the first half and the second
half of the trials. Sex appeared to be the 
most important single factor with its inter- 
action with the barriers and the halves (and 
the triple interaction of these) most closely 
approaching significance. This is due to the 
females having lost less time in standoffs than 
the males in all comparisons except for the sec- 
ond half of the two conditions without bar- 
riers. It is this effect that accounts for the 
two differences observed by means of the 
Kolmogorov-Smirnov test. 

Since the “time lost in standoffs” does 
not produce significant differences in the 
barriers versus no barriers comparison whereas 
the total joint payoff measure does, the 
question may be raised as to the relative 
reliability of the two measures. The time lost 
in standoffs measure eliminates certain com- 
ponents of total time lost (those not com- 
parable between conditions) so it could be 
considered as a shortened version of the joint 
payoff measure. The odd-even reliabilities 
(odd trials versus even trials) were obtained

for both measures separately for the con- 
ditions with and without barriers. The reli- 
ability of the joint payoff measure for the 
without barriers conditions is .29 while the 
corresponding figure for the time lost in 


standoffs measure is .27. When the barriers 
are present the reliabilities are .87 and .75, 
respectively. As would be expected from con- 
sidering one measure to be a shortened form 
of the other, its reliabilities are somewhat 
lower. However, the reliabilities are suffi- 
ciently similar so that it seems doubtful the 
different patterns of results yielded by the 
two measures are attributable to their dif- 
ferent reliabilities. 
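
An odd-even reliability of this kind can be sketched as a correlation, across groups, between the sum over odd-numbered trials and the sum over even-numbered trials. The sketch below is illustrative only: the text does not state which correlation coefficient was used or whether any correction for test length was applied, so the Pearson coefficient and the function names are assumptions.

```python
# Sketch of an odd-even (split-half) reliability across bargaining groups.
# Assumption: a plain Pearson correlation with no correction for length.
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def odd_even_reliability(scores_by_group):
    """scores_by_group: one list of per-trial scores for each group."""
    odd = [sum(trials[0::2]) for trials in scores_by_group]    # trials 1, 3, 5, ...
    even = [sum(trials[1::2]) for trials in scores_by_group]   # trials 2, 4, 6, ...
    return pearson_r(odd, even)


# Hypothetical per-trial scores for three groups (not data from the experiment).
example = [[1, 1, 2, 2], [2, 2, 3, 3], [5, 5, 1, 1]]
print(round(odd_even_reliability(example), 2))   # 1.0
```

The same function can be applied to either measure, since both are recorded trial by trial for each group.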


EXPERIMENT II 
Method 


This experiment used a 2 X 3 experimental design. 
On the three-celled axis were the conditions of No 
Threat, Shock-Coercion (electric shock available and 
interpreted as a means of attempting to control 
the other subject), and Shock-Threat (electric shock 
available as an indication of the intention to do 
something unpleasant to the other subject later 
if he did not do as requested now). The other 
variable was the supposed group membership of 
the other subject. In half of the conditions both 
subjects were told that the other was a member 
of a group generally considered superior to his 
own and that the other agreed with this evaluation 
(Superior Group condition). In the other half of 
the conditions, the subjects were told only that 
they were from the same group (Own Group con- 
dition). Ten groups were run in each of the 
resulting six cells. 

One hundred twenty males, with a mean age of 
20.3, taking one of three psychology courses in the 
General College at the University of Minnesota were 
used as subjects. The General College offers a
2-year, terminal Associate of Arts degree. An extremely
wide range of all abilities is found in the
college although the scholastic aptitude of these 
students is generally lower than that of the typical 
“college sophomore.” For the most part the students 
are those (a) who are not accepted into the other 
colleges at the university, and (b) who have been 
dropped by the other colleges. 

From the nature of the design it was necessary 
in the Superior Group conditions for both subjects 
to believe that the other subject regarded himself 
as superior. This false perception was induced by 
making use of the general belief held by GC
students that the Arts college students consider 
themselves superior to GC students. (That this 
perception does exist was demonstrated by a question- 
naire administered to the GC students which yielded 
their beliefs about the attitudes of students in other 
colleges toward GC students.) Before the experimental
session began, each subject was told that
the other subject was in the Arts college and had 
a negative stereotype of GC students. They were 
told this individually and casually, as though it
did not matter in the experiment but was simply 
something the experimenter thought they should 
know. This deception was conceived of as the
laboratory parallel of the situation that arises 
whenever an individual believes, correctly or in- 
correctly, that the other party to an interaction 
believes himself superior and therefore worthy of 
deferential behavior, and the two do not agree 
on this. 

The experimental situation and procedures were 
the same in this experiment as in the first, with the 
following exceptions. In the Shock-Coercion con- 
ditions the subjects indicated their desire to deliver 
a shock (25 volts) to the other subject by pointing 
to a paper with the word “shock” on it. The shocks 
were interpreted as a means of trying to control 
the other subject’s behavior in order to get him to 
do what one wanted him to do. It was implied that 
compliance would indicate he could not “take it.” 
In the Shock-Threat condition, the subjects pointed 
to a paper with the word "threat" on it and the 
shocks were 15 volts. The subjects were told that 
they would later go into a different room where 
they would have ample opportunities to do and 
say unpleasant things to the other subject if they 
so wished. The mild shock indicated the intention of 
doing “bad things” if the other did not do as 
desired. The nature of the later interaction was 
left vague. 

Due to practical considerations of data collection 
in this experiment, it was necessary to establish 
an arbitrary criterion to which groups were run 
if they reached a position in which neither would 
yield. If a pair had a standoff of 860 possible 
moves (somewhat over 10 minutes) on a given game, 
it was discontinued and after five such games, the 
experimental trials for the pair were terminated. 


Results 


Nine groups had five stalemated trials and 
were unable to complete all 20 games. Eight 
of the nine groups are in the Superior Group 
condition. The stalemated games occurred no 
more frequently in one shock or threat con- 
dition than another. 


TABLE 3

MEAN TOTAL JOINT PAYOFF FOR EACH CONDITION BY
FIRST AND SECOND HALVES

(Experiment II)

Condition                         First half    Second half
No Threat, own group                  -464.0        -2168.6
No Threat, superior group            -3119.1        -3302.3
Shock-coercion, own group            -1271.0         -794.0
Shock-coercion, superior group       -6400.7        -6472.1
Shock-threat, own group              -1010.2        -1189.1
Shock-threat, superior group         -2150.7        -3831.5


An analysis of variance with trend analysis
was made for all of the data using the total
joint payoff for each group.3 Since it is desirable
for the computation of the analysis to
have equal numbers of groups in each cell,
groups unable to complete 20 games were
kept in the analysis by assuming that the
unplayed games would also have been stalemated
ones. This appears to be the most
appropriate way to handle these groups since
they had had so much difficulty on the
games they did play. In all such cases, at
least the last four games had been stalemated.
This means that the subjects had sat for over
40 minutes with neither one willing to allow
the other to proceed first through the shorter
pathway.

The means for each of the 12 cells upon
which the analysis of variance was made are
presented in Table 3. From the analysis, the
only variable which is significant (F=6 
p < .05, df = 1/54) is the description of
the other subject as one who considers himself
superior. This leads to significantly poorer
outcomes for both parties than does the description
of the other as an “equal.” The
analysis shows no change from the first half
of the trials to the second half, no significant
effects for the presence versus absence of
the shocks, or for any of the interactions.

Median tests were made over all conditions
for the distributions of total joint profit (or
loss) on the first 10, second 10, and total 20
games. In all cases the conditions where
shocks were available were poorer than the No
Threat conditions. When the four shock conditions
were compared with the two No
Threat conditions on the first 10 trials, the
median test yielded a chi square, corrected for
continuity, of 4.44 (p < .05). This indicates
some deleterious effect on the early stages
of bargaining attributable to the presence of
the threats but this effect is not very large.
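
The form of such a median test can be sketched as a chi square for a 2 X 2 table with the usual correction for continuity. The cell counts below are hypothetical and are chosen only to show the arithmetic; they are not the counts from this experiment and are not intended to reproduce the value of 4.44.

```python
# Sketch of a two-sample median test: chi square for a 2 X 2 table with the
# correction for continuity. All counts used here are hypothetical.
def median_test_chi_square(a: int, b: int, c: int, d: int) -> float:
    """
    a, b: groups in the first set of conditions above / at or below the combined median
    c, d: groups in the second set of conditions above / at or below the combined median
    """
    n = a + b + c + d
    # Continuity correction: reduce |ad - bc| by n/2, but never below zero.
    corrected = max(0.0, abs(a * d - b * c) - n / 2)
    return n * corrected ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Hypothetical split of 40 shock groups and 20 No Threat groups at the median.
chi_square = median_test_chi_square(15, 25, 15, 5)
print(round(chi_square, 3))   # 6.075
```

With one degree of freedom, a corrected chi square above 3.84 is significant at the .05 level, which is the criterion the reported value of 4.44 meets.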

Further evidence about the effect of 
threats is provided by the data from the 


3 “Total joint payoff” is appropriate since the
barriers are not involved in this experiment. In the
first experiment the rank-order correlations between
the “total joint payoff” and the “time lost in
standoffs” measures ranged from .98 to 1.00 for the
four conditions without barriers (No Threat and
Shock for males and females).
experimental questionnaire. In comparison to
the subjects in the No Threat conditions, the 
subjects in those conditions where shock was 
available stated that the situation was not 
one where a person’s self-esteem, bravery, or 
self-confidence was at stake and/or that per- 
formance here did not reflect a person’s 
toughness and determination (χ² = 8.19,
p < .01); that they were not influenced by
what the other did (χ² = 8.97, p < .01); and
that they were not at all afraid of electric
shock (χ² = 11.91, p < .01). Although it is
true that the subjects had just been exposed to 
the shock and they may have found that 
shocks are not as unpleasant as they had 
anticipated, in conjunction with the results 
from the other questions, it would seem 
simpler to hypothesize that the subjects in the 
several threat conditions were generally defensive
about the experiment and their performance.


Discussion 


From the results of the first experiment, it 
seems that no firm conclusions can be drawn 
concerning the relative degrees of competitive- 
ness due to the presence or absence of the 
barriers. The present results are consistent
with the conclusion that competition was 
moderately high in both conditions. On the 
other hand, it may be questioned whether 
the subjects are competitive in either con- 
dition. This is because the barriers may be 
used for reasons irrelevant to competitiveness. 
Such a reason could be “tricking” where one 
subject waits near the start position, then 
places his barrier up, and immediately takes 
the longer pathway. This may or may not 
show an increase in competitiveness. It may 
simply be a way of relieving boredom with 
the game through use of an interesting kind 
of strategy or maneuver possible within it. A 
different reason is suggested by the fact that 
the addition of the barriers in this situation 
greatly increases the complexity of the de- 
cisions each subject must make. It makes 
him more interdependent with the other sub- 
ject in the sense that it is more important 
and more difficult to anticipate the other’s 
actions. If this is true, a subject’s use of
his barrier may represent an attempt to 
resolve the complicated situation by escaping 


from the interdependency. These examples 
indicate how the use of the barriers may 
reflect escape from boredom on the one hand 
or avoidance of complexity and unpredictable 
interaction on the other. Thus, the presence of 
the barriers and use of the longer pathway 
may yield a reduction in joint outcomes with- 
out there being any significant degree of 
competitiveness. 

Although no conclusive statements can be 
made concerning how the degree of com- 
petitiveness is affected by the barriers, the 
present experimenter is inclined to believe 
that the level of competitiveness is relatively 
low in all conditions in the first experiment 
and that the barriers do not cause an in- 
crease in competitiveness. In the second ex- 
periment, the subjects were given information 
calculated to induce them to interpret the 
game more in terms of self-esteem, and when 
this was done the outcome of the bargaining 
was poorer for both of them.

It was hypothesized that the presentation 
of an unacceptable attitude of superiority on 
the part of the other person would lead to 
poorer outcomes for the bargainers. This is
based upon Deutsch and Krauss' second as- 
sumption about the effect of threat on bar- 
gaining, and the second experiment clearly 
supports this hypothesis. 

There would appear to be two possible, and 
different, reasons why the presence of the 
shock did not have the predicted detrimental 
effect on the bargaining. The first is sug- 
gested by the questionnaire data which in- 
dicate that the subjects became "defensive" 
about the situation when shocks were available.
The subject may be resisting being placed
arbitrarily and involuntarily into a situation
in which he shows whether or not he can
“take it.” If this is the correct interpretation, 
it raises a serious question about the limita-
tions of laboratory experimentation on threats.

The second possible reason why the pres- 
ence of the shock did not have the predicted 
effect (and this can apply to the effects of 
the barriers as well) has to do with the mean- 
ings their use can contribute to this game. 
The addition of the shock (whether coercive 
or as a threat for the future) presents no 
more information to a subject than simply 
being blocked by one who persists in refusing 
to move back. Both mean “I want to go
through first. You back up.” These are by far 
the most common interpretations made of the 
use of the shocks by the subjects according to 
their questionnaire answers. This same in- 
formation seems to be equally well com- 
municated simply by refusals to back up when 
the two meet in the common pathway. 

The present suggestion is that the sub- 
jects can communicate just as much about 
their intentions and desires (and hence, 
develop a competitive interaction) by their 
moves in the game as they can by means of 
the “threats” provided by Deutsch and 
Krauss’ barriers or by the shocks in the pres- 
ent experiment. Inasmuch as the two subjects 
can, in a sense, threaten, attempt to coerce, 
and offer resistance by means of the moves 
intrinsic to the one-way path game, it seems 
doubtful that the so-called “threats” add 


much to the “vocabulary of conflict” available 
to the players. It is a question for further 
research as to whether the addition of threats
to other types of interdependent situations 
may not have a greater effect on the inter- 
action there. 

DIMENSIONS OF RECOGNITION OF EXPRESSION


NICO H. FRIJDA AND ELS PHILIPSZOON
University of Amsterdam 


Ratings of 30 facial expression photographs by 12 Ss on 22 bipolar scales were 
intercorrelated and subjected to factorial analysis. 4 factors of expressive mean- 
ing were obtained; these were labeled pleasantness-unpleasantness, naturalness 
and submission-artificiality and condescension, intensity of expression-control 
of expression, and attentional activity-disinterest. Correlating these factors with 
measures of the facial features resulted in meaningful relationships of each 
expressive meaning factor with 1 or more of the facial feature measures. 
Results were compared with those from other studies and discussed in relation 
to a theory of emotion and of its recognition. 


The results of two major studies of dimen- 
sions in the recognition of facial expressions 
(Osgood, 1955; Schlosberg, 1954) stand in 
fairly close agreement. Some differences in 
interpretation of the dimensions exist, how- 
ever, particularly in connection with the 
dimension called “attention-rejection” by 
Schlosberg and “control” by Osgood. In order 
to clarify the nature of the dimensions, and 
to make a first step towards a psychophysics 
of expressive meaning, the present study tries 
to obtain such dimensions by a somewhat 
different method, and to investigate their rela- 
tionship with the facial features. Study I of 
this report presents the analysis of dimensions 
in the recognition of facial expression. Study 
II presents the analysis of the relationship 
with facial features. 


STUDY I
Method 


Material. Thirty photographs were used, showing
facial expressions posed by a professional actress. 
The photographs used were selected from a series 
of about 150 photographs, according to the following 
criteria: (a) naturalness of the expression; (b) in-
clusion of the more subtle and subdued expressions,
such as they occur most frequently in daily life; (c) 
representativeness as far as variations of the more 
important facial features are concerned; (d) some 
representativeness as far as expressive meaning is 
concerned, which was approached by scanning a 
categorized list of emotion words and looking for a 
photograph which might represent each category. 
All photographs showed head and shoulders only 
and were presented to the subject as about 3.5-inch
× 5-inch glossy prints.

Procedure. The subject was asked to rate each 
expression on 27 bipolar seven-point scales, 22 of
which were used in the analysis. The list of scales was
composed in the following manner. A list of about
300 terms, used spontaneously by 30 subjects for
interpreting similar facial expressions, and divided 
into 29 categories of more or less closely related 
terms by a group of judges, was available from a 
different experiment (Frijda & Van de Geer, 1961). 
A term was selected from each category and a 
suitable opposite chosen from the list. From cate-
gories containing many terms, more than one term
was chosen. The resulting list of 63 pairs of opposites 
was made into a list of seven-point scales and sub- 
mitted to four subjects, who rated 10 photographs, 
not used in the experiment proper, on each scale. 
Scales which appeared to have high intercorrelations, 
as far as could be gathered from the raw data, were 
grouped together, and one of them was selected. 
This reduction was deemed necessary in view of the 
time needed for the experiment and for analysis. 
Scales showing very low variance, and those judged 
ambiguous by subjects were discarded. The remain-
ing 27 scales were arranged in a random order; the
position of each polar adjective was also randomly 
determined. Five scales from this list were omitted 
from the final analysis because of low reliability 
and/or of low variance of the averages for the photo- 
graphs. The scales finally used are presented in 
Table 1. Their direction on the subjects' response
sheets was not necessarily as given in that table.

Subjects were treated individually. Photographs 
were handed to them in random order. There was 
no limit on exposure time. 

Subjects. Subjects were six male and six female psy- 
chology students at Amsterdam University. 


Results 



Scores for each scale and photograph were 
averaged over the 12 subjects. Because of the 
sometimes considerable departure from normal 
distribution, rank-order correlations were com- 
puted between the scales, using as scores the 
average scale value for each photograph. A 


1 The correlation matrix has been deposited with
the American Documentation Institute. Order Docu-
ment No. 7289 from ADI Auxiliary Publications
TABLE 1
FACTOR LOADINGS OF SCALES

                                     Unrotated               Rotated
Scale                              I    II   III    IV    I'   II'  III'   IV'     h²
 1. Pleasant-unpleasant          .83  −.44   .08  −.17   .91   .13   .24  −.07   .911
 2. Happy-sad                                                                    .943
 3. Relieved-startled                                                            .827
 4. Quiet-anxious                                                                .965
 5. Friendly-angry                                                               .837
 6. Approach-withdrawal                                                          .825
 7. Eagerness-distaste                                                           .910
 8. Free-oppressed                                                               .990
 9. Warm-cold                                                                    .807
10. Submissive-authoritarian                                                     .871
11. Passive-active                                                               .863
12. Mild-derisive                                                                 41
13. Sleep-tension                                                                .198
14. Hesitant-determined                                                          .468
15. Clear-dull                                                                   .773
16. Abandonment-reserve                                                          .645
17. Admiration-despise                                                           .821
18. Attention-rejection                                                          .838
19. Controlled-uncontrolled                                                      .833
20. Open-closed                                                                  .878
21. Understanding-amazed                                                         .473
22. Natural-artificial                                                           .642
Total variance (in percentages)                                                  80.3


centroid factor analysis was performed, using 
highest column correlations as estimates of 
communalities and graphic rotation for simple 
structure executed on the four factors ex- 
tracted. Factor loadings before and after 
rotation are given in Table 1. 
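
The rank-order correlation step described above (averaging each scale over subjects, then correlating scales across photographs) can be sketched in Python. This is only an illustration under stated assumptions: the function names and the `ratings[subject][photo]` layout are hypothetical, and ties are resolved by averaged ranks, which the text does not specify for this step.

```python
from statistics import mean


def average_ranks(values):
    """Rank values 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman_rho(x, y):
    """Rank-order correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def scale_means(ratings):
    """ratings[subject][photo] -> mean rating per photo (assumed layout)."""
    n_photos = len(ratings[0])
    return [mean(subj[p] for subj in ratings) for p in range(n_photos)]
```

In use, `scale_means` would be applied once per scale, and `spearman_rho` to each pair of the resulting per-photograph averages to fill the correlation matrix.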

The four factors extracted account for 80%
of the total variance. Three of them can be 
labeled without difficulty. Factor I’ clearly 
represents the pleasantness-unpleasantness 
continuum. Scales which obtain the highest 
loadings are pleasant-unpleasant, free-op- 
pressed, happy-sad, eagerness-distaste, friend- 
ly-angry, relieved-startled, and  approach- 
withdrawal, in that order. 

Factor III' seems to be a factor of intensity 
of emotional expression versus control of ex- 
pression. Highest loadings are found with the 
scales controlled-uncontrolled, open-closed, 
quiet-anxious, and warm-cold. Control of ex- 
pression, rather than weakness of emotion, 
seems to be the opposite of intensity here. 


Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


Factor IV' can best be interpreted as a 
factor of attentional activity versus disin- 
terest. Scales with highest loadings are at- 
tention-rejection, passive-active, clear-dull (or 
keen-dull, both clear and keen being transla-
tions of the Dutch helder), and sleep-tension.

Factor II’ is somewhat less clear. Scales 
with highest loadings are mild-derisive, natu- 
ral-artificial, submissive-authoritarian, and ad- 
miration-despise, in that order. Most of these 
refer to social submission versus condescen- 
sion. The scale natural-artificial does not fit 
this pattern, however. 

Reliability of the scales. Three subjects 
repeated the experiment after an interval of 
about 2 weeks. Product-moment correlations 
for the individual scores of the three subjects 
combined range from .26 to .87, with a mean 
of .66 (N = 90). These coefficients are sub- 
stantially lower than those reported by Engen, 
Levy, and Schlosberg (1958); r’s for the P-U,
A-R, and S-T scales are .87, .51, and .74, 
respectively. In their experiment, however, 
the test-retest interval was 45 minutes, anchor- 
ing pictures were used and the test pictures 
carefully selected. 

Average absolute deviations in scale units 
range from .48 to 1.27, with a mean of .82.
These figures are a little, but not much, higher 
than those published by Osgood, Suci, and 
Tannenbaum (1957, pp. 129 ff.) on the use 
of similar seven-point scales at comparable 
intervals. 

Consensus among judges. Kendall’s coeffi- 
cient of concordance (Siegel, 1956) was de- 
termined, for each scale separately, over the 
ranked ratings of all 12 subjects. Mean W is 
.55, with a range from .29 to .77. These figures 
are quite satisfactory, particularly in view of 


the reliabilities just mentioned. Product- 
moment correlation between reliability coeffi- 
cients and Ws is .71 (N = 22). 
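
As an illustration of the consensus measure used here, the following Python sketch computes Kendall's coefficient of concordance W for one scale. It assumes a hypothetical `ratings[judge][photo]` layout and omits any correction for tied ranks, so it should be read as a simplified version of the computation rather than the exact procedure followed.

```python
def average_ranks(values):
    """Rank values 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def kendalls_w(ratings):
    """Kendall's W for m judges rating n items (no tie correction).

    ratings[judge][item] is an assumed layout for illustration.
    """
    m = len(ratings)
    n = len(ratings[0])
    ranked = [average_ranks(row) for row in ratings]
    rank_sums = [sum(r[i] for r in ranked) for i in range(n)]
    mean_sum = m * (n + 1) / 2
    # S: squared deviations of the item rank sums from their mean.
    s = sum((rs - mean_sum) ** 2 for rs in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))
```

W runs from 0 (no agreement among judges) to 1 (identical rankings); in the study it would be computed once per scale over the 12 subjects' rankings of the 30 photographs.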


STUDY II
Method 


The data and photographs from Study I were 
used. Factor scores on each factor were computed
for each photograph and ranked according to
magnitude. These factor scores will be referred to
as Expressive Meaning Scores (EMSs); the factors
will be referred to as Expressive Meaning Factors 
(EMFs). 


TABLE 2

RELATIONSHIPS BETWEEN FACTOR SCORES ON EXPRESSIVE MEANING FACTORS AND FACIAL FEATURES

Feature                                          I'
A. Eyes
    1. Eyes wider than neutral                   —
    2. Eyes narrowed or closed                   —
    3. Upper eyelid drooping                     —
B. Direction of glance
    4. Glance upward                             Pleasant
    5. Glance downward                           Unpleasant
    6. Glance turned sideways,                   —
       head straight forward
C. Forehead
    7. Eyebrows lifted                           —
    8. Frowning                                  Unpleasant***
D. Mouth opening
    9. Mouth open                                Unpleasant
   10. Mouth closed                              —
E. Position of head
   11. Head bent sideways                        Pleasant
   12. Head twisted*                             Pleasant
   13. Head bent forward                         Unpleasant
   14. Head bent backward                        —
   15. Head stretched forward                    —
   16. Head pulled backward                      Unpleasant*
   17. Chin pulled in                            Unpleasant
   18. Neck stretched upward                     —
F. Smiling, laughter
   19. Smiling, laughter                         Pleasant***
G. Corners of mouth
   20. Corners downward                          Unpleasant*
   21. Corners backward                          Unpleasant***
   22. Mouth broadened                           Pleasant
H. Lips
   23. Upper lip pulled up                       Unpleasant
   24. Upper lip arched                          Unpleasant
   25. Lips compressed                           Pleasant
   26. Lips pressed in center                    Pleasant
   27. Lips pursed                               —
I. Inconsistency
   28. Lips parted but teeth                     Unpleasant*
       touching
J. Asymmetry
   29. Asymmetry                                 —


II'             III'               IV'
—               —                  Active**
—               Uncontrolled       Passive
—               —                  Passive
Submissive      —                  Active
—               Controlled         Active
—               Uncontrolled       —
—               Uncontrolled       —
—               Uncontrolled**     —
—               Controlled*        —
Submissive      Uncontrolled       Passive
Derisive        Uncontrolled       —
Submissive      —                  Passive
Submissive      Uncontrolled*      Active
—               —                  Active*
Derisive        Uncontrolled       Active
Derisive        Uncontrolled       Passive
Derisive        Controlled         —
—               —                  Passive
—               Uncontrolled       Passive
Derisive        Uncontrolled**     —
Derisive        Uncontrolled*      Passive
Derisive*       Uncontrolled**     Passive
Submissive      —                  —
Derisive        Controlled         Active
—               Controlled         Active
—               Uncontrolled**     Passive
Derisive        Uncontrolled       Active


/hitney U test, two-tailed, over feature present or absent. € 


$05, Mann Whitney U test, two-tailed, over feature present or absent. 
Oh Mana. Whitney U test, two-tailed, over feature present or absent. 
TABLE 3

CORRELATIONS BETWEEN FACTOR SCORES ON EXPRESSIVE MEANING FACTORS AND COMPOSITE
MEASURES OF FACIAL FEATURES

Measure                            I'    II'   III'   IV'

Defensive mouth posturesᵃ
Approach-avoidance scoreᵇ
Smiling, laughterᶜ
Frowningᵈ
Tenseness scoreᵉ
Subtlety scoreᶠ
Expressiveness scoreᵍ
Muscular activity scoreʰ
Mouth open-closedⁱ
Eyes open-closedʲ                 .02   .13  −.02
Attentional activity scoreᵏ       .14  −.03   .04  −.70

Note.—Rank-order correlations. With N = 30: rho = .37 for p = .05,
two-tailed; rho = .47 for p = .01, two-tailed; rho = .55 for p = .001,
two-tailed.

Construction of scores is defined in Footnotes a-k.

ᵃ Withdrawal movements of mouth and lips: Features 20,
21, 23, and 24 from Table 2, weighted as to class.

ᵇ Turning towards or forwards versus turning away or back-
wards: Features 4 and 15, + score; Features 5, 13, 16, and 17,
— score; weighted as to class.

ᶜ Feature 19, full rank order.

ᵈ Feature 8, full rank order.

ᵉ Unnatural movements: Features 12, 28, and 29; contrac-
tive-coartative movements: Features 8, 17, 25, and 26 (22 and
27 if 10). Extra score: 8, if 1 and not 7 (frowning with widened
eyes without lifting eyebrows). Weighted as to class; 26
considered Class 1 of 25 + 26.

ᶠ Face only locally active. Mouth region blank: neither of
Features 20-27, or only 24 and 27, Class 1; forehead region
blank: neither 7 nor 8. Presence of features in relatively blank
face, in mouth region: Features 19-27 if not 7 or 8 and if no
defensive head movements (15-18); in glance or head position:
4, 6, 11, and 13 if not 7 or 8 and not 15-18. Small, localized
expressive indications: 24 and 26 if 9.

ᵍ All features, weighted as to class, with only glance sideways
(not mentioned in Table 2) excepted.

ʰ All features, but hypotonic features (3 and 11) negative
instead of positive score (Features 13 and 20 and glance side-
ways not counted because ambiguous as to activity).

ⁱ Full rank order over Features 9 and 10.

ʲ Full rank order over Features 1 and 2 and middle group.

ᵏ Attentional Features 1, 4, 6, 14, 15, and 27, + score; frown-
ing with eyes wide but without lifting eyebrows (1 if 8 and not
7), extra + score. Disinterest Features 2, 3, 5, and 11, — score
(13 not counted because ambiguous in this respect). Only 1
and 2 weighted.


Measures of the facial features were obtained in 
the following manner. A list of facial features was 
made, divided into 10 categories. The list, slightly 
simplified and with "neutral" alternatives in each 
category omitted, is reproduced in Table 2. Three 
judges were asked to check, for each photograph, 
at least one of the alternatives in each category. 
Up to seven more judges were asked to classify the 
photographs on those categories where consistent 
results were not yet obtained (this applied mainly 
to Categories E, G, and H). Anchoring pictures 
were used to define the different features. The final 
decision about the presence of a given feature in 
a given photograph was made on the basis of at 
least a two-thirds majority opinion.

After some alternative within each category was 
checked off for each photograph in the way de- 
scribed, all photographs considered to possess a 
given feature (with the exception of the “neutral” 
alternatives) were ranked by three other judges as 


to the strength of that feature. The average rankings 
were reranked. Finally, if more than three photo-
graphs were considered to possess a given feature,
rankings were divided into two or three equal or
about equal classes, and these classes were given weights of
1, 2, or 3. Rankings of Categories A, 1 and 2, and
D were made over all 30 photographs.

Agreement among judges varied greatly from
category to category, judgment apparently being the 
most difficult for the various positions of the 
head and of the lips. Rank-order correlations be- 
tween the judges’ rankings averaged .89, .83, and 
.74 for the three interjudge comparisons (N = 20
in each case). 

Significance of differences between factor scores
of photographs with and without a given feature 
was determined by means of the Mann-Whitney U 
test (Siegel, 1956, p. 116). All correlations are Spear- 
man rank-order correlations, corrected for ties 
(Siegel, 1956, p. 202). 
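
The feature-present versus feature-absent comparison can be illustrated with a short Python sketch of the Mann-Whitney U statistic. The function names and inputs are hypothetical, and only the statistic is computed; the significance level would still have to be read from a table of critical values or obtained from a statistics package.

```python
def average_ranks(values):
    """Rank values 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def mann_whitney_u(present, absent):
    """U for factor scores of photos with vs. without a feature.

    Returns the smaller of the two U values, the one usually
    compared with tabled critical values.
    """
    combined = list(present) + list(absent)
    ranks = average_ranks(combined)
    n1, n2 = len(present), len(absent)
    r1 = sum(ranks[:n1])            # rank sum of the "present" group
    u1 = r1 - n1 * (n1 + 1) / 2
    u2 = n1 * n2 - u1
    return min(u1, u2)
```

A U of 0 means the two groups of factor scores do not overlap at all; larger values indicate more overlap.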


Results 


Results are presented in Table 2. The direc-
tion of a relationship is indicated by a key
word, referring to that pole of the EMF on
which most photographs with the given feature
were found.

Since many of the features were present in
only a few photographs, and since combination
of features seemed more meaningful anyway,
composite measures were constructed out of
the elementary ones. These were obtained by
counting the number of features, present in
each photograph, which seemed relevant for
a given measure, and subtracting the number
of features of an opposite nature (e.g., avoid-
ance features from approach features); where
appropriate, features were given double or
triple weight according to their strength class.
The composition of the composite measures
is specified in Table 3. Most of them—par-
ticularly Numbers 1, 3, 4, 7, 8, 9, and 10—are
straightforward derivations from the elemen-
tary measures. Others—Numbers 2, 5, 6, and
11—were constructed in view of the inter-
pretation of the EMFs.
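
The construction of a composite measure can be sketched as follows, taking the approach-avoidance score as the example (Features 4 and 15 scored positively; Features 5, 13, 16, and 17 scored negatively, per the footnote to Table 3). The data layout, a mapping from feature number to strength-class weight, and the example photograph are assumptions made only for illustration.

```python
# Feature numbers follow Table 2; the grouping follows footnote b of Table 3.
APPROACH = (4, 15)
AVOIDANCE = (5, 13, 16, 17)


def composite_score(photo_features, positive, negative=()):
    """Weighted count of relevant features minus opposite features.

    photo_features maps a feature number to its strength-class weight
    (1, 2, or 3); features absent from the photograph are simply missing.
    """
    plus = sum(photo_features.get(f, 0) for f in positive)
    minus = sum(photo_features.get(f, 0) for f in negative)
    return plus - minus


# Hypothetical photograph: glance upward (class 2), head bent forward
# (class 1), smiling (class 3, not relevant to this measure).
example_photo = {4: 2, 13: 1, 19: 3}
example_score = composite_score(example_photo, APPROACH, AVOIDANCE)
```

Here `example_score` is 2 − 1 = 1: one approach feature of class 2 offset by one avoidance feature of class 1.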

Rank-order correlations between these meas-
ures and the EMSs are given in Table 3.
Each EMF is correlated with one or more of
the composite facial feature measures. Most
facial feature measures are correlated with
only one of the EMFs.

TABLE 4
INTERCORRELATIONS AMONG COMPOSITE FACIAL FEATURE MEASURES

Measure                         1    2    3    4    5    6    7    8    9   10   11

 1. Defensive mouth                −44  −38   75   49  −30   60   50   45  −27  −25
 2. Approach-avoidance                   15  −53  −33   16  −21  −13  −10   49   57
 3. Smiling, laughter                        −61   01   08   08   03   15  −08  −23
 4. Frowning                                       75  −37   57   55   58  −28  −18
 5. Tenseness score                                    −63   64   72   17  −35  −11
 6. Subtlety score                                          −59  −62  −20   22   27
 7. Expressiveness score                                          91   68  −13  −05
 8. Muscular activity                                                  60   05   21
 9. Mouth open-closed                                                       02  −06
10. Eyes open-closed                                                             90
11. Attentional activity

Note.—Rank-order correlations.


Factor I’, pleasantness-unpleasantness, is,
as may be expected, positively related to
smiling and laughter, and inversely to frown-
ing. It is also inversely related to defensive
reactions of the mouth and to other avoidance
reactions.

Factor II’ is positively correlated with
subtlety of the expression, and negatively 
with tenseness and with amount of muscular 
activity. 

Factor III’, intensity of expression versus 
control of expression, is correlated with facial 
expressiveness, i.e., with number and degree 
of features present. The control end of this 
factor is, moreover, correlated with closure 
of the mouth and compressing or pursing the 
lips. These latter features, together with glanc- 
ing sideways with head straight, are the only 
features which run counter to the general 
relationship between control and absence of 
expressive features. 


Factor IV', attentional activity, is correlated
with opening of the eyes. The correlation is
increased somewhat by adding other attentive
features, such as glancing upwards, sharp
focusing and stretching the neck, and sub-
tracting signs of disinterest, such as glance
downward and head bent sideways.

The composite measures of facial features
are not independent of each other. Inter-
correlations are given in Table 4, and the
results of a centroid factor analysis² in Table
5. Factor loadings on the first four factors
after graphic rotation for simple structure
follow closely the pattern of correlations with
the EMSs. In interpreting the intercorrela-
tions, it should be noticed that some of the
measures (1 and 3, 4 and 3, 5 and 6) hap-
pened to be mutually exclusive. Therefore,


2 This analysis was performed by Dini Ouweneel,
to whom the authors express their gratitude.

TABLE 5 
FACTOR LOADINGS OF FACIAL FEATURE MEASURES
Unrotated Rotated 
Measure n 
I II Il 

1, Defensive mouth 48|- 43| —.28 

2. Approach-avoidance —455.|....41 .26 

3. Smiling, laughter —.16 | —.13 74 

4, Frowning 83 | .29 | —.44 

5. Tenseness score 74| 16 15 

6, Subtlety score —.59 | —.10 | —.36 

7. Expressiveness score .8| .40 | .40 

8. Muscular activity .68| .58 40 

9. Mouth open-closed «50115521 1317. 

10. Eyes open-closed —.51 46 | —.11 

11? Attentional CM —43| .84|—.15 

Total variance (in 
percentages) 39.0 | 19.7 | 12.9 
combining, for instance, Measures 1 and 3
into a single measure results in a rank-order 
correlation of .86 with Factor I’. 


DISCUSSION 


Factor I’, pleasantness-unpleasantness, is 
identical to the pleasantness dimension of both 
Schlosberg (1952, 1954) and Osgood (1955). 
Factor III’, control of expression versus in- 
tensity of expression, is clearly the counter- 
part of Schlosberg’s sleep-tension dimension 
and Osgood’s intensity factor. There are some 
differences, however. Factor III’ does not 
straightforwardly represent level of activa- 
tion. Correlation with muscular activity is 
somewhat smaller than with expressiveness, i.e.,
amount of expressive movement regardless of 
its origin in activity or in the passively sub- 
mitted pull of gravity. The lower extreme of 
the factor is represented, not by a sleepy face, 
but by a relatively bland but somewhat tight- 
lipped and covertly attentive expression. 
Factor rotations which might maximize the 
correlation with sheer amount of muscular 
activity were tried, but seemed unsuccessful. 
Correlations of the single scale sleep-tension 
with muscular involvement and expressiveness
were .65 and .10, as against .69 and .79 for
the scale controlled-uncontrolled and .77 and
.86 for Factor III’. For the scale passive-active
these figures are .47 and .29. 

The conclusion seems valid that, as far as 
judgments of expressive meaning are con- 
cerned, level of activation is a less funda- 
mental dimension than intensity of expression 
versus control. As a matter of fact, high de- 
grees of inactivation, such as sleepiness or 
exhaustion are much more striking and re- 
ferable to a cause than the moderately active 
and controlled neutral states. 

Factor IV', attentional activity, resembles 
Schlosberg's (1952) attention-rejection dimen- 
sion, and this scale indeed has the highest 
loading on this factor. However, the negative 
pole seems to be disinterest rather than re- 
jection, considering the high loading of the 
passive-active scale, and the low correlation 
with defensive mouth postures, closure of the 
mouth, and the approach-avoidance score. The 
absence of correlation with closure of the 
mouth contradicts the statement by Wood-
worth and Schlosberg (1954) about:
The opposite end, Rejection . . . in which the eyes,
lips and nostrils are forcibly closed, as if to shut out
stimulation (p. 126).


It is to be noted that opening or closure of 
eyes and mouth are uncorrelated in the present 
material. The difference of the present finding 
with the analysis of Schlosberg may well be 
due to his use of anchor-pictures, which 
makes his results ambiguous: It is not clear 
whether his subjects rated facial features 
along the open-closed continuum, or expres- 
sive meaning. 

Factor II’, naturalness and submission 
versus artificiality and condescension, is not 
found by either Schlosberg (1954) or Os- 
good (1955). The existence of a dimension of 
social acceptance-social rejection in interpret- 
ing facial expression seems reasonable enough. 
Why it should be coupled to the opposition 
naturalness-artificiality presents somewhat of 
a problem. It may be that the rather intel- 
lectual negative attitudes such as derision 
and despising are by their very nature more 
intentionally produced and intended to in- 
fluence other people’s feelings or behavior, in 
contrast to most meek, mild, and submissive 
attitudes. The facial features found to go
with this factor do not give information on
this point. In view of the fact that these 
facial feature measures had to be constructed 
ad hoc, further speculation about this factor 
has to await confirmation from other data. 

On the whole the present results tend to 
support a hypothesis on the process of recogni- 
tion of emotion, presented by the senior au- 
thor (Frijda, 1956, 1958). According to this 
hypothesis, recognition of emotion would con- 
sist of recognition of general attitudes and 
activity patterns, based upon expression, plus 
their specification in terms of emotions proper
or of surmised inner experiences, based upon 
situational cues. Recognition of these general 
attitudes would be fairly easy to explain, 
since their meaning closely corresponds to 
the visible features: unpleasantness to avoid-
ance and defensive movements; attention to
use of the eyes and their support by in- 
vestigatory activity of the head, etc.; control 
to limited nonfunctional activity; lack of 
control by amount of visible activity, and the 
like.

The present findings also have some implica-
tions for the theory of emotion. First, it is
suggested that pleasantness-unpleasantness 
may be defined not only in terms of hedonic 
feeling-tone, but also in behavioral terms, as 
approach-avoidance, at least in part. While 
this is common knowledge, it seems to apply 
to expressive behavior too. Second, the im- 
portance of the factor of control of emotional 
expression predicts that only moderate cor- 
relations will obtain between muscular activity 
and, for instance, autonomic indices of emo- 
tional activation. Third, the emergence of a 
submission-condescension factor suggests the 
existence of specifically social emotions or 
attitudes. Finally, the evidence for independ-
ence of attention and rejection, as suggested
by the correlations of defensive and avoidance 
movements on the one hand, and activity of 
the eyes on the other, may reflect the in- 
dependence of cognitive and conative activity, 
or of curiosity and other, more vital, drives. 
Interpreting the attentional activity factor 
this way may lend support to the suggestion 
by Triandis and Lambert (1958) as to the 
similarity between this factor (considered 
equivalent to Osgood’s, 1955, control factor) 
and Osgood’s potency dimension, in the sense 
of potency of the cause of emotion. More 
important still is the similarity of this factor, 
in respect to meaning as well as facial pattern, 


to the orienting reflex as described by Russian 
investigators (Razran, 1961). 

THE RETENTION AND GENERALIZATION OF OPERANT
CONDITIONING EFFECTS IN AN
INTERVIEW SITUATION ¹


RUDOLF H. MOOS ²
In a study of the conditioning of complex verbal habits, each of 30 female
Ss had her established verbal habits assessed in 2 sessions by being asked to
talk about herself. In subsequent sessions, the Ss in the experimental groups
were positively reinforced (by a head nod and an “mm-hm”) for either
independence or affection statements, whereas Control group Ss were rein-
forced every 30 sec. regardless of the content of their speech. The results
showed that: conditioning occurred, i.e., the experimental Ss increased in
their use of the reinforced category; retention also occurred, i.e., the condition-
ing effects were stable over a 24-hr. period; and generalization did not occur.


In the past several years there has been a 
great deal of interest in the operant condi- 
tioning of verbal behavior, and some investi- 
gators have felt that research in this area is 
relevant to furthering our knowledge of psy- 
chotherapeutic processes. 

Recent investigations have successfully
shown that the phenomenon of verbal con-
ditioning can be observed in relatively open-
ended interview situations. Quay (1959)
successfully reinforced both family memories
and nonfamily memories in two groups of
subjects, while Salzinger and Pisoni (1958,
1960), using groups of normal and schizo-
phrenic subjects, demonstrated, using a 10-
minute reinforcement period, that self-referred
affect statements could be reliably reinforced,
and that they increased throughout the 10-
minute period. These authors did not consider
the question of whether verbal conditioning
changes would generalize from the experi-
mental situation to a different test situation.


1 This paper is based on a dissertation submitted
to the Graduate School of the University of Cali-
fornia at Berkeley in partial fulfillment of the
requirements for the PhD degree. The writer wishes
to express his appreciation to Joseph C. Speisman,
ducted, and to Robert C. Tryon and John A.
Starkweather, who served as members of the author's
doctoral research committee. Appreciation is also
expressed to Florence Willner, Ernest Bloomfield,
and Louis Breger, who served as the three co-
experimenters, and to Alfred Bloch, who assisted in
establishing reliabilities.
United States Public Health Service Fellowship at
the University of California School of Medicine and
the Langley Porter Neuropsychiatric Institute and
was supported in part by funds made available from
USPHS Grant 2M-7082.

This paper was read, in part, at the annual meet-
ing of the Western Psychological Association, Seattle,
Washington, June 15, 1961.

2 Now in the Department of Psychiatry, Stanford

Rogers (1960) was interested not only in 
discovering whether an interviewer can pro- 
duce changes in a subject’s self-reference ver- 
balizations by consistent reinforcement, but 
also in whether such reinforcement can alter 
a subject’s concept of himself, as measured by 
personality tests. He used a battery of four 
personality tests, administered prior and sub- 
sequent to conditioning and found that, despite 
successful conditioning of  self-references, 
there were no changes in the tests, indicating 
that the effect of the reinforcing stimuli was 
confined to the conditioning situation. He 
has suggested that the particular psychologi-
cal tests he employed to measure change may
well have been insufficiently sensitive to 
detect small changes. 

The purpose of this study was to discover
whether established complex verbal habits
could be conditioned in free interview situa-
tions, whether this conditioning effect would
be retained, and whether it would generalize 
to a different although similar situation. 


METHOD


Several conditions were adopted in order to make 
the verbal conditioning situation similar to a therapy 
situation and different from verbal conditioning
situations used previously. 

1. The categories which were conditioned rep-
resented complex verbal habits which had been
shown to be subject to the effects of reinforcement 
in actual therapy interviews (Murray, 1956). 

2. The sessions in which the subjects talked about 
themselves were 20 minutes in length. 

3. There were four conditioning sessions spaced
one day apart. Previous investigators have generally 
used only one conditioning session. 

4. A session in which the retention of the effects 
of the conditioning was measured was given 24 
hours after conditioning. In previous studies, reten- 
tion, or extinction periods have followed condition- 
ing, and there has been no effort to measure the 
effects of conditioning over longer time intervals. 

5. A last session, in which the generalization of 
verbal conditioning was measured, was also given 
24 hours after conditioning. 


Design 


Thirty female undergraduate subjects were 
randomly assigned to three groups of 10 subjects each. 
These three groups were: the Independence group 
(Group I), the Affection group (Group A), and 
the Control group (Group C). 

Each of the subjects in each of the three groups 
went through the following eight 20-minute sessions: 
a preconditioning session with a co-experimenter 
(Pre-Co), a preconditioning session with the experi- 
menter (Pre-E), the four conditioning sessions, a 
postconditioning session with the experimenter 
(Post-E), and a postconditioning session with a 
co-experimenter (Post-Co). 

The preconditioning sessions were on the same 
day, 5 days before the first conditioning session, 
which always fell on a Monday. The four con- 
ditioning sessions took place on consecutive days 
with the last one, thus, falling on Thursday. Both 
postconditioning sessions were always the next day. 
It should be noted that the order of the pre- and
postconditioning sessions with the E and the Co-Es
was counterbalanced. Half the subjects saw E and
then Co-E, whereas for the other half this order
was reversed. Table 1 summarizes this design.


Procedure


Before the sessions the subjects were told that:


this study is concerned with how people think 
and feel about themselves. In the sessions I would 
like you to describe your personality characteristics 
as fully as you can. If you wish you might il- 
lustrate what you are like now by telling about 
some experiences or situations and how you re- 
acted to them . . . you will have 20 minutes per 
session and it is essential that you give a broad 
overview of yourself . . . sometimes you will 
find it difficult to think of things to say about 
yourself, but just try and do the best you can. 


TABLE 1
EXPERIMENTAL DESIGN

Sessions (20 minutes each), in order: Pre-Co, Pre-E, four conditioning sessions, Post-E, Post-Co

Group                        Treatment in conditioning sessions
Independence (I) (N = 10)    Independence reinforced
Affection (A) (N = 10)       Affection reinforced
Control (C) (N = 10)         30-second reinforcement

During all pre- and postconditioning sessions E 
and Co-Es listened and said nothing. During the 
four conditioning sessions the three groups were 
reinforced differently by E. The I group was re- 
inforced immediately after each independence ap- 
proach statement by a simultaneous head nod and 
“mm-hm.” The A group was reinforced in the same
manner, except after affection approach statements. 
The C group was reinforced as closely as possible 
to every 30 seconds, regardless of what the subjects 
were talking about. The 30-second interval was 
chosen in order to make the number of reinforce- 
ments given in the control group similar to the 
number in the experimental groups. It is possible that 
reinforcement, per se, regardless of its locus in time, 
might function to increase verbalizations in the 
specific categories measured. The use of the control 
makes possible the differentiation of the effects of 
similar amounts of reinforcements given at different 
times. 

At the conclusion of the experiment an interview 
was held with each subject in which E tried to 
discover whether or not she had been aware of the 
reinforcement procedure. Each subject was asked 
several questions about the experiment and was 
given ample chance to make her own comments 
about it. 


Categories 


The categories were derived from Murray (1956). 
The independence and self-assertion category in- 
cluded expression of needs to be independent, adult, 
mature, assertive, ambitious, competitive; standing 
up for one's rights, defending against attack, saying 
“no” when one has to, expressing an opinion or 
warranted criticism; description of behavior or 
plans to be independent. Some examples are:

“It was here that I started to make decisions
for myself.”

“I can make people do what I want.”

“I'll get an apartment and not live at home.”

“I love work that requires thinking and re-
sponsibility.”

The affection category included direct expression 
of needs for love and affectionate behavior; needs 
for and appreciation of affection, friendship, ac-
ceptance, understanding; feeling of loving, giving
affection, etc., or having entered into an affectionate 


RUDOLF H. MOOS


relationship; doing something positive for or with 
a person; holding someone in high regard, respecting, 
appreciating. Some examples are: 

“I have a very close relationship with my sisters.” 

“I like to try and understand other people's 
feelings.” 

“I am willing to do things for other people.”

“I'm lonely . . . I want someone to love.”

The category was broadened from Murray’s (1956) 
original one in order to increase the operant level 
of the response. Inclusion statements, or state- 
ments about moving toward people in general, not 
necessarily in close affectionate relationships, have 
been included. 

“I'm a person who likes to do things with people.” 

“There are so many nice kids here that I like 
quite well.” 


RESULTS 
Hypotheses 


1. The reinforced categories, Independence 
and Affection, will increase during the four 
conditioning sessions in the experimental 
groups, and not in the control group, i.e.,
conditioning will occur.

2. The reinforced categories, Independence
and Affection, will be used more frequently 
in the postconditioning session with E than 
they were in the preconditioning session with 
E, i.e., retention will occur. 

3. The reinforced categories, Independence 
and Affection, will be used more frequently in 
the postconditioning session with the Co-E 
than they were in the preconditioning session 
with the Co-E, i.e., generalization will occur. 


Scoring and Reliability 


All sessions were tape recorded and were
individually scored for number of speech units,
number of independence units, and number of
affection units. The procedures used were
essentially the same as those used by Murray
(1956).


[Figure 1: mean percent of units by session (Pre-Co, Pre-E, 1, 2, 3, 4, Post-E, Post-Co) for the A, I, and C groups.]

FIG. 1. Session by session changes for mean
independence percentage.


[Figure 2: mean percent of units by session (Pre-Co, Pre-E, 1, 2, 3, 4, Post-E, Post-Co) for the A, I, and C groups.]

FIG. 2. Session by session changes for mean
affection percentage.

Three typewritten sessions were judged in- 
dependently by two judges after they had 
familiarized themselves with the scoring rules. 
The mechanics consisted in simply making a 
stroke after each string of words judged to be 
a unit. The reliabilities for the three sessions, 
measured by the number of exact agreements 
divided by the total number of instances, were 
.92, .95, and .95. 
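
The agreement index described above (exact agreements divided by total instances) can be sketched as follows. This is only an illustration: the source does not say what counted as an "instance", so the sketch assumes each instance is one candidate boundary position that both judges either marked or did not mark, and the data are hypothetical.

```python
def exact_agreement(judge_a, judge_b):
    """Proportion of instances on which two judges made the same decision."""
    if len(judge_a) != len(judge_b):
        raise ValueError("both judges must rate the same instances")
    if not judge_a:
        raise ValueError("no instances to compare")
    agreements = sum(1 for a, b in zip(judge_a, judge_b) if a == b)
    return agreements / len(judge_a)


# Hypothetical data: 1 = judge made a stroke (unit boundary) here, 0 = no stroke.
judge_1 = [0, 0, 1, 0, 1, 0, 0, 1, 0, 1]
judge_2 = [0, 0, 1, 0, 1, 0, 1, 1, 0, 1]

print(exact_agreement(judge_1, judge_2))
```

The two hypothetical judges disagree on one of ten positions. An index of this kind does not correct for chance agreement.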

Since E, in the experimental procedure 
itself, was reinforcing units as the subjects 
emitted them, it was necessary to show that 
E could reliably score units from tape record- 
ings of pretest sessions with which he was 
not familiar. The independent judge scored 
the sessions using a typescript and E then 
scored the sessions by listening to the tape 
recording. The reliabilities for four sessions 
scored in this manner averaged .90. 

The reliability of scoring of content cate- 
gories was established on 17 further sessions 
scored independently by E and another judge. 
Nine of the sessions were pretest sessions and 
eight were posttest sessions. This procedure 
was followed in order to check on the pos- 
sibility of E's tendency to subtly change the 
criteria of scoring throughout scoring many 
tape recorded sessions. The independent judge 
did not know which session of which subject 
was being scored. 

TABLE 2
SUMMARY OF WILCOXON TEST RESULTS

Group          Category       Conditioning   Retention         Generalization
Independence   Independence   T = -9.5*      T = [illegible]   T = 10 [marker illegible]
               Affection      16             32.5              20
Affection      Independence   9*             19                37
               Affection      -8*            -9.5*             -25
Control        Independence   10             20.5              31
               Affection      19.5           24                23

* Significant at .05 level.
** Significant at .01 level.


It should be noted that the reliability is in
terms of data pooled for the session and not
in terms of exact agreements between judg-
ments of individual subject statements. The
percentage of the category used in each session
is the appropriate measure for comparison
since the results to be presented later are in
terms of percentage totals. The reliabilities
for the 17 sessions (Kendall’s tau, Siegel,
1956) were .88 for Independence and .80 for
Affection.
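
A rank correlation between two scorers' session percentages, as in the Kendall's tau reliabilities reported above, can be sketched as follows. This is a minimal illustration, assuming the simplest form of the coefficient (tau-a, with no correction for ties); the source does not state which variant was used, and the percentages are hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = discordant = 0
    for i, j in combinations(range(n), 2):
        product = (x[i] - x[j]) * (y[i] - y[j])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
        # product == 0 means a tie on x or y; tau-a counts it as neither
    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical percentage of independence units per session, as scored by
# E and by the independent judge.
scored_by_e = [4.0, 7.5, 2.0, 9.0, 5.5]
scored_by_judge = [3.5, 8.0, 2.5, 7.0, 6.0]

print(kendall_tau_a(scored_by_e, scored_by_judge))
```

The coefficient depends only on whether the two scorers order the sessions in the same way, not on the exact percentages.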


Conditioning 


The mean percentage of each category used 
in each session for each group is shown in 
Figures 1 and 2. In order to assess the effect 
of conditioning the percentage scores for each 
subject, for each category, in the four con- 
ditioning sessions were averaged and were 
compared to the subject's operant level, i.e., 
her percentage of the category in the Pre-E 
session. The difference between these two 
scores was used as a measure of conditioning. 
The Wilcoxon matched-pairs signed-ranks test 
(Siegel, 1956) indicated that there were sig- 
nificant increases for both independence 
statements in the I group and affection state- 


ments in the A group, while these categories 
showed no changes in the control group. The 
affection category in the I group did not show 
a significant change, whereas the independence 
category in the A group showed a significant 
decrease. Table 2 presents a summary of the 
results of the Wilcoxon tests. These results 
support the conclusion that conditioning oc- 
curred in the experimental groups and not 
in the control group, and that the conditioning 
was specific to the particular categories being 
reinforced in the sense that reinforcement of 
one category did not result in an increase in 
the other category. 
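
The Wilcoxon matched-pairs signed-ranks statistic used for these within-group comparisons can be sketched as follows. This is an illustration only, assuming the usual convention of dropping zero differences, giving tied absolute differences their average rank, and taking T as the smaller of the two signed rank sums. The scores are hypothetical, not the study's data.

```python
def wilcoxon_t(before, after):
    """Return (T, N): the smaller signed rank sum and the number of nonzero differences."""
    diffs = [a - b for b, a in zip(before, after) if a != b]
    order = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * len(diffs)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run over differences tied in absolute value.
        while (end + 1 < len(order)
               and abs(diffs[order[end + 1]]) == abs(diffs[order[pos]])):
            end += 1
        average_rank = (pos + end) / 2 + 1  # mean of 1-based ranks pos+1 .. end+1
        for k in range(pos, end + 1):
            ranks[order[k]] = average_rank
        pos = end + 1
    positive = sum(r for r, d in zip(ranks, diffs) if d > 0)
    negative = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return min(positive, negative), len(diffs)


# Hypothetical percentages for 10 subjects: operant level (Pre-E) and the
# mean of the four conditioning sessions.
pre_e = [5, 8, 3, 6, 4, 7, 2, 9, 5, 6]
conditioning_mean = [9, 10, 8, 5, 10, 11, 5, 8, 12, 6]

print(wilcoxon_t(pre_e, conditioning_mean))
```

The resulting T would then be compared with a tabled critical value for the reduced N, as in Siegel (1956).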

Next the conditioned categories in the ex- 
perimental groups were compared with these 
categories in the control group. The com- 
parisons were made utilizing the Mann- 
Whitney U test. The differences between the 
experimental and control groups for both 
Independence and Affection were significant at 
better than the .01 level, substantiating the 
fact that conditioning occurred. Table 3 pre- 
sents a summary of the results of the Mann- 
Whitney U tests. 
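
The Mann-Whitney U statistic for comparing an experimental group with the control group can be sketched by direct pair counting. This is a minimal illustration, assuming ties count one half and U is reported as the smaller of the two counts; the change scores are hypothetical.

```python
def mann_whitney_u(group_1, group_2):
    """Smaller of U1 and U2, counting each tied pair as one half."""
    u1 = 0.0
    for a in group_1:
        for b in group_2:
            if a > b:
                u1 += 1.0
            elif a == b:
                u1 += 0.5
    u2 = len(group_1) * len(group_2) - u1
    return min(u1, u2)


# Hypothetical conditioning change scores (conditioning mean minus Pre-E).
experimental_change = [4, 6, 3, 7, 5]
control_change = [1, 2, 0, 5, -1]

print(mann_whitney_u(experimental_change, control_change))
```

A small U means the two groups' scores overlap little, which is the pattern the reported conditioning comparisons show.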

TABLE 3
SUMMARY OF MANN-WHITNEY TEST RESULTS

Group                         Category       Conditioning   Retention   Generalization
Independence versus Control   Independence   U = 11**       U = 19**    U = 39
                              Affection      54             62          56
Affection versus Control      Independence   38             51          50
                              Affection      19**           26*         40

* Significant at .05 level.
** Significant at .01 level.


There is still another important finding with
regard to conditioning. The fourth condition-
ing session, for both categories, showed a drop
in the response level of the conditioned cate- 
gory when compared with Session 3. A com- 
parison was made between the first five sub- 
jects in each group, who had all been seen 
for their fourth session on a Thursday eve- 
ning on which E was very tired, and the second 
group of five, with whom an effort to avoid 
E's loss of motivation had been made. Whereas 
the first five subjects, when Sessions 3 and 4 
of conditioning were compared, showed de- 
creases in the categories being conditioned, 
this was not true of the second five subjects, 
who showed no change in affection and an in- 
crease in independence. 


Retention 


Retention was said to have occurred if the 
Post-E sessions showed a significant increase 
over the Pre-E sessions. 

The percentage of the category used in 
the Pre-E session for each subject was sub- 
tracted from the percentage used in the 
Post-E session, and this difference was used as 
the measure of retention. The Wilcoxon test 
results (see Table 2) showed that there was 
retention of the conditioned change in both 
Independence and Affection categories. There 
were no significant differences between the 
Pre-E and Post-E sessions for either category 
in the control group. When the comparisons 
were made for the unconditioned categories in 
the experimental groups, i.e., affection in the I 
group and independence in the A group, no sig- 
nificant differences occurred. Thus, while there 
was a significant change from the Pre-E to 
the Post-E session, i.e., retention, for the
conditioned categories, there were no signifi- 
cant changes when conditioning had not 
occurred. 

When the experimental and control groups 
were compared (see Table 3) with respect to 
Pre-E and Post-E session changes, significant 
differences were shown for both categories. 
This is a further indication that retention 
occurred. 
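
The per-subject change scores behind the three hypotheses can be sketched as follows: conditioning is the mean of the four conditioning sessions minus Pre-E, retention is Post-E minus Pre-E, and generalization (Hypothesis 3) is Post-Co minus Pre-Co. The function name, dictionary keys, and percentages are hypothetical and serve only to illustrate the arithmetic.

```python
from statistics import mean


def change_scores(subject):
    """Compute the three difference scores for one subject and one category.

    `subject` maps session labels to the percentage of units falling in the
    category; "cond" holds the four conditioning-session percentages.
    """
    return {
        "conditioning": mean(subject["cond"]) - subject["pre_e"],
        "retention": subject["post_e"] - subject["pre_e"],
        "generalization": subject["post_co"] - subject["pre_co"],
    }


# Hypothetical subject from the Independence group.
subject = {
    "pre_co": 4.0,
    "pre_e": 5.0,
    "cond": [6.0, 8.0, 10.0, 8.0],
    "post_e": 9.0,
    "post_co": 6.0,
}

scores = change_scores(subject)
print(scores)
```

Each score is positive when the category was used more after conditioning than before, which is the direction all three hypotheses predict.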


Generalization 


The percentage of the category used in
the Pre-Co session was subtracted from the
percentage used in the Post-Co session, and
this difference was used as a measure of
generalization effect for each subject. The
intragroup comparison for independence in
the I group showed a significant generaliza-
tion effect (see Table 2); however, no sig-
nificant effect was shown for affection in the
A group. In the control group neither category
showed a significant change from the Pre-Co
to the Post-Co sessions, and the unconditioned
categories in the experimental groups also
showed no significant changes.

When the changes in the experimental
groups for the conditioned categories were
compared to the changes shown for these
categories in the control group (see Table 3),
there were no significant differences between
them. Both comparisons showed differences in
the predicted direction, but neither reached
the .05 level.

There was a slight increase in the use of
the conditioned categories from Pre-Co to
Post-Co sessions in the I group, an increase
which was, however, not significantly greater
than that which occurred from the Pre-Co to
the Post-Co sessions in the control group.
From these data it must be concluded that, at
best, only a small generalization effect
occurred.

Two further comparisons, between in-
dependence in the I group and independence
in the A group, and affection in the A group
and affection in the I group, were made.
Neither of these comparisons resulted in sig-
nificant differences, indicating that the con-
ditioned categories in the experimental groups
did not show generalization when compared
to the unconditioned categories in these
groups.

It will be remembered that the order of
the pre- and postconditioning sessions was
counterbalanced. The subjects in the two
orders were compared for generalization, as
well as for conditioning and retention, and no
significant differences resulted. It was, thus,
concluded that there was no order effect.


Interview 


Each subject was given an awareness inter-
view at the conclusion of the experiment.
Awareness was operationally defined as an
explicit statement by the subject that the



CONDITIONING IN AN 


interviewer had responded with “mm-hm” or
with a head nod whenever the subject gave
an independence or affection statement. It
was determined that only 3 of the subjects
were aware of the head nod conditioning, and
none of them connected the head nod with any
particular topic, although all had considered
this possibility. As for the “mm-hm” condi-
tioning, only 1 of the 20 experimental sub-
jects met the criterion for awareness, although
15 spontaneously reported that they had
noticed the “mm-hm.” All subjects, in both
the experimental and the control groups, had
thought about the possibility of a connection
between the “mm-hm” and their verbaliza-
tions; however, only 1 stated the connection
correctly. This subject, incidentally, showed
a large conditioning effect.

The extent to which the experiment be- 
came an important and much thought about 
aspect of the subject's activities during the 
week was quite surprising. For the most part
subjects did not particularly enjoy talking
about themselves; however, they were all very
concerned about whether or not they had
“done a good job.” It was clear that they
were quite involved in the task, and that 
most of them were trying to please the E 
and be good subjects. 


DISCUSSION 


To review, the purpose of the present study 
was to discover whether established complex 
verbal habits could be conditioned in free 
interview situations, whether this conditioning 
effect would be retained over a 24-hour in- 
terval, and whether it would generalize to a 
different although similar situation. The re- 
sults indicated that verbal changes, which 
have been clearly demonstrated to occur in
psychotherapy, and to be dependent on the
therapist’s reinforcements (Murray, 1956),
also occur in a verbal conditioning situation
where they are clearly dependent on the rein-
forcement of the experimenter. This sub-
stantiates other demonstrations of the appli-
cability of learning theory principles in situa-
tions as complex as psychotherapy (Bandura,
1961).

Two important additional findings were that 
the subjects remained susceptible to rein-
forcement effects over a 4-day period, and
that the categories, if not conditioned, tended 
to decrease in frequency. The latter finding 
substantiates Rogers’ (1960) conclusion that 
without reinforcement positive self-references 
tend to extinguish. 

The finding that the conditioning effect 
showed a decrease from Session 3 to Session 
4 for some subjects, but not for others, was 
notable. If it was E's lack of motivation and 
apathetic attitude in the Thursday session 
which caused the decrease to occur, then it 
would appear that the stimulus situation is 
more complicated than has been indicated in 
operant conditioning studies. The reinforce- 
ment value of the simple stimulus “mm-hm” 
may change from day to day even with the 
same subject and the same E, depending on 
E's motivation to reinforce or the subject's 
need to receive rewards. 

Since E had seen the results of the first 
five subjects in each group, he may have 
reinforced in a more potent and rewarding 
manner in the fourth conditioning sessions of 
the second five subjects. It could be that had 
E remained the same throughout all the ses- 
sions for all the subjects, the conditioning 
effect would have uniformly decreased in 
Session 4. Even this explanation, however, in- 
dicates a very definite effect of the manipula- 
tion of the reinforcement procedures by the 
therapist or experimenter. Further research is 
needed in this area. 

A retention effect was established, i.e., the
conditioning effects were stable and occurred 
after a 24-hour interval in a session in which 
no further reinforcement was given. Previous 
studies, such as Salzinger and Pisoni's (1958), 
have measured the effects of conditioning in 
an extinction period immediately after the 
conditioning. 

It has been shown (Eriksen, Kuethe, & 
Sullivan, 1958; Giddan & Eriksen, 1959) that
simple response biases are easy to establish by 
conditioning, and that they will generalize 
to another situation. However, when the situa- 
tion in which the generalization is measured 
becomes more than slightly different from 
that in which the conditioning occurred, the 
effect will not transfer (Ekman & Friesen,
1960; Rogers, 1960; Weide, 1960; Yadoff,
1959).


It would seem, in this study, since reten- 
tion occurred, and since the total situation 
was almost the same in the retention and the 
generalization conditions, that generalization 
should occur. It should be noted that any 
conclusion on this point must be tenuous since 
the Post-Co minus Pre-Co session changes 
were significant for independence (generaliza- 
tion did occur) whereas they were not for 
affection (generalization did not occur). The 
reason that both comparisons were not sig- 
nificant when the control group change was 
utilized as the baseline was that the control 
group showed an increase in the use of in- 
dependence statements in the Post-Co session. 

Any change in the total situation, be it in 
the response required of the subjects or in the 
stimulus confronting them, apparently func- 
tions to lower a transfer effect. Systematic 
variation of all aspects of the total stimulus 
situation will enable us to discover the limits
of generalizability. 
2 groups of female high school students (40 achievers and 40 underachievers,
matched for IQ) were exposed to 2 experimental conditions (Neutral and
Achievement Oriented) and 2 types of pictures (those depicting males and
those depicting females). The overall effect of the experimental achievement
arousal conditions for all girls was nonsignificant. However, a highly sig-
nificant 2nd-order interaction effect was obtained: the achievement motiva-
tion scores of achievers increased significantly in response to Achievement
Oriented conditions when they produced stories to pictures of females but
did not increase in response to pictures of males; by contrast, the achievement
motivation scores of underachievers increased significantly in response to
Achievement Oriented conditions when they produced stories to pictures of
males but did not increase in response to pictures of females.


An impressive body of theoretically con-
sistent information has accumulated which
describes the operation of achievement mo-
tivation in male subjects. Studies of achieve-
ment motivation in men have demonstrated
increases in achievement motivation scores in
response to experimental achievement arousal
conditions which stress intelligence and
leadership ability (e.g., Lowell, 1950; Mc-
Clelland, Clark, Roby, & Atkinson, 1949;
Veroff, Wilcox, & Atkinson, 1953), and have
described significant relationships between
achievement motivation and risk taking
behavior (Atkinson, 1957; Atkinson, Bas-
tian, Earl, & Litwin, 1960; Atkinson &
Litwin, 1960), conformity (Davage, 1956;
Samelson, 1958), work partner selection
(French, 1956), problem solving effectiveness
(French, 1958; French & Thomas, 1958),
learning (e.g., Hurley, 1957; Johnston, 1955;
Lowell, 1952), academic performance (Pierce
& Bowman, 1960; Ricciuti & Sadacca, 1955),
speech behavior (Wagner & Williams, 1961),
etc. However, the few comparable studies of
achievement motivation in female subjects
have shown neither consistency with the find-
ings for men nor internal consistency with
each other. The general theory of achievement 
motivation which has evolved to date (Atkin- 
son, 1957, 1958; McClelland, Atkinson, Clark, 
& Lowell, 1953) confines its empirical sup- 
ports to research with men and is forced to 
ignore the dynamics of achievement motiva- 
tion in women because of the scarcity, am- 
biguity, and inconclusiveness of the empirical 
findings. This, of course, has been a formidable 
restriction on comprehensive theory con- 
struction. 

One area of confusion in the results on 
samples of women (e.g., Veroff et al., 1953) 
is the apparent failure of female subjects to 
show the expected increase in n Achievement 
scores when exposed to experimental achieve- 
ment arousal conditions which stress “intel- 
ligence and leadership ability." Almost a 
decade ago, McClelland et al. (1953) stated: 


Women do not show an increase in n Achievement 
scores as a result of achievement-involving instruc- 
tions. . . . Why then don't women's scores increase 
under experimental arousal? This is the puzzler. Two 
possible explanations—invalidity of the scoring for 
women, scores too high to go higher—have been 
eliminated. Apparently the usual arousal instructions 
simply do not increase achievement striving in 
women . . . (p. 178).


Substantially the same unresolved state of
affairs remains today. In general, we know 
very little about the operation of achievement 
motivation in female subjects, and, specif- 
ically, the lack of response to experimental 
achievement arousal conditions has not been 
explained. Field (1951) supplied some clarifi- 
cation of the experimental arousal issue by 
demonstrating that, although manipulating 
achievement orientation by reference to intel- 
ligence and leadership does not increase 
achievement motivation scores in women, 
experimental conditions which arouse a con- 
cern about social acceptance produce increases 
in their n Achievement scores. 

One research result has been reported in the 
literature on achievement motivation which 
suggests that, for certain female samples, the 
experimental instructions which appeal to 
intelligence and leadership ability can produce 
an increase in achievement motivation scores. 
Angelini (1955), employing a sample of 
Brazilian college women, found significant 
increases in n Achievement following experi- 
mental instructions which appeal to intel- 
ligence and leadership ability, and also found 
that this increase exceeded the increase pro- 
duced by experimental conditions which arouse 
concern about social acceptance. Angelini ex- 
plains the discrepancy between the results on 
Brazilian and American college women by 
indicating that in Brazil, where opportunities 
for higher education are greatly limited, only 
highly competitive girls who have placed great 
stress on intellectual accomplishment will 
succeed in enrolling in a university. It is 
reasonable to conclude that the achievement 
goals of Angelini's sample of Brazilian college 
women are more career or intellectually
oriented than the more socially oriented ob- 
jectives of the average American female 
subject. 

Thus, it appears possible that the failure 
to demonstrate increases in achievement mo- 
tivation in American women in response to 
experimental conditions stressing intelligence 
and leadership ability is related to the fact 
that the female subjects employed were in- 
sufficiently concerned with standards of intel- 
lectual excellence and with achieving through 
the development of intellectual skills. There 


are, of course, certain settings in which great
emphasis is placed upon the intellectual ac-
complishments of girls. One such institution
is the Hunter High School for intellectually
gifted girls; this is a seventh through twelfth
grade demonstration school associated with
Hunter College of the City University of
New York. Hunter High School sets ex-
tremely high expectations and demands for
intellectual achievement, provides intense
scholastic stimulation, and emphasizes com-
petitive achievement goals which require long-
range planning and persistence, including the
pursuit of college training. The presence of
a strong intellectual achievement orientation 
at Hunter High School is reflected in the 
following facts: (a) admission is highly com- 
petitive—only outstanding elementary school 
students are advised to apply and from this 
highly selected group of about 4000 candi- 
dates, approximately 150 are admitted; (b) 
more than 99% of Hunter High School girls 
proceed to college (and, of the remaining few, 
most choose other forms of advanced training, 
such as nursing); (c) no high school in 
New York State receives a higher percentage 
of State Regents Scholarships; ordinarily, 
about 75% of Hunter High School’s graduat- 
ing seniors receive this competitive award; 
(d) although no precise tabulations exist, a
very large percentage of graduates win scho- 
lastic recognition in college and proceed to 
professional careers. 

Girls attending Hunter High School might,
thus, be expected to display a concern for 
competitive standards of excellence and 
achievement comparable to the concern of 
the Brazilian college women in Angelini’s 
(1955) sample, and it was hypothesized that 
Hunter High School students would display
the same increases in achievement motivation
scores in response to experimental conditions
emphasizing intelligence and leadership abil-
ity which were shown by previous samples of
men and by Brazilian college women. While 
no specific direction of results was predicted, 
it was further expected that academic 
“achievers” and “underachievers” might re-
spond differently to the experimental achieve-
ment arousal conditions and to pictures
depicting male or female characters.


METHOD
Subjects 


The subjects were 80 juniors and seniors (ranging 
in age from 15-8 to 18-3 years) at Hunter High 
School.2 These girls are primarily from middle- 
class families. 

Because it appeared likely that there would be 
differences in response to experimental instructions 
stressing intelligence and leadership ability between 
girls who were successfully meeting the academic 
demands of the school and girls who were not, the 
80 students were divided into two equal matched 
groups of 40 achievers and 40 underachievers. 
Achievers in the junior class had cumulative scho-
lastic grade averages in the first quartile for the
four preceding semesters and in the senior class 
had first quartile grades for five of six preceding 
semesters; underachievers in the junior class had 
cumulative scholastic grade averages in the fourth 
quartile for the four preceding semesters and in 
the senior class had fourth quartile grades for 
five of the six preceding semesters. 

Achievers were individually matched with under- 
achievers for IQ score—Terman-McNemar (1941) 
Test of Mental Ability. The mean IQ score for the 
40 achievers was 132.36 (SD = 6.40) and for the 40
underachievers was 131.86 (SD = 7.42); thus, the
groups did not differ significantly in IQ (t = 0.71,
p > .05).
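
Because achievers were individually matched with underachievers, one way to check that matching left no IQ difference is a t test on the within-pair differences. The source does not say which form of t test produced the reported value, so the sketch below is only an assumption about how such a check could be run, and the IQ scores are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(scores_a, scores_b):
    """Return (t, df) for the mean within-pair difference."""
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two matched pairs")
    standard_error = stdev(diffs) / sqrt(n)  # stdev uses the n - 1 denominator
    return mean(diffs) / standard_error, n - 1


# Hypothetical IQ scores for five matched pairs.
achiever_iq = [130, 135, 128, 140, 132]
underachiever_iq = [129, 136, 127, 138, 133]

t_value, df = paired_t(achiever_iq, underachiever_iq)
print(round(t_value, 3), df)
```

A small t relative to the tabled critical value for the given degrees of freedom would indicate, as in the text, that the matched groups do not differ reliably in IQ.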

In order to accumulate a sufficient number of 
matched subjects, the experiment was spread over 
a 2-year period. In 1958, 20 achievers and 20 
underachievers were drawn from the junior and 
senior classes, and, again in 1960, 40 matched 
subjects were drawn from the junior and senior 
grades. All experimental sessions were conducted by
the same female experimenter (assisted by another 
female experimenter) in 1958 and in 1960 under 
identical classroom conditions. 


Experimental Conditions 


Each student was exposed to two group experi- 
mental sessions separated by one week, the first 
under Neutral conditions and the second under 
Achievement Oriented conditions. The Neutral and 
Achievement Oriented conditions employed in this 
experiment were identical to those described 
originally by McClelland et al. (1953, pp. 101-104)
and used widely thereafter (e.g., Atkinson & Reitman, 
1956; Martire, 1956). In general, the neutral con- 
dition is one in which no experimental attempt is 
made either to arouse achievement motivation or to 
create an especially relaxed condition; the objective 
is to elicit the normal motivation level of the 
subject in an everyday school setting. The achieve-
ment oriented condition introduces strong achieve-
ment cues in the form of special instructions in
order to arouse achievement related motives.


2 The cooperation of the Hunter High School
administration is gratefully acknowledged. Cyril W.
Woolcock, Mildred A. Busch, Della B. Meehan,
Harriet Schueler, and many others contributed much
valuable advice and assistance.


Measure of n Achievement 


Content analysis of thematic stories provided 
the measure of achievement motivation. The group 
testing procedure which was followed for collecting 
the stories is described in detail elsewhere (Atkinson, 
1958, p. 837; McClelland et al., 1953, pp. 97-99).

Each of the 80 subjects wrote 12 thematic stories, 
6 under neutral experimental conditions (3 to 
pictures in which a male was the central figure 
and 3 in which a female was the central figure), 
and 6 different stories under achievement arousal 
conditions (3 to pictures of males and 3 to 
pictures of females). 

The 12 pictures were selected from those used in 
previous research on achievement motivation. The 
6 pictures depicting females as central figures were 
derived from the research of Veroff, Atkinson, Feld, 
and Gurin (1960, pp. 3-4), and the 6 pictures of 
males are described by McClelland et al. (1953, p. 
375). These 12 pictures were divided into the 
following four sets of 3 pictures each:


Female Set A 

Woman (mother) seated by a young girl reclining 
in chair. 

Group of four women, one standing, the others 
seated facing each other. 

Woman kneeling and applying a cover to a chair. 


Female Set B 

Two women preparing food in a kitchen. 

Two women standing by a table in a laboratory; 
one woman is working with test tubes. 

A woman in the foreground with a man standing 
behind and to the side. 


Male Set C 

Two men (“inventors”) in a shop working at a 
machine. 

Boy in foreground with vague operation scene 
in background. 

Father and son. 


Male Set D

Two men, in colonial dress, printing in a shop.

“Lawyer’s” office: two men talking in a well-
furnished office.

“Cub reporter” scene: older man handing papers to
a younger man.


The thematic stories were scored according to 
the method of content analysis (Scoring System C) 
also described in detail elsewhere (Atkinson, 1958, 
pp. 179-204, 685-735; McClelland et al., 1953,
pp. 107-138, 335-374). Interjudge scoring reliability 
(Spearman rank-order correlation coefficient) was 
.93 for the six-story protocols of 30 subjects. 


3 We wish to acknowledge the assistance of Arlene 
Benson (Survey Research Center, University of
Michigan) and Shirley Feltman (Hunter College) in 
scoring test protocols. 
TABLE 1


MEAN n ACHIEVEMENT SCORES OF ACHIEVING AND UNDERACHIEVING GIRLS UNDER NEUTRAL AND
ACHIEVEMENT ORIENTED EXPERIMENTAL CONDITIONS TO PICTURES CONTAINING
FEMALE AND MALE CHARACTERS

                                    Experimental conditions
                              Neutral               Achievement oriented     Total for all
                       Three female  Three male   Three female  Three male   experimental
                         pictures     pictures      pictures     pictures    conditions and
                                                                             pictures
Group              N     M     SD     M     SD      M     SD     M     SD     M     SD
Achievers         40    4.80  2.39   5.43  2.60    6.03  2.82   4.78  2.17   5.26  2.56
Underachievers    40    2.93  2.40   4.18  2.82    2.25  2.22   6.20  2.47   3.89  2.91
Total for all     80    3.86  2.58   4.80  2.56    4.14  2.91   5.49  2.44
subjects


TABLE 2


ANALYSIS OF VARIANCE OF n ACHIEVEMENT SCORES OF
ACHIEVING AND UNDERACHIEVING GIRLS UNDER
NEUTRAL AND ACHIEVEMENT ORIENTED EX-
PERIMENTAL CONDITIONS TO PICTURES
CONTAINING FEMALE AND MALE
CHARACTERS

Source                              SS       df      MS        F
Between subjects
  Groups (G)                      149.88      1    149.88    16.60*
  Error (b)                       704.72     78      9.03
  Total between                   854.60     79
Within subjects
  Pictures (P)                    104.66      1    104.66    15.06*
  Experimental conditions (E)      18.53      1     18.53     2.53
  P X E                             3.40      1      3.40     1.45
  P X G                           169.65      1    169.65    24.41*
  E X G                             3.00      1      3.00     0.41
  P X E X G                       104.66      1    104.66    44.73*
  Error 1 (w)                     542.44     78      6.95
  Error 2 (w)                     571.22     78      7.32
  Error 3 (w)                     182.19     78      2.34
  Error (w)                      1295.85    234      5.54
  Total within                   1699.75    240
Total                            2554.35    319

* p < .001.


Design


Both groups of female subjects (achievers and
underachievers) were exposed to both experimental
conditions (Neutral and Achievement Oriented) and
both types of pictures (those depicting males and
those depicting females). For the variable of
groups each achieving subject was matched for IQ
with an underachieving control. For the variables
of experimental conditions and pictures each subject
was used as her own control, being exposed to both
experimental sessions and to both male and female
pictures. Thus, since there were one between-sub-
jects variable (groups) and two within-subjects vari-
ables (experimental conditions and pictures), a
2 X 2 X 2 Type VI analysis of variance design
described by Lindquist (1956) was employed.
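
The F ratios in Table 2 can be checked from the mean squares alone. The sketch below is illustrative only: the pairing of each effect with an error term is an assumption inferred from which quotient reproduces the printed F, and the dictionary names are hypothetical.

```python
# Mean squares transcribed from Table 2 (2 X 2 X 2 Type VI design).
MS_EFFECT = {
    "G": 149.88, "P": 104.66, "E": 18.53,
    "PxE": 3.40, "PxG": 169.65, "ExG": 3.00, "PxExG": 104.66,
}
MS_ERROR = {"b": 9.03, "w1": 6.95, "w2": 7.32, "w3": 2.34}

# Assumed effect-to-error pairing (not stated explicitly in the text).
ERROR_FOR = {
    "G": "b",
    "P": "w1", "PxG": "w1",
    "E": "w2", "ExG": "w2",
    "PxE": "w3", "PxExG": "w3",
}

def f_ratio(effect):
    """F = mean square of the effect / mean square of its error term."""
    return MS_EFFECT[effect] / MS_ERROR[ERROR_FOR[effect]]

f_ratios = {name: round(f_ratio(name), 2) for name in MS_EFFECT}
for name, value in f_ratios.items():
    print(f"{name:6s} F = {value:.2f}")
```

Under this pairing every quotient agrees with the tabled F to two decimals, which also supports reading the damaged entries as 3.00 (E X G sum of squares) and 571.22 (second within-subjects error).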
A succession of studies (e.g., Birney, 1958; Haber
& Alpert, 1958; Jacobs, 1958; Murstein, 1961) has
shown the crucial influence upon the motivational
content of stories of the cue strength of the thematic
apperception pictures. No fully adequate technique
for comparing and equating the relative cue strengths
of thematic pictures has as yet been developed. In
order to be able to attribute unequivocally any
observed increases in achievement motivation scores
to the experimental conditions rather than to the
differential cue strength of the pictures used under
Neutral and Achievement Oriented conditions, the
sets of pictures were counterbalanced. Since the
total experimental procedure was conducted first
in 1958 and then repeated in 1960 with samples of
equal size, the counterbalancing of sets of pictures
was used; the data from the two administrations
were then pooled in the Type VI analysis of variance
design described. The effect of order of presenta-
tion for different combinations of sets of pictures
could not be computed because all possible com-
binations of order of presentation were not rep-
resented; that is, the combinations A-B, C-D and
B-A, D-C were used in counterbalancing, but A-B,
D-C and B-A, C-D did not appear.


RESULTS 


Table 1 presents the mean achievement 
motivation scores for the eight subgroups.
Bartlett’s test for homogeneity of variance
indicated that analysis of variance was not
inappropriate for these data.

Table 2 presents the Type VI (Lindquist,
1956) analysis of variance. This table in-
dicates that the overall effect of the experi-
mental conditions was nonsignificant; thus,
the major expectation of this study, that ex- 
perimental conditions stressing intelligence 
and leadership ability would produce an in- 
crease in achievement motivation scores for 
this sample of girls, was not confirmed. Also, 
the experimental conditions were apparently 
no more effective in producing an increase
in the achievement motivation scores of the
achievers than of the underachievers (i.e.,
the interaction between experimental condi-
tions and groups was also nonsignificant).
However, the second order interaction effect
of Pictures X Experimental Conditions X
Groups was highly significant (F = 44.73,
p < .001). Thus, the achievement motivation
scores of achievers increase significantly in
response to Achievement Oriented conditions
when they produce stories to pictures of 
females but do not increase in response to 
pictures of males; by contrast, the achieve- 
ment motivation scores of underachievers in- 
crease significantly in response to Achievement 
Oriented conditions when they produce stories 
to pictures of males but do not increase in 
response to pictures of females. 
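
The pattern behind the second order interaction can be read directly from the cell means in Table 1. The following sketch is a descriptive illustration, not the authors' analysis: it computes the change from Neutral to Achievement Oriented conditions in each group by picture cell, using the means as reconstructed from Table 1 (for example, reading the damaged entry for achievers, neutral, male pictures as 5.43). The names are hypothetical.

```python
# Cell means from Table 1, keyed by (group, sex of pictured figures).
MEANS = {
    ("achievers", "female"): {"neutral": 4.80, "oriented": 6.03},
    ("achievers", "male"): {"neutral": 5.43, "oriented": 4.78},
    ("underachievers", "female"): {"neutral": 2.93, "oriented": 2.25},
    ("underachievers", "male"): {"neutral": 4.18, "oriented": 6.20},
}

def arousal_change(group, pictures):
    """Mean n Achievement under Achievement Oriented minus Neutral conditions."""
    cell = MEANS[(group, pictures)]
    return cell["oriented"] - cell["neutral"]

changes = {key: arousal_change(*key) for key in MEANS}

# Three-way contrast: (female minus male change) for achievers
# minus the same difference for underachievers.
contrast = (
    (changes[("achievers", "female")] - changes[("achievers", "male")])
    - (changes[("underachievers", "female")] - changes[("underachievers", "male")])
)

for (group, pictures), delta in changes.items():
    print(f"{group:15s} {pictures:6s} pictures: {delta:+.2f}")
print(f"three-way contrast: {contrast:+.2f}")
```

The signs reverse between the groups: achievers gain only on female pictures, underachievers only on male pictures. Whether each individual change is itself significant cannot be determined from the means alone.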

The other significant results presented in
Table 2 indicate that the achievers have higher
n Achievement scores than underachievers for
all experimental conditions and types of
pictures combined (F = 16.60, p < .001) and that the
pictures depicting males produce higher n 
Achievement scores than pictures depicting 
females for all groups and experimental con- 
ditions combined (F = 15.06, p< .001); 
however, this difference is attributable to the 
scores of underachievers and not of achievers, 
since for both experimental conditions com- 
bined, achievers produce higher n Achievement 
scores to female pictures than to male pic- 
tures while underachievers produce higher n 
Achievement scores to male pictures than to 
female pictures (F = 24.41, p< .001). 
Numerous studies (e.g., Ricciuti & Sadacca, 
1955; Veroff et al., 1960) have demonstrated 
the importance of controlling for individual 
differences in verbal fluency in computing 
scores derived from thematic content. How- 
ever, the correlation coefficient obtained in 
the present study between n Achievement 
scores and number of words in the story 


protocols ranged from —.01 to —.10. These 
negligible coefficients may have resulted from 
the fact that word fluency showed little vari- 
ability; all of the subjects in this experiment 
showed great verbal fluency and very few 
brief protocols were obtained. This finding is 
consistent with earlier reports (e.g., Atkinson, 
1950) in which negligible correlation coeffi- 
cients were found between n Achievement 
scores and length of protocol when the sam- 
ple was homogeneous and structuring ques- 
tions were used in the test administration. 
Since the correlation coefficients in the present 
study were so small (accounting for no more 
than 1% of the shared variance between n 
Achievement scores and length of protocol), 
no attempt was made to correct for individual 
differences in verbal fluency. 
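
The "no more than 1%" figure follows from squaring the largest reported coefficient. A minimal check, with a hypothetical helper name:

```python
def shared_variance(r):
    """Proportion of variance shared by two variables correlated at r."""
    return r * r

# Range of correlations reported between n Achievement and protocol length.
for r in (-0.01, -0.10):
    print(f"r = {r:+.2f} -> shared variance = {shared_variance(r):.2%}")
```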


DISCUSSION 


Contrary to expectation, the experimental 
introduction of Achievement Orientation con- 
ditions did not produce an overall increase in 
achievement motivation scores for this sample 
of girls. However, a highly significant second 
order interaction effect emerged: achieving
girls did display the expected increase in 
achievement motivation scores under achieve- 
ment orienting conditions, but only when re- 
sponding to stimuli depicting females; under- 
achieving girls also displayed the expected in- 
crease in achievement motivation scores under 
achievement orienting conditions, but only 
when responding to stimuli depicting males. 

A suggested interpretation is that the 
achieving girls perceive intellectual achieve- 
ment goals as a relevant part of their own 
female role; in contrast, the underachieving 
girls perceive intellectual achievement goals 
as more relevant to the male role than to their 
own female role. Since the achievers and 
underachievers have equivalent intellectual 
capabilities, the discrepancy between them in 
their perception of the strivings and behaviors 
relevant to female and male roles may also 
represent a reasonable explanation for their 
differential academic performance. These inter-
pretations are consistent with growing evi-
dence (e.g., Moss & Kagan, 1961) that the
production of thematic material is strongly
influenced by the subject's conception of
what behaviors are appropriate to the hero’s
social roles. 

The application of such a social role model 
to the finding that achieving girls respond 
to experimental arousal conditions with in- 
creased achievement imagery when reacting 
to female figures but not when reacting to 
male figures is relatively direct. The achieve-
ment strivings of these girls are aroused by
the experimental instructions stressing intel- 
ligence and leadership, and this increase in 
achievement motivation is expressed im- 
aginally in response to the female figures since 
these female characters and their activities 
are most directly related to the girls’ own 
strivings. 

The concept of achievement motivation as 
a generalized pattern of strivings must be 
added to the social role interpretation when 
it is applied to the finding that the under- 
achieving girls respond to experimental arousal 
conditions with increased achievement imagery 
only to male figures. If achievement motiva- 
tion in this sample of underachieving girls is 
composed of generalized strivings to excel, 
then these generalized achievement strivings, 
when aroused, will be expressed imaginally 
by assigning increased achievement directed 
activity to the male figures for whom achieve- 
ment behaviors are considered more ap- 
propriate. Thus, this social role interpreta- 
tion suggests that when the achievement 
imagery of underachieving subjects is engaged 
by experimental instructions stressing intel-
ligence and leadership, the increased general-
ized achievement imagery is attached to the 
figures whom the underachievers perceive to 
be the usual and proper agents for action 
directed toward achievement. 

Margaret Mead (1949) has observed that 
the adolescent girl in our society begins to
realize that her attempts to achieve place her 
in competition with men and elicit negative 
reactions from them; our society, thus, defines 
out of the female role ideas and strivings 
for intellectual achievement. The results of 
the present research suggest that these social 
conditions have different impacts upon dif- 
ferent girls. It appears that the girl who 
retains a perception of the female role as 
including intellectual achievement goals suc-
ceeds intellectually under conditions of strong


G. S. LESSER, R. N. KRAWITZ, AND R. PACKARD


academic competition with other girls; by
comparison, the girl who accepts the social
prescription that intellectual achievement
strivings are relevant to the male role and
not the female role does not succeed as well
in intellectual competition with other girls.

Other results of the present study are con-
sistent with a social role interpretation and
may help to explain previously reported find-
ings on samples of women. Veroff et al. (1953)
report that female high school students pro-
duce greater n Achievement scores to pictures
of men than to pictures of women, while
Angelini (1955) reports higher scores for
Brazilian college women responding to pic-
tures of females than for Brazilian college
men responding to pictures of men. In the
present study, the pictures of men elicited
more achievement imagery than pictures of
women only for the underachievers and not
for the achievers. Table 1 shows that for the
underachievers, the mean n Achievement
score to pictures of men significantly exceeds
the mean score to pictures of women under
both Neutral and Achievement Orientation
conditions. However, for the achievers, the
mean n Achievement score to pictures of men
was significantly lower than the mean score
to pictures of women under Achievement
Orientation conditions, and under Neutral
conditions the mean scores to pictures of men
and women were not significantly different.

An alternative theoretical model, based
upon the concepts of approach-avoidance
conflict and conflict produced displacement,
has previously proved fruitful in research on
achievement needs (e.g., Atkinson, 1954,
1957), aggression (e.g., Lesser, 1958), and
sexual needs (e.g., Clark, 1952; Clark &
Sensibar, 1956). This conceptualization, intro-
duced by Lewin (1931) and elaborated by
Miller (1948, 1951), has become a conven-
tional model for discussions of needs which
are ordinarily accompanied by some degree of
attendant anxiety. While this model may
ultimately prove to be of great value in
conceptualizing the results of studies on the
characteristics of achievement motivation in
women, it is not clear at this time that it
would unequivocally predict the present
results.


4 In the absence of additional empirical evidence
on the operation of achievement motivation in
women, the social role explanation is not more fully
developed here. However, this theoretical approach
is being expanded in a study of the characteristics
of achievement motivation in college women con-
ducted currently by Elizabeth G. French and the
senior author of this paper.

However, no matter what theoretical model 
is applied, it is apparent that attempts to 
clarify the impact of experimental achieve- 
ment arousal conditions upon female subjects 
must consider the interaction among the 
nature of the experimental conditions, the 
characteristics of the stimuli used to elicit 
thematic material, and the relative accomplish- 
ment of the subjects in achievement per- 
formance. 

One further comparison should be noted 
between the present results and the findings 
of earlier research. While a recent study by 
Pierce and Bowman (1960) on a sample of 
gifted high school girls reports the absence 
of a significant relationship between achieve- 
ment motivation scores and academic per- 
formance (when only pictures of men were 
used in assessing achievement motivation of 
the female subjects), studies relating academic 
performance to achievement motivation in 
samples of men have reported either strong 
relationships (e.g., McClelland et al., 1953; 
Morgan, 1952; Pierce & Bowman, 1960; 
Rosen, 1955) or at least moderate relation- 
ships (e.g., Morgan, 1953; Ricciuti & Sadacca, 
1955). The present study obtained a highly 
significant difference between achieving and 
underachieving girls in their achievement mo- 
tivation scores, even with IQ controlled 
through the matching of subjects. The sig- 
nificant interaction of Pictures X Groups re- 
ported in Table 2 indicates that the difference 
in achievement motivation scores between
achievers and underachievers occurred in re-
sponse to pictures of females but did not
occur in response to pictures of males. Thus, 
the discrepancy between the positive findings 
of this study and the negative results of 
Pierce and Bowman in relating achievement 
motivation to academic performance in girls 
may be explained by the absence in the latter 
study of stimuli which allow the relationship 
to be displayed. Despite the present evidence, 
Atkinson’s (1958, p. 605) caution is still 
appropriate: that no simple explanation of the 


complex behaviors referred to by the variable 
of academic achievement can be expected 
through the consideration of the strength of 
any single motive alone. 
This study investigated the effect of various instructional sets (speed, relaxed
time instructions, and “set popular” instructions) on commonality of word
associations. Findings were generally supportive of the hypotheses: significantly
more common associations were given under speed as compared to relaxed
instructions, Ss were able to significantly increase their commonality scores
under a set to give popular responses, and need for social approval was
differentially related to commonality under relaxed but not under speed in-
structions. Contrary to a suggestion of Jenkins, social sensitivity (as reflected
in the need for approval measure) was not related to the ability to increase
one’s commonality score under “set popular” instructions.


Psychologists have shown considerable in- 
terest in the behavioral correlates of responses 
given to the Word Association Test since the 
free association technique was first introduced 
by Galton (1879). Early work with the free 
association method emphasized the utility of 
the technique in the clinic and in the field of 
criminology (Jung & Riklin, 1904; Marston, 
1920). More recently, interest has spread to 
a consideration of possible relationships be- 
tween word association responses and such 
personality dimensions as masculinity-femi- 
ninity (Terman & Miles, 1936) and social 
desirability (Sarason, 1959). The majority of 
such studies have attempted to relate psy-
chological variables to one particular aspect
of free association responses, namely, com-
monality of word association, i.e., the tend-
ency of an individual to give common or
popular responses. These efforts, however, 
have yielded meager results and, as Jenkins 
(1960) has noted, “Commonality has stead-
fastly refused to be linearly correlated with
various scales of introversion, personality,
achievement, and intelligence.”

Recently Jenkins (1960) has offered a
multidimensional interpretation of the com-
monality construct which identifies the de-
gree of commonality manifested by a person
as representing the extent to which the
individual is like the “standard” of the culture
on a variety of behavioral dimensions. Jenkins 
has reported a series of investigations of this 
notion, including attempts to relate com- 
monality to measures of verbal style, verbal 
learning, and social-personality variables. He 
summarizes his findings by emphasizing that 
commonality reflects strong intraverbal habits. 
Jenkins further states that commonality “ap- 
plies to such behavior only in the absence of 
other sets of stimuli which furnish cues about 
the desired or correct response,” and suggests 
that “the more completely the situation de- 
pends on intraverbal relations alone, the more 
markedly will commonality manifest itself. 
The more other determinants are introduced 
(in instructions, in the use of nonverbal 
stimuli, in constraints on the response prob- 
abilities, etc.) the less will be the influence 
of commonality.” In a recent study, Jenkins 
(1959) found that subjects gave significantly 
more common responses when the Word As- 
sociation Test was administered under in- 
structions to give “popular” responses than 
when it was administered under the usual 
“speed” instructions. The data also showed 
that most of the increase in commonality 
scores is accounted for by those individuals 
who had low scores under the usual instruc-
tions. Jenkins further reported that high
“gain scores” appeared to be related to “social
sensitivity,” according to reports of instructors 
who were acquainted with the subjects in- 
volved. 

Considering the effect of the test taking 
situation, Siipola, Walker, and Kolb (1955) 
have argued that more fruitful information 
would be obtained when subjects are given 
more time to respond (i.e., relaxed instruc- 
tions) than is usual with the speeded Word 
Association Test. They believe that this re- 
laxed condition approximates the degree of 
freedom provided in other projective tests. In 
support of this hypothesis, Dunn, Bliss, and 
Siipola (1958) have related certain aspects 
of word association behavior in the relaxed 
situation to aspects of personality, especially 
the Allport-Vernon-Lindzey measures, whereas 
Jenkins (1960) using commonality (which 
overlaps the Siipola measure) in a speeded 
situation failed to find any such relationships. 

The studies cited above suggest that the 
situation in which the Word Association Test 
is given may contribute significantly to the 
behavior observed and its psychological sig- 
nificance. The study to be reported here con- 
stitutes an attempt to (a) determine the 
effect of various instructional sets on com-
monality scores and (b) assess the relation-
ship between the tendency to give common
responses and a personality variable, the need 
for social approval. 

With respect to instructional sets, it was 
hypothesized that subjects would obtain sig- 
nificantly higher commonality scores under 
speed instructions than under relaxed in- 
structions. This hypothesis was prompted by 
the reasoning that under speed instructions 
raw intraverbal habit strengths are primarily 
operative, whereas under relaxed instructions 
idiosyncratic response tendencies are more 
likely to be manifest due to the opportunity 
provided for personal stimulus variables to 

affect verbal habits. It was further expected 
(cf. Jenkins, 1959) that subjects would be 
able to increase their commonality scores 
significantly under “set popular” instructions 
as compared to their performance under 
relaxed instructions. 

The need for social approval variable was 
selected for this investigation because it 
seemed particularly relevant to Jenkins’ 
(1959) suggestion that commonality scores
may be related to “social sensitivity,” or
to the extent to which the individual is like
the “standard” of the culture. The Marlowe-
Crowne (1960) Social Desirability scale was
used as the measure of the need for social
approval. Recently completed research
(Crowne & Marlowe, 1960; …land, 1961;
Marlowe, 1962; … Crowne, in press)
indicates that individuals
with a high need for social approval are more
sensitive and responsive to situational and
cultural demands than individuals with a
weaker need for social approval, and that high
need for approval individuals are defensive
in test taking situations and prone to give
guarded (common) responses.

Since verbal habits are largely culturally
determined, and since the free association
technique was employed in a test format, it
was hypothesized that high need for approval
subjects would obtain significantly higher
commonality scores under relaxed instructions
and under popular instructions as compared
to low need for approval subjects. No dif-
ference between high and low need for ap-
proval subjects on commonality was expected
under speed instructions where personal
stimulus variables are less likely to be
operative.

METHOD 


A total of 295 introductory psychology students
at the University of Kentucky participated in the
experiment. All subjects took the Marlowe-Crowne
Social Desirability (M-C SD) scale and the Kent-
Rosanoff Word Association Test during the …
week of the Spring 1961 semester. The association
test was readministered to these subjects approxi-
mately 4 weeks later. Both testing sessions took
place during regular class time.

The subjects were divided into two groups for
the testing. Group 1, which consisted of 153 subjects
(87 females, 66 males), took the Word Association
Test first under relaxed and then under popular
set instructions. Group 2, which consisted of 142
subjects (90 females, 52 males), took the association
test under speed and later under popular in-
structions. A total of 259 subjects, 134 from
Group 1 and 125 from Group 2, were present at
both test sessions. The specific instructions used in
each of these conditions were as follows.


Relaxed Set. When you open these sheets you
will see a list of 100 stimulus words. After each
word write the word that it makes you think of.
Start with the first word; look at it; write the
word it makes you think of; then go on to the
next word. Use only a single word for each re-
sponse. Work at your own speed, but do not skip
any words. Are there any questions? Ready, go.

Speed Set. When you open these sheets you will 
see a list of 100 stimulus words. After each word 
write the first word that it makes you think of; 
then go on to the next word. Use only a single 
word for each response. Do not skip any words. 
Work rapidly until you have finished all 100 
words. When you are through, turn your paper 
over and write on the back the letter that appears 
on the board at that time. Are there any ques- 
tions? Ready, go. 


After 4 minutes the letter A was printed on the 
blackboard. Every 30 seconds thereafter a new letter 
was written down in alphabetical sequence. 
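
The letter a subject recorded gives a coarse measure of completion time. A minimal sketch of the conversion follows. It assumes that each letter stays on the board for exactly 30 seconds until the next is written; the function name is hypothetical and the original report does not describe any such computation.

```python
import string

FIRST_LETTER_AT = 4 * 60   # seconds until the letter A is written (from the text)
LETTER_INTERVAL = 30       # seconds between successive letters (from the text)

def completion_window(letter):
    """Return (earliest, latest) elapsed seconds at which a subject who
    recorded `letter` could have finished the 100 words."""
    index = string.ascii_uppercase.index(letter.upper())
    start = FIRST_LETTER_AT + index * LETTER_INTERVAL
    return start, start + LETTER_INTERVAL

print(completion_window("A"))
print(completion_window("C"))
```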


Popular Set. I want you to take the same word
association test you took a few weeks ago. This 
time you are to give popular responses. That is, 
give the responses that you think would be given 
most often by young adults like yourself. Use 
only a single word for each response. Do not 
skip any words. Remember, next to each stimulus 
word you are to give the response that you think 
would be given most often by young adults like 
yourself. Work at your own speed, but don't 
spend too much time on any one word. Ready, 
go. 

Commonality scores were obtained for each sub- 
ject on both the first and second Word Association
Tests. In addition, difference or gain scores between
tests were computed for each subject present during 
both experimental sessions. Commonality scores were 
determined in the following manner. A separate 
sample of 332 introductory psychology students at 
the University of Kentucky were administered the 
Kent-Rosanoff Word Association Test under speed 
instructions during the Fall semester of 1960. 
Frequency counts were then computed for each 
response to the 100 stimulus words. A full scale 
commonality score was obtained for each experi-
mental subject by assigning the normative frequency
of occurrence to each response given to the 100
stimulus words. A subject's full scale commonality
score was the mean response frequency for 100
words. A second commonality score was obtained
by assigning each subject one point for each re-
sponse which was given by 50% or more of the
normative group. Thirty-three stimulus words
elicited responses which were given by at least 50%
of the normative group. These 33 primary responses,
all of which are also primaries on the Minnesota
word association norms (Russell & Jenkins, 1954),
are presented in Table 1. 
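
The two scoring rules described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: normative frequency is expressed as a percentage of the normative group, a response absent from the norms scores zero, matching ignores letter case, and all function names and toy data are hypothetical.

```python
from collections import Counter

def build_norms(normative_responses):
    """Map each stimulus to {response: percent of normative group giving it}."""
    norms = {}
    for stimulus, responses in normative_responses.items():
        counts = Counter(r.lower() for r in responses)
        total = len(responses)
        norms[stimulus] = {resp: 100.0 * n / total for resp, n in counts.items()}
    return norms

def full_scale_commonality(subject, norms):
    """Mean normative frequency of the subject's responses over all stimuli."""
    freqs = [norms[s].get(subject.get(s, "").lower(), 0.0) for s in norms]
    return sum(freqs) / len(freqs)

def primary_count(subject, norms, threshold=50.0):
    """One point per response given by at least `threshold` percent of the norm group."""
    return sum(
        1 for s in norms
        if norms[s].get(subject.get(s, "").lower(), 0.0) >= threshold
    )

# Toy normative sample of four respondents and three stimulus words.
norms = build_norms({
    "TABLE": ["chair", "chair", "chair", "food"],
    "DARK": ["light", "light", "night", "night"],
    "MAN": ["woman", "boy", "male", "person"],
})
subject = {"TABLE": "Chair", "DARK": "night", "MAN": "dog"}

full_scale = full_scale_commonality(subject, norms)
primaries = primary_count(subject, norms)
print(round(full_scale, 2), primaries)
```

In the study itself the norms came from 332 students and 100 stimulus words, and only the 33 stimuli with a response reaching the 50% criterion could contribute to the second score.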


RESULTS 
Prior to analysis of the experimental vari-
ables, the 33-item commonality scale was
evaluated with respect to its relationship to
full scale commonality scores. Product-
moment correlations were computed between
full scale commonality scores and the number
of primary responses given under various
instructional sets. The results of this analysis
are presented in Table 2 and indicate that
commonality scores based on the 33 primary
responses very adequately reflect the same
word association patterns as do full scale
commonality scores. Accordingly, the remain-
ing analyses were based on commonality
scores derived from the 33 primaries.


* Normative data are available from the authors
upon request.


TABLE 1


THIRTY-THREE PRIMARY RESPONSES WITH STIMULUS
WORDS AND NORMATIVE PERCENTAGES

Kent-Rosanoff    Stimulus and       UK % of
Number           response word      Occurrence

 1               TABLE-Chair        78
 2               DARK-Light         73
 5               MAN-Woman          84
11               BLACK-White        78
15               SHORT-Tall         53
21               SWEET-Sour         53
23               WOMAN-Man          70
24               COLD-Hot           53
25               SLOW-Fast          80
28               WHITE-Black        71
35               NEEDLE-Thread      59
39               CARPET-Rug         50
40               GIRL-Boy           83
41               HIGH-Low           71
43               SOUR-Sweet         65
48               HARD-Soft          71
49               EAGLE-Bird         70
52               LAMP-Light         64
57               BOY-Girl           88
58               LIGHT-Dark         57
64               COTTAGE-House      52
65               SWIFT-Fast         54
72               LONG-Short         85
76               BITTER-Sweet       69
77               HAMMER-Nail        62
81               BUTTER-Bread       59
83               LOUD-Soft          60
87               BED-Sleep          51
88               HEAVY-Light        60
89               TOBACCO-Smoke      50
92               SCISSORS-Cut       53
97               KING-Queen         84
99               BLOSSOM-Flower     71

The major findings with respect to the 
influence of different instructional sets on 
commonality of word associations are pre- 
sented in Table 3. 

The data in Table 3 are based on the 
combined results for males and females since 
the means and variances of commonality for
males were not significantly different from 
those for females in any of the conditions. 
TABLE 2


CORRELATIONS BETWEEN FULL SCALE COMMONALITY
SCORES AND THE NUMBER OF PRIMARY RESPONSES
UNDER VARIOUS INSTRUCTIONAL SETS

            Group 1                Group 2
       Relaxed    Popular     Speed    Popular
N        153        134        142       125
r        .97        .95        .96       .98


The results reported in Table 3 are in clear
support of the hypothesis that mean com-
monality scores will vary as a function of
instructional sets. As predicted, mean com-
monality scores under speed instructions are
significantly (p < .01) higher than mean
commonality scores under relaxed instruc-
tions. In support of Jenkins’ (1959)
finding cited above, Group 1 and Group 2
subjects were able to increase their com-
monality scores significantly under set popu-
lar instructions. As a further indication
that varying instructional sets exert a dif-
ferential effect on commonality scores, cor-
relations were computed between the two
test administrations for Group 1 (.71) and
Group 2 (.64). These r's indicate that ap-
proximately 50% of the variation in popular
set commonality scores is not accounted
for by knowledge of commonality scores
obtained under either relaxed or speed
instructions.
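The arithmetic behind these statements can be sketched as follows. The sketch assumes that the third numeric column of Table 3 is the standard error of the difference between means, so that each t is a mean difference divided by that standard error, and it takes the unexplained share of variance to be 1 - r². The function names are illustrative only.

```python
def t_ratio(mean_a, mean_b, se_diff):
    """Difference between two means divided by the standard error of the difference."""
    return (mean_a - mean_b) / se_diff


def unexplained_variance(r):
    """Proportion of variance in one score not accounted for by the other (1 - r^2)."""
    return 1.0 - r ** 2


# Speed vs. relaxed means and standard error as printed in Table 3.
print(round(t_ratio(17.34, 14.58, 0.88), 2))

# Correlations between administrations reported for Group 1 (.71) and Group 2 (.64).
for r in (0.71, 0.64):
    print(r, round(unexplained_variance(r), 2))
```

Under these assumptions the unexplained proportions come to about .50 for Group 1 and .59 for Group 2, so "approximately 50%" fits Group 1 more closely than Group 2.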

The findings concerning the influence of 
need for social approval on commonality 
scores are presented in Table 4. The mean
need for approval score (N = 295) was


TABLE 3


SIGNIFICANCE TESTS FOR DIFFERENCES IN COM-
MONALITY SCORES UNDER VARIOUS
INSTRUCTIONAL SETS


Condition      N       Mean     SE_D      t
Relaxed       153     14.58     .88     3.14*
Speed         142     17.34
Relaxed       134 a   14.47     .53     6.55**
Popular       134     17.97
Speed         125 a   17.42     .60     3.66*
Popular       125     19.62

a Includes only those subjects taking both tests.
* p < .01.
** p < .001.
15.84 with a standard deviation of 5.38.
High need for approval individuals were 
defined as those subjects who received a 
score of 20 or higher on the M-C SD scale. 
Low need approval individuals were those 
who scored 12 or below. These cutting points 
very closely approximate the upper and lower 
25% of the sample. The data reported in 
Table 4 provide partial confirmation of the 
hypothesis linking need for social approval 
to commonality of word association re- 
sponses. As predicted, high need for ap- 
proval subjects (Group 1) obtained signifi- 
cantly higher commonality scores than low 
need for approval subjects in the relaxed 
condition (t = 1.64). Under the popular set
condition this difference is in the expected 


TABLE 4


SIGNIFICANCE TESTS BETWEEN HIGH (H) NEED
APPROVAL AND LOW (L) NEED APPROVAL
SUBJECTS ON COMMONALITY


Condition             N(H)   N(L)   Mean(H)   Mean(L)     t

Group 1
  H vs. L Relaxed      35     41     16.49     13.80     1.64**
  H vs. L Popular      31     33     20.61     17.79     1.38*
  H vs. L Gain         31     33      3.94      3.64      .18
Group 2
  H vs. L Speed        36     38     17.53     17.76      .14
  H vs. L Popular      30     34     21.70     20.32      .69
  H vs. L Gain         30     34      3.68      2.85     [illegible]

* p = .08, one-tailed test.
** p = .05, one-tailed test.

direction but does not reach a significant
level (t = 1.38). Contrary to prediction,
gain scores (change in commonality scores
from relaxed to set popular instructions)
were not related to social approval scores
(t = .18).
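The statement that cutting points of 20 and 12 approximate the upper and lower 25% of the sample can be checked roughly against the reported mean and standard deviation. The sketch below assumes, purely for illustration, that need for approval scores are approximately normally distributed; the study reports no such assumption, and the actual quartiles came from the observed scores.

```python
from statistics import NormalDist

# Mean and standard deviation of need for approval scores reported in the text.
MEAN, SD = 15.84, 5.38

# Assumption for illustration only: scores are roughly normally distributed.
scores = NormalDist(mu=MEAN, sigma=SD)

# Scores that would cut off the lowest and highest 25% of such a distribution.
lower_quartile = scores.inv_cdf(0.25)
upper_quartile = scores.inv_cdf(0.75)
print(round(lower_quartile, 2), round(upper_quartile, 2))

# Share of a normal population at or beyond the study's cutting points
# (low: 12 or below; high: 20 or higher), treating scores as continuous.
share_low = scores.cdf(12)
share_high = 1 - scores.cdf(20)
print(round(share_low, 2), round(share_high, 2))
```

Under the normality assumption both cutting points fall within about half a scale point of the theoretical quartiles, which agrees with the description of the two groups as close to the extreme quarters of the sample.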

As expected, speeded commonality and
gain scores for Group 2 subjects were un-
related to need for approval scores. The
failure of these subjects to show a significant
effect under popular instructions was un-
expected, although this may be due to a
carry-over of responses from the previous
speeded situation. In any case, the effect is
in the expected direction and is not grossly
different from that found in Group 1.

Two additional findings give general sup- 
port to the above conclusions. In the relaxed- 
popular set group, those subjects who ob-
tained commonality scores in the upper
quartile on both the first and second admin- 
istration of the association test scored sig- 
nificantly higher on need for approval 
(t = 2.50, p = .02) than subjects who re-
ceived commonality scores in the lowest
quartile on both association tests. When 
commonality scores for high need approval 
and low need approval subjects were com- 
pared for the relaxed and speed conditions, 
it was found that high need approval sub- 
jects obtained scores of similar magnitude 
under both conditions, while low need ap- 
proval subjects under speed instructions 
obtained significantly higher commonality 
scores than low need approval subjects under 
relaxed instructions (t = 3.96, p < .01).
This finding indicates that high need ap- 
proval subjects contribute little to the over- 
all significant difference found between com- 
monality scores under speed and relaxed 
instructions. 


Discussion 


It is clear that individuals give significantly 
fewer common responses when the association 
test is administered under relaxed as com- 
pared to speed instructions. This finding is 
entirely consistent with the suggestions of 
Jenkins (1960) and Siipola et al. (1955) 
that a reduction in the number of common 
associations can be expected under testing 
instructions that provide greater opportuni- 
ties for personal stimulus variables to oper- 
ate. Under pressure to respond quickly, 
overlearned, stereotyped language habits are 
likely to overshadow any influence that per- 
sonality variables might exert. This finding 
is of considerable significance for the use of 
the word association technique as it is cus- 
tomarily employed in diagnostic settings. As 
typically used, the test is administered under 
a set to “respond as quickly as possible.”
Since idiosyncratic responses are those with 
the greatest diagnostic importance, the usual 
speed sets would seemingly reduce the effec- 
tiveness of the word association technique in 
eliciting diagnostically meaningful material. 
That speed instructions do not produce 
only those responses of greatest relative 
frequency is, of course, indicated by the 
significant increase in commonality scores
obtained by the same subjects when the
association test is administered a second time 
under a set to give popular responses. 

Although no specific attempt was made to 
test the “microgenetic” hypothesis advanced 
by Flavell and Draguns (1957; Flavell, 
Draguns, Feinberg, & Budin, 1958), the 
lower commonality scores obtained under 
relaxed as compared to speed instructions 
appear to contradict their notions. Flavell 
et al. suggest that in word association tasks, 
responses produced under time pressure 
should be more primitive and immature than 
those occurring under relaxed conditions. If 
this were so, we would have expected (con- 
trary to our predictions and findings) the 
relaxed instructions group to receive higher 
commonality scores than the speed instruc- 
tions group, unless there is some drastic shift 
in the norms under the two conditions, which 
is unlikely. A rough examination of the re- 
sponses given under relaxed instructions as 
compared to the normative data (obtained 
under speed instructions) revealed a flatten- 
ing of the response hierarchies with more 
idiosyncratic responses occurring, but no 
tendency for the most popular responses to 
change. 

The data provide partial support for a 
relationship between commonality of word 
association responses and the need for social 
approval. We would interpret the greater 
number of common responses given by the 
high need for approval subjects as reflecting
a combination of their test taking defensive- 
ness and greater sensitivity to the nuances 
of the culture. Previous findings with the 
M-C SD scale (cited above) have shown 
individuals high in need for approval to be 
more conforming or other directed, more 
responsive to situational demands, and more 
susceptible to the effects of social reinforcers 
as compared to low need for approval indi-
viduals. It seems reasonable to assume that
language habits which are primarily the 
products of social learning would be more 
consistent with cultural stereotypes in indi- 
viduals with a high need for social approval. 
In a situation where speed is emphasized,
social-personality factors are of less impor-
tance, as indicated by the lack of significant
findings under speed instructions. The failure
of Sarason (1959) to find any consistent
significant relationships between commonality 
and the Edwards Social Desirability scale 
may be due largely to the fact that Sarason 
administered the Word Association Test 
under standard speed instructions. 

A further result that appears to have im- 
portant implications for research in this area 
is that while the need for approval variable 
was related to commonality scores under 
relaxed conditions, the reverse was not true 
with the entire range of commonality scores; 
i.e., need for approval scores were not
predictable from commonality scores. Meth-
odologically, this effect suggests that com-
monality may be such a complex or multi-
dimensional trait that it would be difficult
to determine its relation with other psycho-
logical characteristics if predictions were
made only on the basis of commonality
scores.

The extent to which need for social ap- 
proval is associated with a general cultural 
sensitivity is apparently not related to the 
amount of increase in commonality scores 
from relaxed or speed instructions to that 
of set popular instructions. This result is 
contrary to Jenkins' (1959) suggestion that 
gain scores might reflect a heightened “social 
sensitivity,” but it should be noted that the 
effect, if any, would be difficult to detect 
due to the initial higher commonality scores 
of the high need for approval individuals. A 
more sensitive test of Jenkins’ suggestion 
would be afforded by a comparison of high 
and low need for approval subjects who were 
low on initial commonality scores. This com- 
parison was not possible here due to the 
small number of subjects meeting the initial 
qualifications.

As a final methodological note concerning 
future research on commonality using the 
100-word Kent-Rosanoff test, our findings
indicate that the list of 33 primaries can 
be readily used in place of scores based on 
the full scale with virtually no loss in in- 
formation and a very considerable saving in 
scoring time. However, it would probably 
still be necessary to administer the entire 
100 words since some primary associations 
may be influenced by previous associations
to nonprimary words. 
CRITIQUE AND NOTES


TEST ANXIETY AND INTELLECTUAL PERFORMANCE 1


IRWIN G. SARASON 
University of Washington 


High school students were administered 2 anxiety scales, the Test Anxiety Scale 
and a Need for Achievement scale. Scores on these tests were related to scores 
on the School and College Ability Test (SCAT). Test anxiety was found to be 
negatively correlated with SCAT scores. The negative correlations obtained
tended to be larger for female than for male Ss. The Need for Achievement 
scale showed only a slight tendency to correlate negatively with SCAT scores. 
The results were interpreted as being consistent with the conception of anxiety 
as an interfering nonintellectual influence on intellectual performance. 


It seems possible that anxiety scales which 
are specific to certain kinds of situations may 
prove to be more useful in those situations than 
general anxiety scales, i.e., scales dealing with
the subject's anxiety in a variety of situations.
If this is true, then it might be expected that 
a measure of test anxiety would relate more 
significantly to performance in a testing situa- 
tion than would a more general measure. Evi- 
dence is available in support of this expectation 
(Sarason, 1956, 1957, 1959). Of most relevance
to the present report is a study (Sarason, 1959) 
which concerned the relation of anxiety to sev-
eral indices of intelligence. The study, which
involved college students, showed that the Test 
Anxiety Scale (TAS) was significantly and 
negatively related to intellectual performance, 
but that more general measures of anxiety were 
unrelated to performance. 

The present report represents an attempt to 
extend these findings. In the present study, TAS 
was related to the intellectual performance of 
high school students and measures of intel- 
ligence different from those hitherto used were 
employed. These measures were scores obtained 
from the School and College Ability Test 
(SCAT) (Educational Testing Service, 1957) 
which is designed to provide a measure of the
student's ability to advance to higher levels of
schooling, e.g., the student's capacity to do col-
lege level work.

Numerous studies have been carried out in 
which SCAT scores have been correlated with
college academic performance (Educational 
Testing Service, 1958). Correlations of the 
order of +.50 have typically been reported
between these variables. Klugh and Bierley
(1959) have found validity coefficients for
SCAT to be at least as high as those reported
for the ACE, and about as high as those
reported for high school grades.

1 This research was supported by a grant from
the Agnes H. Anderson Research Fund.

On the basis of previous evidence it was pre- 
dicted that TAS would correlate negatively with 
SCAT scores. Furthermore, because of the 
greater heterogeneity of a high school, as com- 
pared with a college population, it was pre- 
dicted that TAS’s negative relation to intel- 
lectual performance would be even greater in 
the present research than in previous ones. 

In addition to TAS one other measure of 
anxiety was employed. This was the Need for 
Achievement scale (NAS) of the Autobiograph- 
ical Survey (Sarason, 1958b). This scale was 
selected for study in view of the frequent find-
ing of nonsignificant correlations between indices
of general anxiety and intelligence. The NAS
consists of items dealing primarily with anxiety 
over the pressure to achieve. It is a more gen- 
eral measure of anxiety than TAS in that its 
items are not limited to one particular class of 
situation. On the other hand, it does not seem 
to be as general a measure as, for example, 
Taylor's Manifest Anxiety scale. With these 
considerations in mind it was tentatively pre- 
dicted that NAS would be negatively related 
to SCAT but to a smaller extent than TAS. 


METHOD 


Subjects. The subjects were eleventh and twelfth
grade high school students. There were 219 eleventh 
grade subjects, 122 of whom were male and 97 of 
whom were female. The twelfth grade subjects 
numbered 241, of whom 110 were male and 131 
were female. The means and standard deviations of 
the TAS and NAS scores are presented in Table 1. 

Procedure. Level 2 of the SCAT was administered 
to subjects in the fall of the school year and the 
two anxiety scales were administered during the 
winter of the school year. The subjects had no 
reason to associate the two testings. 
TABLE 1


MEANS AND STANDARD DEVIATIONS OF TAS AND
NAS SCORES


                  Grade 11                 Grade 12
             N       M       SD       N       M       SD
TAS
  Male      122     6.52    3.46     110     6.75    3.74
  Female     97     8.03    4.61     131     7.52    3.85
NAS
  Male      122    13.56    4.62     110    12.8     4.44
  Female     97    11.51    ?.98     131    11.27    4.44


RESULTS


For both grades combined, the correlations
between TAS and NAS were .35 and .32 for
boys and girls, respectively. Table 2 contains
the correlations of TAS and NAS with SCAT
Verbal (V), SCAT Quantitative (Q), and SCAT
Total (T) converted scores, for male eleventh
graders. Table 3 contains the comparable cor-
relations for female eleventh graders. The re-
sults clearly demonstrate a significant negative
correlation between TAS and the SCAT. The
negative correlations appear to be stronger for
females than for males and this sex difference
was statistically significant in the case of SCAT
total scores.

The results for NAS were less consistent than
for TAS. Among male subjects there was no
evidence supporting the hypothesis of a negative
relationship between NAS and SCAT. However,
there was evidence in support of this hypothesis
for the female subjects.


TABLE 2


CORRELATIONS OF TAS AND NAS WITH SCAT FOR
ELEVENTH GRADE MALES


Test     SCAT V     SCAT Q     SCAT T
TAS      -.20*      -.27**     -.27**
NAS       .07       -.06       -.01

Note. N = 122.
* Significant at .05 level.
** Significant at .01 level.


TABLE 3


CORRELATIONS OF TAS AND NAS WITH SCAT FOR
ELEVENTH GRADE FEMALES


Test     SCAT V     SCAT Q     SCAT T
TAS      [illegible]
NAS      [illegible]

Note. N = 97.
* Significant at .05 level.


Table 4 presents the correlations for male
twelfth grade subjects and Table 5 correlations
for female twelfth grade subjects. As was true
for the eleventh grade subjects the TAS-SCAT
correlations were greater for females than for
males. In comparing the eleventh with the
twelfth grade correlations it will be noted that
in every case for each sex the TAS-SCAT r’s
for Grade 11 were equal to or greater than the
ones for Grade 12. However, only in the case of
the TAS correlations with SCAT, in eleventh and
twelfth grade females, was there a statistically
significant difference between r's.
As Tables 4 and 5 reveal, there was only one
significant correlation between NAS and SCAT
for twelfth grade subjects, that for NAS-SCAT
T for males (r = -.19).


TABLE 4


CORRELATIONS OF TAS AND NAS WITH SCAT FOR
TWELFTH GRADE MALES


Test     SCAT V     SCAT Q     SCAT T
TAS      -.14       -.20*      -.27
NAS      -.18       -.09       -.19

Note. N = 110.
* Significant at .05 level.
** Significant at .01 level.


TABLE 5


CORRELATIONS OF TAS AND NAS WITH SCAT FOR
TWELFTH GRADE FEMALES


Test     SCAT V        SCAT Q     SCAT T
TAS      -.33**        -.30**     -.36**
NAS      [illegible]   -.11       [illegible]

Note. N = 131.
* Significant at .05 level.
** Significant at .01 level.


Discussion


The present results provide the strongest
support thus far reported for the view that test
anxiety scores and intellectual performance are
negatively related. The hypothesis that the
nature of the anxiety-intelligence relationship is
a function of the anxiety scale employed was
given support. While, as expected, the NAS did
show a greater tendency to relate to intellectual
measures than have more general indices of
anxiety, the degree of relationship was by no
means strong.

The present results also suggest that the nega-
tive relationship between test anxiety and per-
formance is stronger for females than for males.
While the data cannot explain this sex dif-
ference, the indication that girls obtain higher
anxiety scores than do boys suggests the pos-
sibility that boys may find it more difficult or
more unacceptable to admit to anxiety than do
girls. This type of difference could well influ-
ence the relationship between anxiety and
intellectual performance for the two sexes.

The results of the present study, together
with previous findings (Sarason, 1960), clearly
pose several important questions. The first and
most general one is: why are test anxiety scores
negatively related to measures of intelligence?
Two possible answers come to mind. The first
is that high TAS subjects are realistically more 
anxious about taking tests because of an actu- 
ally lower intellectual level. The second is that 
subjects differing in TAS scores do not differ 
in intellectual level but do differ in the extent 
to which anxiety interferes with their perform- 
ance. At the present time there is no strong 
basis for favoring one of these two possibilities 
over the other. However, there is some avail- 
able evidence supporting the second possibility 
of test anxiety as a personality characteristic 
influencing intellectual performance. Two studies 
have investigated the correlation of a measure 
of test anxiety with scores of college students 
on aptitude tests and course grades (Sarason, 
1957; Sarason & Mandler, 1952). Both studies 
found test anxiety to be negatively correlated 
with aptitude test scores but not with course 
grades throughout the college years. On this 
basis one might infer that over the period of 
a college year test anxiety may dissipate but 
that there is no opportunity for this to occur 
during an aptitude testing session. 

If the above inference is correct, then, another 
important question would appear to be: can 
procedures be developed which will reduce the 
interfering effects of anxiety during testing? 
Evidence relevant to this question is meager. 
In one of the few experimental studies which 
have attempted to develop anxiety reducing 
procedures, it was found that reassurance did
seem to have a facilitative effect on the verbal
learning of high test anxious subjects (Sarason,
1958a).

If one regards the experiencing of anxiety as
a result of learning, then, a third question
arises: in what respects do subjects differing in
anxiety differ also in their histories and prior
learning experiences?


A NOTE ON “DOGMATISM AND LEARNING”


C. M. CHRISTENSEN
University of Alberta


In a study of dogmatism and learning with a sample of 166 students in an in- 
troductory psychology course, no confirmation was found of Ehrlich's findings 
that the Dogmatism Scale predicts classroom learning. There was positive sup-
port, however, for previous findings for the independence of aptitude and dog-
matism.


Rokeach (1960) hypothesized “. . . that the
[. . .] measured by the Dogmatism Scale, the more resist-
ance he will put up to forming new belief
systems” (p. 181). Following Rokeach, Ehrlich
(1961) obtained evidence supporting two related
hypotheses: dogmatism is inversely related to
classroom learning, and academic aptitude and
dogmatism are independent.


TABLE 1


INTERCORRELATIONS BETWEEN DOGMATISM, ACE, AND
PSYCHOLOGY TESTS FOR FEMALES AND MALES


                                     Psychology tests
                          ACE      Objective     Essay
Dogmatism                -.01        -.11        -.08
                          .00         .04         .?6
ACE                                   .48         .43
                                      .38         .40
Psychology-Objective                             [illegible]


Although not identical, the present study is 
essentially a replication of Ehrlich's (1961) 
study. The major difference is that Ehrlich’s
study concerned itself with learning of sociology
and the present study concerned itself with the
learning of introductory psychology. In addition,
the present study used two measures of class- 
room learning, multiple-choice and essay tests. 

The use of two measures of classroom learn- 
ing permits a check on Rokeach’s (1960) finding 
that the abilities to synthesize and analyze 
relate differentially to dogmatism. It was neces- 
sary, of course, to assume that the essay test 
was more of a measure of ability to synthesize 
than the multiple-choice test. 

Three hypotheses were tested separately for 
male and female students in the present study: 
dogmatism is inversely related to classroom 
learning of psychology, dogmatism is more 
closely related to ability to synthesize as meas- 
ured by an essay test than to ability to analyze 
as measured by a multiple-choice test, and dog- 
matism and academic aptitude are independent. 


METHOD


Samples. A sample of 117 female and a sample 
of 49 male Faculty of Education students registered 
in an introductory psychology course were ran- 
domly selected. All students in the Faculty of Edu- 
cation are required to take this course and the 
majority do so during their freshman year. 


Procedure. An academic aptitude test, American
Council on Education Psychological Examination
(ACE), was administered shortly before the start
of the term; Rokeach’s Dogmatism Scale was ad-
ministered midway through the term; and the essay
and multiple-choice tests measuring learning of
introductory psychology were administered at the
end of the term.


RESULTS AND DISCUSSION 


Intercorrelations of the four variables are
given in Table 1. The first value in each cell
of Table 1 is the coefficient for the sample of 
female students. Dogmatism scores did not cor- 
relate significantly with ACE or the psychology 
tests. Partial correlations, ACE held constant, 
were not substantially different or significant. In 
addition the data did not depart from linearity 
and the low correlations cannot be attributed to 
a restricted range of scores. 
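The partial correlations mentioned above can be illustrated with the standard first-order partial correlation formula. The note does not say how its partials were computed, so this is only a sketch, using the coefficients for the female sample as they can be read from Table 1; the function and variable names are illustrative.

```python
from math import sqrt


def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Coefficients for the female sample as read from Table 1:
# dogmatism-objective test, dogmatism-ACE, ACE-objective test.
r_dog_obj, r_dog_ace, r_ace_obj = -0.11, -0.01, 0.48
print(round(partial_r(r_dog_obj, r_dog_ace, r_ace_obj), 2))
```

Because dogmatism is almost uncorrelated with ACE, holding ACE constant changes the coefficient only slightly (from -.11 to about -.12), in line with the statement that the partial correlations were not substantially different.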

Only one of the hypotheses, independence of 
aptitude and dogmatism, was supported by the 
results. There was no evidence that dogmatism 
is related to classroom learning or differentially 
related to abilities to synthesize and analyze. 

The results of the present study contrast 
markedly with the findings of Ehrlich (1961). 
The differences between the present findings 
and those of Ehrlich cannot be attributed to a
restricted range of talent in the present study 
since the variances on the four variables were 
large enough to permit significant correlations 
if relationships had existed. Certainly Ehrlich's 
findings are plausible in terms of Rokeach's 
(1960) theory of “closed mindedness.” 
Perhaps the different results obtained in the 
two studies can be attributed to the learning 
tasks, sociology versus psychology. If it is 
assumed that psychology is a bit more 
rigorous and objectively oriented than sociology, 
one would expect dogmatism to be more im- 
portant in the learning of sociology than psy- 
chology. At least the present study indicates 
caution in generalizing about dogmatism and 
classroom learning. 
HYPNOSIS: EXPECTATION, SUSCEPTIBILITY, AND PERFORMANCE 1


DAVID ROSENHAN
Personality Research Group, Educational Testing Service


AND PERRY LONDON
University of Illinois

78 male undergraduates performed on a hand dynamometer, a weight endur- 
ance task, and a tremor task, prior to knowing that the experiment dealt with 
hypnosis. They were subsequently asked to volunteer for a hypnosis experi- 
ment. Of the 68 volunteers 16 were classified as susceptible to hypnosis and 16 
relatively unsusceptible on the basis of their performance on the Group Hyp- 
notic Susceptibility Scale. During the next session, all Ss were administered the 
physical performance tests in the unhypnotized and hypnotized states. The re- 
sults for the dynamometer and endurance tasks suggested that, if anything, dif- 
ferences between susceptible and unsusceptible Ss were more marked before 
they knew that the experiment dealt with hypnosis, becoming less evident dur-
ing the postknowledge conditions. For the tremor test, only slight differences
emerged between the susceptible and unsusceptible groups on the preknowledge
condition, but these grew increasingly larger during the postknowledge condi-
tions, favoring the susceptible group. Indeed, hypnosis seemed to markedly im-
prove the performance of susceptible Ss, while having little effect on unsus-
ceptible ones.


A peculiar finding which emerged from an 
experiment on the relative effects of hypnosis 
and motivation on physical performance (Lon- 
don & Fuhrer, 1961) was that people who are 
relatively unsusceptible to hypnosis are generally 
superior in performance to individuals who are 
very susceptible to hypnosis, including base rate 
measures of performance. Another study, more- 
over, in which people's attitudes towards co-
workers were examined, also found some base
rate differences between tranceable (T) and un- 
tranceable (UT) subjects (Fiedler, London, & 
Nemo, 1961). Comparable findings are not seen 
in the research literature on hypnosis, possibly 
because previous experiments have concentrated 
on the study of treatment variables (i.e., hyp-
notized-unhypnotized conditions) without much
consideration of subject variables (i.e., suscep-
tible-unsusceptible subjects).

1 This study was supported in part by a grant
to the first author from the Haverford College
Faculty Research Fund, and from the National
Science Foundation Research Participation Program
(G-15695).

Another peculiarity of the physical perform-
ance study was revealed in the analysis of the
weight holding endurance test. While hypnosis 
generally had no more effect in facilitating per- 
formance on this task than did simple instruc- 
tions to hold the weight up as long as possible, 
there was a significant interaction between sus- 
ceptibility and hypnotic condition. Analysis of 
simple effects revealed that UTs held the weight 
longer than Ts in all conditions, but that the 
imposition of hypnosis improved the perform- 
ance of Ts and depressed the performance of 
UTs relative to their own unhypnotized per- 
formances. 

There was no a priori reason to expect any 
of the differences which were obtained. More-
over, these differences could not be explained
by gross differences in the height, weight, or age 
of the comparison groups. The authors propose 
that there may be subtle personality differences 
between T and UT groups which account for 
their differing performances. It is conceivable, 
however, that anticipations about hypnosis were 
a critical variable in the performances of T and 
UT subjects. Perhaps the Ts, sensing that they
would be expected to improve their perform- 
ances when hypnotized, “unconsciously” de- 
pressed their performances during the unhypno- 
tized base rate condition. The UTs, on the other 
hand, sensing some threat in the experimental 
situation, may have been alerted to work harder
than they might otherwise have done. A sig-
nificant difference in motivation of the groups
may have been thus inherent in the experiment. 
Such an argument is a variation on what Orne 
(1959) calls “the demand characteristics of the
experimental situation,” but while Orne's thesis
refers to behavior during the hypnotic situation, 
the hypothesis offered here would extend to 
behavior alterations which result from the very 
idea that hypnosis will, sometime in the future, 
be used in the experiment. 

The present experiment served two purposes: 
it replicated the London-Fuhrer (1961) experi- 
ment on a male population, and it investigated 
the relative effects on performance of knowledge 
that the experiment involved hypnosis. For the 
latter purpose, it was hypothesized that base 
rate performances obtained prior to the subjects’ 
knowledge about the nature of the experiment 
would yield fewer differences between groups 
than would the same measures obtained after the 
introduction of hypnosis into the experiment. 


METHOD 
Subjects 


Seventy-nine male students, mainly freshmen and 
sophomores taking introductory psychology, were 
required to take several individually administered 
physical performance tests, ostensibly to complete a 
statistical assignment concerning the relationship 
between age, height, weight, and physical perform- 
ance. Actual scores were not revealed to the subjects. 

Ten days after all subjects had completed this 
session, the experimenter solicited volunteers for 
hypnosis research from the same class. Sixty-eight 
students volunteered and were administered the 
Group Hypnotic Susceptibility Scale (Rosenhan & 
London, 1961) in groups of three to five subjects. 
This scale is derived from Forms A and B of the 
Stanford Hypnotic Susceptibility Scale (Weitzen- 
hoffer & Hilgard, 1959), with an appropriate 
minimum of alterations which permit its use with 
small groups of subjects. 

Eligibility for subsequent phases of the experi- 
ment was determined by the same criteria used by 
London and Fuhrer (1961): subjects were cate-
gorized as Tranceable (T), i.e., highly susceptible to
hypnosis, if they passed at least 8 of the 12 items
on the Group Hypnotic Susceptibility Scale, in-
cluding at least 1 of the 4 most difficult items
(Numbers 4, 8, 10, and 12). To qualify in the
Untranceable (UT) category, i.e., relatively unsus-
ceptible to hypnosis, a subject had to fail at least
8 of the 12 items, including all of the 4 most 
difficult items. 
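The eligibility criteria just described can be written out as a small decision rule. The sketch below is illustrative only: the function name and the representation of a subject as the set of item numbers passed are assumptions, and item numbers are taken to run from 1 to 12.

```python
# The four most difficult items named in the text.
DIFFICULT_ITEMS = (4, 8, 10, 12)


def classify(passed_items, n_items=12):
    """Return 'T', 'UT', or None from the set of item numbers a subject passed."""
    passed = set(passed_items)
    n_passed = len(passed)
    n_failed = n_items - n_passed
    hard_passed = sum(1 for item in DIFFICULT_ITEMS if item in passed)

    # Tranceable: at least 8 items passed, including at least 1 difficult item.
    if n_passed >= 8 and hard_passed >= 1:
        return "T"
    # Untranceable: at least 8 items failed, including all 4 difficult items.
    if n_failed >= 8 and hard_passed == 0:
        return "UT"
    return None  # meets neither criterion


print(classify({1, 2, 3, 4, 5, 6, 7, 9}))  # passes 8 items, including item 4
print(classify({1, 2, 3}))                 # fails 9 items, all difficult ones
print(classify({1, 2, 3, 5, 6, 7}))        # passes 6 items: neither category
```

Subjects for whom the function returns None correspond to those who met the criteria for neither susceptibility category.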

Of the 68 volunteers, 33 failed to meet the 
criteria for inclusion in either susceptibility cate- 
gory, 1 was disqualified because of physical in- 
capacity, and 2 subjects failed to appear for all 
sessions of the experiment. Of those remaining, 16 
subjects were T and 16 were UT. 


Apparatus 


Three performance tasks were used in this experi-
ment, always presented to subjects in the following
order:

Hand dynamometer. A Stoelting Company pistol- 
grip dynamometer, calibrated in one kilogram units, 
was employed to measure grip strength.

Weight endurance. A 2.5-pound flat circular
weight, of the kind used in barbells, was taped
and equipped with a strap which would enable it
to remain stationary on the back of the subject's 
hand while his arm was outstretched. When held 
straight out from the shoulder and parallel to the 
floor, the weight could be brought into contact 
with a board which clicked audibly, indicating to 
the subject that he was maintaining the weight 
at the appropriate height, and closing an electrical 
circuit which operated a Standard Electric Company 
chronometer. 

Tremor test. Stability of eye-hand coordination 
was measured by the subject's ability to hold a 
metal stylus in a hole without touching the sides 
of the hole. The stylus was attached electrically to 
a metal eyelet which was embedded in a board and 
placed on a laboratory stand at shoulder height 
and arm’s length from the subjects. The eyelet was 
762 inches in diameter. Whenever the stylus made 
contact with the eyelet, an electrical circuit was 
completed so that an attached counter cumulatively 
registered one digit at each contact. 


Procedure 


Session 1, Preknowledge condition. For the "sta-
tistical assignment," as well as the postknowledge
experimental conditions, students were all tested 
individually. The subject was seated in a comfort- 
able armchair in much the same position one might 
normally be seated when hypnotized. The tasks 
were presented consecutively with one minute of 
rest between each. The dynamometer and endurance 
tests were always taken with eyes closed. For the 
dynamometer, the subject was asked to test his 
grip strength using his dominant hand. He was 
told not to begin squeezing the device until in- 
structed, and to continue pulling on it until told 
to stop. Five seconds were allowed for the task,
then a 30-second rest period, after which the task
was repeated. For the weight endurance test, the
subject was told to hold the weight up against a
board for as long as he could, and was informed
that the timing device would be operative only
when the weight actually pushed the board up. The
test was discontinued as soon as the subject dropped
his hand, bent his elbow, or otherwise gained sup-
port. After resting from the weight test, the subject
was told to remain seated, but to open his eyes
when the experimenter counted to three. The experi- 
menter then introduced the tremor test, specifically 
instructing the subject to hold his breath during 
the trials. This instruction corrected a methodo- 
logical error which invalidated the results obtained 
by London and Fuhrer (1961) with this measure. 
Three 30-second trials of the tremor test were ad-
ministered, with 30-second rests between each trial.

Session 2, Screening. The hypnotic screening 
session took place about 2 weeks after Session 1. 
Subjects who did not qualify as either T or UT 
were discharged after the susceptibility scale was 
administered. The remaining subjects were immedi- 
ately rehypnotized and given some simple prepara- 
tory instructions for the subsequent experimental 
sessions. Most of the instructions which the experi- 
menter read for this induction were taken from 
Form B of the Stanford scale, and the same in- 
structions were employed for the later induction of 
hypnosis during the experiment. When he had com- 
pleted reading the induction instructions, the ex- 
perimenter went on to read material which said, in 
effect, that the individuals were very good subjects 
for the purposes of this experiment, that they would 
be hypnotized again in later sessions, and that hyp- 
nosis would, in the future, be induced by a simple 
signal rather than a long and involved procedure. 
Subjects were then awakened and, individually and 
privately, asked about their subjective reactions to 
this experience.

Session 3, Postknowledge conditions. This session,
which took place from 3 to 8 days after Session 2, 
consisted of two parts: The Postknowledge Base 
Rate condition, and the Hypnotic condition. For the 
Hypnotic condition, the subject was hypnotized by 
means of the signal he had been taught previously, 
and the three tasks were administered in the same 
way as in the Preknowledge condition. The subject 
was then aroused from hypnosis. 

Session 4, Final screening. Two or three days 
after the third session, Form B of the Stanford 
scale was administered individually to determine 
whether the subject’s categorization had shifted 
during the experiment. Subsequently, the subject 
was told that the experiment was ended, asked for 
his subjective reactions to its various phases, and 
discharged. 


RESULTS 


Scores were computed for each of the three 
tasks as follows: For the dynamometer, scores 
equaled the average kilograms of pull exerted 
over two trials. For weight endurance, a score 
was the total time in seconds that the subject 


TABLE 1

MEAN ENDURANCE SCORES FOR ALL EXPERIMENTAL CONDITIONS

                               Unhypnotized    Hypnotized
Subject group   Preknowledge   postknowledge   postknowledge   Mean
Tranceable      175.94         151.37          152.69          (160.00)
Untranceable    187.56         153.06          153.44          (164.69)
Mean            (181.75)       (152.22)


Note.—Dimension means appear in parentheses. 


TABLE 2

MEAN TREMOR SCORES FOR ALL EXPERIMENTAL CONDITIONS

                               Unhypnotized    Hypnotized
Subject group   Preknowledge   postknowledge   postknowledge   Mean
Tranceable      46.06          39.00           27.81           (37.62)
Untranceable    46.87          40.62           42.56           (43.35)


Note—Dimension means appear in parentheses. 


held the weight aloft without supporting or 
bending his arm. For the tremor test, scores 
equaled the average number of touches recorded 
during three trials. Mean scores of the two
groups on the weight endurance and tremor tests 
are presented in Tables 1 and 2. 
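
The three scoring rules described in this section can be sketched as follows. This is a minimal illustration only; the function names and the sample values in the usage note are hypothetical and are not data from the study.

```python
from statistics import mean
from typing import Sequence


def dynamometer_score(pulls_kg: Sequence[float]) -> float:
    """Average kilograms of pull exerted over the two trials."""
    if len(pulls_kg) != 2:
        raise ValueError("expected two dynamometer trials")
    return mean(pulls_kg)


def endurance_score(seconds_held: float) -> float:
    """Total time in seconds the weight was held aloft on the single trial."""
    return float(seconds_held)


def tremor_score(touches_per_trial: Sequence[int]) -> float:
    """Average number of stylus contacts recorded over the three trials."""
    if len(touches_per_trial) != 3:
        raise ValueError("expected three tremor trials")
    return mean(touches_per_trial)
```

For instance, a hypothetical subject with pulls of 40 and 44 kg and tremor counts of 30, 27, and 24 would score 42 on the dynamometer and 27 on the tremor test.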

The data for each of the tasks were submitted 
to two separate analyses of variance, Type I 
(Lindquist, 1953, p. 267). These analyses con- 
sisted of one between-subjects dimension (T-UT), 
and one within-subjects dimension (Preknowl- 
edge-Postknowledge-Hypnotized). These analyses 
are presented for the weight endurance and 
tremor tasks in Tables 3 and 4. 

The T-UT dimension was not significant at 
the .05 level for any test. The within-subjects 
dimension was significant beyond the .001 level 
for both the weight endurance and tremor tasks, 
but not for the dynamometer task. Only the 
tremor data revealed significant interactions be- 
tween susceptibility and treatments (p < .01). 
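
As a rough arithmetic check, the F ratios in Tables 3 and 4 can be recovered from the reported mean squares. The sketch below assumes the usual arrangement for this kind of design, in which the between-subjects effect is tested against Error (b) and the within-subjects effects against Error (w); the function and dictionary names are illustrative only.

```python
def f_ratio(ms_effect: float, ms_error: float) -> float:
    """F is the effect mean square divided by its error mean square."""
    return ms_effect / ms_error


# Endurance scores (Table 3): within-subjects effects against Error (w).
ENDURANCE_ERROR_W = 728.71
endurance_f = {
    "treatments": f_ratio(9044.15, ENDURANCE_ERROR_W),
    "interaction": f_ratio(290.53, ENDURANCE_ERROR_W),
}

# Tremor scores (Table 4): groups against Error (b), the rest against Error (w).
TREMOR_ERROR_B = 1137.87
TREMOR_ERROR_W = 90.94
tremor_f = {
    "groups": f_ratio(787.76, TREMOR_ERROR_B),
    "treatments": f_ratio(1029.13, TREMOR_ERROR_W),
    "interaction": f_ratio(489.57, TREMOR_ERROR_W),
}

for task, ratios in (("endurance", endurance_f), ("tremor", tremor_f)):
    for effect, value in ratios.items():
        print(f"{task:9s} {effect:12s} F = {value:.2f}")
```

Under these assumptions the computed ratios agree with the tabled values to within rounding, and only the tremor task shows an interaction ratio well above 1.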

Analysis of simple effects for the weight en- 
durance data indicates that both T and UT 
groups performed better on the Preknowledge 
condition than on either of the conditions which 
followed the introduction of hypnosis into the 
experiment. There were no significant differ- 
ences in endurance at the .05 level between the 
two Postknowledge conditions for either group. 
While differences between subject groups are 
not significant for any treatment condition, they 
approach significance in the Preknowledge con- 
dition only, where UT subjects held the weight 
an average of 11.62 seconds longer than did the 
T subjects. 

On the tremor test, both groups improved 
significantly from the Preknowledge condition 
to the subsequent conditions, but the T group 
improved still more when hypnotized, while the 
UT group did not change significantly from one 
Postknowledge condition to the other. Differ- 
ences between subject groups are not significant 
at the .05 level for any single treatment condi- 
tion, but they approach significance in the 
TABLE 3
ANALYSIS OF VARIANCE FOR ENDURANCE SCORES

Source                               df      MS         F
Between subjects                     (31)
  Tranceable vs. Untranceable (T)    1       527.34     .93
  Error (b)                          30      5671.48
Within subjects                      (64)
  Treatments (A)                     2       9044.15    12.41**
  A X T                              2       290.53     .398
  Error (w)                          (60)    728.71
Total                                95

** p < .001.


Hypnotized condition, where UT subjects aver- 
aged 14.75 more touches (errors) per 30-second 
trial than did Ts. The significant interaction
between susceptibility and treatments for this
task may, therefore, be interpreted to mean that
task repetition and hypnosis facilitate the per-
formance of Ts more than of UTs.

Analysis of order effects for all three tasks 
indicates that the prior presentation of one 
Postknowledge condition does not significantly 
influence performance on the other, though there 
is a significant triple interaction for the tremor 
task.

A correlation of .86 between the Group 
Hypnotic Susceptibility Scale (GHSS) and the 
Stanford scale indicates that the GHSS is about 
as satisfactory an instrument for assessing sus- 
ceptibility to hypnosis as is the Stanford scale 
(since the retest reliabilities of the latter scale 
range from .78-.87). From the screening to the 
final session, mean scores for Ts rose .62 points, 
while mean scores for UTs rose 2.10 points.


DISCUSSION


The results of this study do not permit any 
easy integration of the findings from the three 
performance measures, either with respect to 


TABLE 4
ANALYSIS OF VARIANCE FOR TREMOR SCORES

Source                               df      MS         F
Between subjects                     (31)
  Tranceable vs. Untranceable (T)    1       787.76     .69
  Error (b)                          30      1137.87
Within subjects                      (64)
  Treatments (A)                     2       1029.13    11.32**
  A X T                              2       489.57     5.38*
  Error (w)                          (60)    90.94
Total                                95

* p < .01.
** p < .001.


the different treatments used or to the subject 
groups. Hypnosis clearly affects performance on
the tremor task, but not on the other tasks.
Moreover, the preknowledge condition on the
endurance task shows the greatest, rather than
the least, difference between groups, whereas on
the other tasks, preknowledge performance is
generally about the same for both groups. Under
the circumstances, it is necessary to discuss the
individual tests and treatment effects singly,
and to make comparisons between the London- 
Fuhrer (1961) experiment and the present one 
on that basis. 

In the London-Fuhrer (1961) experiment, 
women were selected as subjects largely be-
cause it was felt that the masculine challenge
implicit in the administration of strength and
endurance tests would preclude obtaining dif-
ferences between men as a function of shifts of
instruction. Since shifts of instruction are the
only operational changes involved in the dif-
ferent experimental conditions, it was not to be
expected that differences between males would
be obtained on these tests in the present experi-
ment. Yet on both dynamometer and endurance
tests, the mean performance of UTs is slightly
better than that of T subjects on all conditions,
including the preknowledge ones. This result is
plainly consistent with that obtained by Lon-
don and Fuhrer on these tests. On the endur-
ance test, moreover, the UTs do comparatively
better on the preknowledge than on the other
conditions relative to Ts. If this trend is con-
firmed by further experimentation, it would
suggest that preknowledge conditions elicit still
greater differences between groups than do post-
knowledge ones. Contrary to our original hy-
pothesis that knowledge of the hypnotic nature
of the experiment elicits artificial differences
between Ts and UTs, the present results could
more plausibly be interpreted as suggesting that
such knowledge tends to mask differences, bring-
ing the means of the groups closer together than
they would otherwise be.

With respect to the tremor test, it is quite
clear that while both groups improve signifi-
cantly with practice, only the T group improves
still further under hypnosis. Though differences
between the groups are not quite significant at
the .05 level on the hypnotized condition,
nevertheless it is evident that the imposition of
this condition has a beneficial effect only on the
T group. The tremor test seems to involve a
different order of experience and performance
than the other tasks. It is the only one of the three
tasks which is demonstrably facilitated by
practice, despite fatigue, and perhaps even
because of it. (The tremor test was always
administered last of the three, and both post-
knowledge conditions were administered at one
sitting.)

It is difficult to compare the results obtained 
on tremor in this experiment with those ob- 
tained by London and Fuhrer (1961) on women. 
Though the latter demonstrated a trend in favor 
of superior performances by UTs, the difference 
was not significant. More important, the authors 
specified that there had been a vital methodologi- 
cal flaw in their administration of the tremor 
test. The present experiment corrected the error 
in methodology, and the very clear results ob- 
tained here, together with the otherwise consist- 
ent results between the two experiments on 
dynamometer and endurance, suggest that differ- 
ences in the results on tremor may be most par- 
simoniously viewed as a function of methodology 
rather than sex. Conversely, lack of differences 
on dynamometer and endurance may best be 
viewed as representing a sex difference, as pre- 
dicted in the previous experiment. The differences 
between T and UT subjects, in any event, can-
not be properly explained as a function of ad-
vance knowledge that the experiment involved
hypnosis. 
RECALL IN NORMALS AND SUBNORMALS OF LIKE MENTAL AGE


N. O'CONNOR AND BEATE HERMELIN ¹
Institute of Psychiatry, Maudsley Hospital, University of London 


Normal children and young adult imbeciles were matched for mental age and 
were given sets of 6 pairs of words to learn by association. The words were re- 
corded on tape, and presented to subgroups at a sound intensity of 90 db. or 
55 db. Each subgroup was further subdivided, half the Ss being given 10 and half
given 20 repetitions. 1 minute, 2 days, and 1 month later they heard the stimu-
lus word of each pair and were asked to give the appropriate response word.
It was shown that there was no difference in learning or memory scores be- 
tween normals and imbeciles. Frequency of presentation affected immediate re- 
call scores of both groups, with intensity level contributing to a lesser degree. 
Memory over longer time intervals was determined by the amount initially 
learned, rather than by the conditions under which such learning had occurred. 


No adequate studies of recall among the 
severely subnormal have been presented in the 
postwar literature, although studies among 
normals are frequent. Clinical opinion varies. 
Tredgold and Tredgold (1952, p. 300) describe 
cases of mnemonic savants who are found oc-
casionally in mental deficiency hospitals. Evi-

1 Our thanks are due to J. Gibson and B. W.
Richards of St. Lawrence Hospital, Caterham, and
L E. Owen of Dog Kennel Hill School, who kindly
provided the facilities for this research.


dence showing a close parallel between long-
term memory and intelligence is rare. Sarason
(1952, p. 69) points out that of the 20 items 
in the Binet test, in which according to Thomp- 
son and Magaret (1947) normals surpassed 
defectives, 8 involved memory. He concludes, 
however, that as fear and anxiety adversely 
affect performance on memory items, such items 
cannot be relied upon as a valid indication of 
intellectual capacity. It is therefore uncertain
whether memory follows a normal or an ab-
normal pattern in imbeciles. Tredgold and Tred-
gold (pp. 93-94) quoting E. O. Lewis, say that
defectives do not recall so well because they
fail to record. The question remains, however,
whether they recall what they record. The ex- 
periment reported below attempts to answer this 
question for the conditions specified. 

House and Zeaman (1960) suggest that owing 
to attention difficulties many trials may pass 
before the learning curves of defectives show 
any rise. Alternatively, raising the intensity level
of a stimulus may help to focus attention on it.
Sheridan² stresses the importance of an ade-
quately high level of intensity of sound stimuli
for language development in young normal chil- 
dren. We therefore wished to explore the relative 
effects of frequency and intensity of presentation 
of verbal stimuli on both learning and recall in 
normals and subnormals. Pairs of associated 
words were to be used as material. One may as- 
sume that the ease of learning and remembering 
to give one word as a response to another, would 
be a function of the strength of association 
between them. We therefore used two sets of 
paired associates. One set contained word pairs 
which normal children frequently associated. 
The other contained words which they associ- 
ated less frequently. 


METHOD 
Subjects 


The subjects were 40 normal children and 40 
adult subnormal, institutionalized patients. An equal 
number of male and female subjects were used. 
The subnormals were clinically heterogeneous, and 
only those with hearing defects were excluded. Their 
IQs on the Stanford Binet ranged from 40-57; 
mean IQ, 46.4; with CA from 15-35 years, mean 
22 years. The mean mental age was 6 years 10 
months with a range from 5 years 10 months to 
7 years 6 months. The normal children were
matched with the patients for sex, and their
chronological ages were matched with the
mental ages of the patients. They were boys and
girls from the first two classes of an infant school 
in a predominantly working class area. It was 
assumed that the mental and chronological age of 
the normals would correspond. It followed from 
this assumption that the mental ages of the two 
groups were equal. No IQs were available for these 
children and regulations prevented testing. 


Procedure 


Subnormals and normals were each randomly al-
located to one of four subgroups of 10 subjects. The
number of boys and girls in each group was approxi-
mately but not precisely equal. The task was the
learning of words presented as paired associates. Two


2M. Sheridan, personal communication, March 
1962. 
different lists of six pairs of associated words were
used alternately, half the subjects being presented 
with List a and half with List b. The stimulus 
words in each list are the same, and the associates 
are those words which according to Woodworth 
(1938, p. 346) are most frequently given as spon- 
taneous responses by children (List a) and adults 
(List b), respectively. The words used as stimuli 
were: DARK, MAN, TABLE, DEEP, SOFT, and MOUNTAIN 
in that order. The corresponding responses for List a 
were night, work, eat, hole, pillow, and high and 
for List b light, woman, chair, shallow, hard, and 
hill. 

The words were recorded on magnetic tape and 
presented through a loud speaker. A Dawe noise 
level sound meter was used to determine the ap- 
propriate distance of the subject from the noise 
source at which the stimuli would be received at the 
specified intensities. The level of background noise 
in the experimental rooms was approximately 
40-45 db. Subjects were seen individually. They 
were told in advance that they would hear some 
words, two at a time, that word pairs belonged 
together and that they would hear them many times 
to help them to remember them. 

Four treatments were used with subgroups of 
10 normals and 10 subnormals. In these the pairs
of words were presented at either 90 db. or 55 db.
of sound intensity, with either 10 or 20 repetitions
for each intensity level. Temporal intervals were
1 second between associated words and 5 seconds 
between successive presentations of word pairs. 
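
The 2 x 2 treatment layout and the random allocation described above might be sketched as follows. This is a minimal illustration: the function name, the subject identifiers, and the use of a seeded shuffle are assumptions made here, and the sketch does not attempt the approximate balancing of boys and girls mentioned in the text.

```python
import random
from itertools import product

INTENSITIES_DB = (55, 90)
REPETITIONS = (10, 20)
SUBGROUP_SIZE = 10


def allocate(subject_ids, seed=None):
    """Randomly split 40 subjects into the four intensity x repetition subgroups."""
    ids = list(subject_ids)
    treatments = list(product(INTENSITIES_DB, REPETITIONS))
    if len(ids) != SUBGROUP_SIZE * len(treatments):
        raise ValueError("expected 40 subjects per population")

    rng = random.Random(seed)  # seeded only so the example is reproducible
    rng.shuffle(ids)

    # Consecutive blocks of 10 shuffled subjects go to each treatment.
    return {
        treatment: ids[i * SUBGROUP_SIZE:(i + 1) * SUBGROUP_SIZE]
        for i, treatment in enumerate(treatments)
    }


# Hypothetical usage: allocate the normals and the subnormals separately.
normal_groups = allocate(range(40), seed=1)
subnormal_groups = allocate(range(100, 140), seed=2)
```

Running the allocation once per population mirrors the statement that subnormals and normals were each allocated to four subgroups of 10.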

There were three recall sessions, after 1 minute, 
after 2 days, and after 1 month. In these only the 
stimulus word was given, at the same intensity level 
at which it had initially been presented. The sub- 
ject had to give the appropriate response word 
within 15 seconds, after which followed the next 
stimulus. The recall sessions consisted of a single 
response trial for each word. No corrections were 
given. It is possible to regard the first test as a
measure of learning or of immediate recall, while
responses given after 2 days and 1 month can be
regarded as short- and long-term recall, respectively.


RESULTS 


Mean scores for each group of 10 subjects 
out of an optimal six are given in Table 1. 

Analyses of variance for groups and treat-
ments on each of three separate occasions did
not show a significant difference between the
two word lists used. The data over Lists a and b
were therefore combined for a further analysis
of variance of treatments and occasions. The
results of this analysis are given in Table 2.

As the data over occasions were obtained
from the same subjects and correlated, while
that over treatments and groups were not, an
analysis of "mixed design" was used. To test


3 We are very grateful to A. E. Maxwell for his 
statistical advice on this analysis. 


the significance of the effects of groups and treat-
ments the "between people" term was used. The
remaining terms were tested against the residual. 
There was no significant difference between 
scores of normals and subnormals on any of the 
occasions or under any treatment. There was
also no significant difference for either group
between the number of items they recalled on
any of the occasions, nor did any of the inter-
actions reach the .05 significance level. How-
ever, the different treatments produced signifi-
cantly different scores in both groups, in im-
mediate as well as short- and long-term recall.
Subsequent t tests between
treatments showed that this difference is most
marked when a presentation at 90 db. intensity
and 20 trials is compared with one of 55 db.
and 10 trials (p = .001). However, even with
intensity held constant at either 90 db. or 55 
db. those subjects who had 20 presentations 
had significantly higher scores than those with 
10 presentations (p=.01 and .025, respec- 
tively). Keeping the number of presentations 
constant while varying intensity level has no 
significant effect, and under conditions of 90 db. 
intensity and 10 trials and 55 db. and 20 trials, 
the scores also do not differ significantly. This 
is shown in the totals of the row means in 
Table 1. 
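
The choice of error term described above can be illustrated with the mean squares reported in Table 2. The sketch below is an arithmetic illustration only: the names are hypothetical, and testing the groups by treatments interaction against the "between people" term is an assumption made here, since the text names only groups and treatments explicitly.

```python
# Mean squares as reported in Table 2.
MEAN_SQUARES = {
    "G": 4.004,
    "T": 12.204,
    "O": 0.129,
    "GxT": 2.037,
    "GxO": 1.954,
    "TxO": 0.412,
    "GxTxO": 0.504,
}
MS_BETWEEN_PEOPLE = 1.915
MS_RESIDUAL = 0.827

# Terms involving only between-subjects factors (GxT included by assumption).
BETWEEN_SUBJECT_TERMS = {"G", "T", "GxT"}


def f_ratios(mean_squares, ms_between, ms_residual):
    """Divide each mean square by the error term appropriate to it."""
    return {
        term: ms / (ms_between if term in BETWEEN_SUBJECT_TERMS else ms_residual)
        for term, ms in mean_squares.items()
    }


ratios = f_ratios(MEAN_SQUARES, MS_BETWEEN_PEOPLE, MS_RESIDUAL)
for term, value in ratios.items():
    print(f"{term:6s} F = {value:.2f}")
```

On this reading the treatments term gives by far the largest ratio, in line with its being the only effect marked as significant in the table.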


DISCUSSION 


In this experiment there was no significant 
difference in recall scores between normals and 
subnormals of like mental age on any of the 
three occasions. These results support the con- 
clusion that verbal material of the kind used 
in this experiment is relatively well retained 
by imbeciles. It should be noted, however, that
one trend in the data, although not to a signifi-
cant extent, was the tendency for reminiscence
in the normal group over the three occasions. 
This can be seen from the column totals in 
Table 1. There was no similar trend in the 


TABLE 1
MEAN SCORES BY GROUPS AND OCCASIONS

                          Normals                    Imbeciles
db.   Repetitions   1        2      1         1        2      1         Total of
                    minute   days   month     minute   days   month     row means
55    10            4.3      4.5    4.4       4.0      4.2    4.2       25.6
55    20            5.1      5.1    5.0       4.9      4.9    4.4       29.4
90    10            4.5      5.0    5.3       4.5      4.3    3.9       27.5
90    20            5.0      5.2    5.5       5.5      5.4    5.4       32.0
Total of column
means               18.9     19.8   20.2      18.9     18.8   17.9


Note.—Optimal score = 6. 




TABLE 2

ANALYSIS OF VARIANCE OF GROUPS, TREATMENTS,
AND OCCASIONS

Source                             df      SS         MS
Between groups (G)                 1       4.004      4.004
Between treatments (T)             3       36.613     12.204*
Between occasions (O)              2       0.259      0.129
G X T                              3       6.112      2.037
G X O                              2       3.908      1.954
T X O                              6       2.474      0.412
G X T X O                          6       3.026      0.504
Between People (within groups
  and treatments)                  72      137.867    1.915
Residual                           144     119.033    0.827
Total                              239     313.296


Note.—Mixed design. 

* F significant at .001 level. 
imbecile group which rather showed a decline in 
scores. Such trends might have differentiated the 
groups significantly if memory had been tested 
over a longer period of time. 

Frequency of presentation had an effect on 
immediate recall or learning scores. The result- 
ing differences in the scores continued to appear 
relatively unchanged in the two subsequent 
recall sessions. This suggests that frequency of 
presentation affects learning but not recall. 
Those who were presented with the material less 
frequently, retained as much of what they 
initially learned as did those who had more 
frequent presentations. Though frequency of 
presentation is the chief determinant of the 
amount learned, intensity level also appears to 
contribute. The best results are obtained under
conditions of many presentations at high in- 
tensity level. Many presentations at a lower 
intensity give no significantly better results than 
fewer at a higher intensity. 

It might have been expected that children and 
imbeciles would learn and retain better those 
paired associates which contained stimulus and 
response words most frequently found in chil- 
dren’s word associations. They did not. Though 
the two sets of responses differed in the fre-
quency with which they were given as associates
by children (Woodworth, 1938, p. 346), neither
was easier to learn nor better remembered than 
the other. A similar failure to obtain differences 
in the case of paired-associate learning as a 
function of the frequency of association has 
been reported by Haun (1960). Haun concludes 
that free association is not only a function of 
the strength of association between words but 
also of various uncontrolled factors such as 
familiarity of usage and set. Thus failure to 
respond correctly in recall (low strength of 
association) need not imply a weak tendency to
respond with that word to the same stimulus in 
a free association test. 

The practical implications of the results are 
that many repetitions and a high intensity level 
of stimulus presentations seem to provide 
optimal learning conditions for imbeciles, and 
that provided material is learned well it is 
relatively well retained by such subjects, com- 
pared with normal children of like sex and 
mental age. 
NEED CORRELATES OF "GULLIBILITY"


NEIL A. CARRIER ¹


Southern Illinois University 


The gullibility of college students in accepting false personality analyses of 
themselves was found to be positively related to the achievement, deference, 
and introception variables of the EPPS among male Ss. Among female Ss,
gullibility was positively related to abasement and introception and negatively
related to endurance.


The use of "fake" analyses to illustrate to 
students the dangers of the subjective valida- 
tion of personality tests can be a convincing 
demonstration of the importance of empirical 
procedures in validating such tests (Forer, 
1949; Stagner, 1958; Sundberg, 1955). The 
“gullibility” of students, as Forer and Stagner 
put it, in accepting fake analyses may be easily 
demonstrated. Stagner has reported that in 
using this technique 62% of a group of college 
students, 81% of a group of industrial super- 
visors, and 90% of a group of personnel man- 
agers gave an overall rating of a fake analysis 
of themselves as either “amazingly accurate” or 
“rather good.” The present writer’s experience 
with students of introductory psychology has
typically produced percentages comparable to 
those of the latter two groups. 

The question may be asked, however, whether 
certain personality characteristics of students 
are related to the degree of their acceptance 
of such analyses. Is their acceptance rendered
more understandable for having been labeled 
“gullibility,” implying some unitary trait, or 


1 The assistance of Robert A. Lorinskas in the 
preparation of data is gratefully acknowledged. 


may it better be viewed as motivated behavior 
whose influences are yet to be determined? 
Within the framework of need relevance theory, 
one may inquire whether certain need states 
mediate the degree of one’s acceptance of a 
personality interpretation in classroom, confer- 
ence, or psychotherapeutic situation. 

On several occasions the present writer has 
used the Edwards Personal Preference Schedule 
and the fake personality analysis items of Forer 
(1949), as revised by Stagner (1958), in class- 
room demonstrations. Since the promised 
“correct” personality scores were confidentially 
given at the end of each demonstration, the 
EPPS was chosen for its relative lack of 
potentially threatening “depth” variables. The 
use of the EPPS, however, provided a stimulus 
to a conceptualization of the kind of need vari-
ables likely to be related to the acceptance of
false analyses. 

Consequently, the descriptions and items of 
each of the 15 EPPS variables were examined. 
Seven were selected as having a possible rela-
tionship to the behavior in question. Seven cor-
responding hypothetical relationships to gullibil-
ity were predicted. They were:


1. High achievement subjects will be more
gullible than low achievement subjects. The
rationale was that when presented in a class-
room setting as a part of course work, the higher
achievement student would tend to accept a
presumed test produced personality interpreta-
tion more readily than the low achievement
student.

2. High deference subjects will be more
gullible than low deference subjects. The strong
implication within this variable of "getting sug-
gestions from others" and of "accepting the
leadership of others" suggested that the high
scorer would be more likely to accept the
instructor proffered fake analysis than would
the low scorer.

3. High autonomy subjects will be less gul-
lible than low autonomy subjects. The implica-
tion of independence of conformity and author-
ity by those scoring high on autonomy dictated
this prediction.

4. High introception subjects will be more
gullible than low introception subjects. The
introspective, self-, and other analyzing high
introception scorer was presumed to be more
likely to accept the results of a "personality"
test than would a lower scorer.

5. High succorance subjects will be more
gullible than low succorance subjects. The offer
by the instructor of an analysis concerned with
the subject's personal problems was presumed
[...]

6. High abasement subjects will be more
gullible than low abasement subjects. The de-
scription of the high abasement individual
seemed consistent with the expectation that he
would be more likely than would the low
abasement person to accept somewhat critical
([...]ing) personal comments.

7. High endurance subjects will be more gullible
than low endurance subjects. The greater
fidelity of the high endurance student was
presumed to mediate a greater literal acceptance
of the false analysis by him than by the low
endurance student.


METHOD

Subjects

Students enrolled in introductory psychology at
Southern Illinois University served as subjects. The
sample was composed of 87 males and 41 females.


Procedure

During the middle of the term subjects completed
the EPPS during one class period. They were told
they would be given their results later when the
unit of work on "personality and its measurement"
was studied. Two weeks later each subject was 
handed a sealed envelope with his name on it. 
He was asked to open it and study the contents, 
keeping them confidential. The enclosed "confi-
dential personality evaluation" was a mimeographed
two-page booklet presenting 25 statements of per- 
sonality characteristics. Thirteen of these were 
identical to those published by Stagner (1958). The 
remaining interspersed 12 statements were equally 
ambiguous, double-barrelled, and critical. The 
numbers of the 13 comments were circled in red 
pencil and the subject’s name was written in red at 
the top of the first page. Each subject thus appar- 
ently received a tailor-made report of his EPPS 
results, whereas all received identical fake reports. 

After examination of his report, each subject 
was asked to evaluate the accuracy of his person- 
ality description. He was provided with a “per- 
sonality evaluation rating sheet” containing a five- 
step scale with headings as follows: amazingly 
accurate, rather good, about half and half, more 
wrong than right, almost entirely wrong. He rated 
each of the 13 items marked in red pencil. 
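
The rating step above can be illustrated with a minimal sketch. The numeric coding of the five scale headings (5 for "amazingly accurate" down to 1 for "almost entirely wrong") is an assumption, since the report does not state how the ratings were quantified; the function name and the example ratings are likewise hypothetical.

```python
from statistics import mean

# Assumed coding of the five-step scale (not given in the report):
# higher values mean greater acceptance of the fake description.
SCALE = {
    "amazingly accurate": 5,
    "rather good": 4,
    "about half and half": 3,
    "more wrong than right": 2,
    "almost entirely wrong": 1,
}


def gullibility_score(ratings):
    """Return the mean numeric rating over the 13 red-circled statements."""
    if len(ratings) != 13:
        raise ValueError("expected one rating for each of the 13 items")
    return mean(SCALE[r] for r in ratings)


# Hypothetical subject: 6 "rather good", 4 "about half and half",
# 3 "amazingly accurate".
example = (
    ["rather good"] * 6
    + ["about half and half"] * 4
    + ["amazingly accurate"] * 3
)
score = gullibility_score(example)  # (6*4 + 4*3 + 3*5) / 13
```

Under this coding a higher mean indicates greater acceptance of the fake report; reversing the coding would only flip the direction of the scale.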

Following the collection of these rating sheets, 
subjects were asked to compare their reports; then 
the deception was revealed. The next class meeting 
presented the tabulation of ratings, the presentation 
and discussion of the genuine EPPS results, and the 
discussion of the problems of test validation in 
general. 


Analysis 


The ratings of the 13 false statements were 
quantified and a mean gullibility score was com- 
puted for each subject. While no differential pre- 
dictions were made by sex, the injunction of 
Carlson and Carlson (1960) to be aware of pos- 
sible sex differences in personality research prompted 
separate sex analyses. Gullibility score distributions 
were subjected to a median break to produce high 
and low gullibility groups (44 and 43 males, and 
21 and 20 females, respectively). EPPS score dis- 


TABLE 1

CHI SQUARE AND SIGNIFICANCE LEVELS FOR
TESTS OF HYPOTHESES

EPPS variable       Male χ²     Female χ²
Achievement          6.25*        2.80
Deference            6.02*        1.36
Autonomy              .08          .30
Introception        11.64**      10.62**
Succorance           1.57          .37
Abasement             .89        14.80***
Endurance             .70         6.79*ᵃ

ᵃ Direction of relationship opposite to that predicted.
* Significant at .05 level.
** Significant at .005 level.
*** Significant at .001 level.

TABLE 2
INTERCORRELATIONS OF SEVEN EPPS VARIABLES
Achievement | Deference | Autonomy | Introception | Succorance | Abasement | Endurance

Achievement 2» | 0 12 =.25 — 45 I 
Deference .05 —.04 222* —.17 .00 | .03 
Autonomy —.36 —A7 | —.02 —.40 | —.18 | .09 
Introception .02 —.06 | -16 =.14 | HOS mS 
Succorance .10 16 | 01 ays | .00 | —.40 
Abasement —.10 .01 —.08 290 16 | =12 
Endurance =,21 | —.01 —.01 —.16* —.15 FRO | 


Note.—Correlations for males appear above diagonal; correlations for females appear below. 


ᵃ Correlations most relevant to significant findings.


tributions were broken into thirds to produce high, 
moderate, and low scoring groups. Thus, a 2 × 3
contingency table was constructed for each hypothe-
sis. Distributions were tested for independence using
the χ² statistic for nonindependent proportions.
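
A 2 × 3 test of independence of the kind described above can be sketched as follows. This is an illustration, not the authors' computation. The cell counts are hypothetical (only the male group sizes, 44 and 43, come from the text), the function names are invented for the example, and the ordinary Pearson χ² is assumed. With 2 degrees of freedom the upper-tail probability has the closed form exp(−χ²/2), so no statistics library is needed.

```python
import math


def chi_square_independence(table):
    """Pearson chi-square and df for an r x c table of counts (list of rows)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


def p_value_df2(chi2):
    """Upper-tail probability of chi-square with exactly 2 degrees of freedom."""
    return math.exp(-chi2 / 2.0)


# Hypothetical counts. Rows: high and low gullibility (44 and 43 males).
# Columns: high, moderate, and low scorers on one EPPS variable.
counts = [
    [20, 15, 9],
    [9, 14, 20],
]
chi2, df = chi_square_independence(counts)
p = p_value_df2(chi2)
```

If each test did have 2 degrees of freedom, a value such as the 10.62 reported for Introception among females would correspond to a probability just under .005, which agrees with the double-asterisk marker in Table 1.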


RESULTS 


As Table 1 shows, three of the seven pre- 
dicted relationships were supported for male 
subjects. Statistically reliable positive relation-
ships were found between the gullibility meas- 
ure and the achievement, deference, and intro- 
ception variables. The remaining four predic- 
tions were not supported. Data from the female 
subjects supported the predictions regarding the 
introception and abasement variables. The 
hypothesis for the endurance variable was 
contradicted by evidence for a relationship op- 
posite to that predicted. The remaining four 
predictions were not supported. A check for 
relationships between the gullibility classifica- 
tions and each of the remaining nonpredicted 
EPPS variables showed no additional significant 
relationships for either sex. 

Table 2 shows the intercorrelations of the 
seven EPPS variables used in this study. It is 
apparent from the coefficients marked with an a 
that the significant findings of the study have 
a high degree of independence. 


DISCUSSION


The utility of the fake personality interpreta- 
tion technique as an instructional device has 
been demonstrated. But its use to produce a 
behavioral variable which itself may be studied 
has not been exploited. The finding here that 
certain EPPS measured personality variables 
are related to this behavior is not intended to 
imply that gullibility is a general trait broadly 
displayed by high achievement and deference 
males, high introception males and females, 
high abasement females, and low endurance 


females. Rather, such relationships may obtain 
in the kind of classroom situation employed in 
this study. Within this context the achievement 
oriented male student may display greater agree- 
ment with a fake analysis than one less achieve- 
ment oriented. Similarly, the deference, intro- 
ception, and abasement needs of subjects may 
operate in this context to produce such accept- 
ance behavior. Also, ad hoc re-examination of the 
contrary finding regarding the endurance vari- 
able with female subjects provides at least a
reasonable appearing view of the (compulsive) 
high endurance subjects as less likely to be 
induced to immediate acceptance by a proffered 
personality analysis than is the low endurance 
female. 

The results suggest the wisdom of need rele- 
vance theory. If behavior is oriented toward 
the reduction of certain needs, then task situa- 
tions containing cues for the gratification of 
these needs may set the occasion for certain 
need related task behaviors. It seems likely 
that the subjects in this study behaved in part 
according to such a paradigm. 

The different results from male and female
subjects give one more bit of support to the
Carlsons’ (Carlson & Carlson, 1960) plea that
sex differences be tested as a standard procedure 
in personality research. 

SOME EFFECTS OF SOUND, BACKGROUND BRIGHTNESS, AND
ECONOMIC STATUS ON THE PERCEIVED SIZE
OF COINS AND DISCS¹


DONALD D. DORFMAN AND ROBERT B. ZAJONC


University of Michigan 


"Rich" and "poor" children estimated the size of coins and discs under sound
vs. no sound conditions and 2 levels of background brightness. Contrary to pre-
diction, no relation was found to exist between sound level and size estimation,
although such a relation did exist for background brightness. The findings
failed to replicate the data of Bruner and Goodman, whereas they were in sub-
stantial agreement with those of Carter and Schooler. These results were as
follows: (a) large coins tended to be judged larger, and small coins judged
smaller than neutral discs of comparable size; and (b) no differential effect
was observed on the estimations of coins and discs as a function of economic
status. There was, however, a reliable tendency for the children of lower eco-
nomic status to overestimate both coins and discs when compared with those
of higher status.


In a classical study of the effects of motiva- 
tion upon accentuation of magnitude, Bruner 
and Goodman (1947) reported that coins were 
estimated as larger than neutral discs and that 
the overestimation was larger for children of 
low economic status. Similar findings were ob- 
tained by Ashley, Harper, and Runyon (1951) 
in a study in which fictional life histories of 
poverty and wealth were hypnotically induced, 
and by Lambert, Solomon, and Watson (1949) 
and Lambert and Lambert (1953) who reported 
accentuation of neutral discs when these were 
associated with candy. Other investigators have 
reported that social drives or incentives pro- 
duced increases in weight (Dukes & Bevan, 
1952) and brightness judgments (Atkinson & 
Walker, 1956). 

Conflicting results have also been obtained. 
Carter and Schooler (1949) failed to replicate 
the findings of Bruner and Goodman (1947), 
reporting that accentuation was a function of
coin size rather than value and that the smaller 
coins were actually underestimated when com- 
pared with neutral discs. This divergent result 
was also obtained by Bruner and Rodrigues 
(1953) who found that the penny and nickel 
were underestimated, and by Tajfel and 
Cawasjee (1959) with foreign coins. Intro- 
ducing a further complication, Rosenthal (1951) 
found the size estimations of rich 10-year-olds 


¹ This study was supported by Grants AF
49(638)-367 and G 4951 of the Air Force Office of
Scientific Research and the National Science
Foundation issued to the Research Center for Group
Dynamics. We wish to thank Arthur Platz for
many helpful comments.

to be greater than those of poor subjects for
all coins estimated and the reverse to be true 
for 6-year-olds. 

Although not customarily associated with the 
findings reported above, investigations of inter- 
sensory interaction have also yielded accentua- 
tion of stimuli with certain manipulations. For 
instance, Ide (1919) reported that both “cold” 
and “hot” weights were estimated as heavier 
than identical weights judged at room tempera- 
ture. Similarly, Newhall (1923) found that a 
sound or pressure stimulus increased the judged 
brightness of visual stimuli, and Cason (1936) 
that the judged intensity of either sound or 
light was enhanced when one stimulus was 
accompanied by the other. As for size accentua- 
tion, Kravkov (1933) reported that olfactory or 
auditory stimuli increased the apparent size of 
rectangles. 

The above data are not unambiguous. Never- 
theless, there are instances cited above where 
extraneous stimulation has produced accentua- 
tion effects. If it may be assumed that drive 
stimuli and external stimuli function in a 
similar fashion, then the motivational findings 
reviewed earlier may be viewed as special cases
of sensory interaction. If such is the case, then
external stimuli should have the same accentua-
tion effect upon estimation of coin size as, for
instance, low economic status and monetary
value. This point of view implies that per-
ceptual accentuation may occur irrespective of 
whether the extraneous stimulus bears any rela- 
tionship to the objects estimated. The following 
experiment was a test of this implication. A 
sound served as the extraneous stimulus and 

TABLE 1
NUMBER OF CASES IN EACH CONDITION

                        Coins           Discs
Variable             Rich   Poor     Rich   Poor

White background
  Sound                9      9        6      6
  No sound             9      9        6      6
Black background
  Sound                9      9        6      6
  No sound             9      9        6      6

Note.—N = 120.



was administered in the presence of coins or 
discs. Because of evidence that sensory inter- 
action effects on size estimation interact with 
the background brightness of the figure (Lon- 
don, 1954), the latter was also varied. 


METHOD 


Subjects. The subjects consisted of 120 children 
from 9 to 11 years of age. Half of the children 
were students at a grammar school located in a 
low income district and half at a grammar school 
in a high income district in the neighborhood of 
Detroit. The low income school was publicly sup- 
ported, the high income school privately endowed. 

Apparatus. A Hewlett-Packard audio-oscillator 
was used to present tones of 1000 cps through 
rubber-cupped earphones. Coins and discs were 
viewed through a tachistoscope adapted for this 
purpose. The major temporal relations were con- 
trolled by electronic timers. Size estimations were 
measured by means of a matching apparatus similar 
to the type employed by Bruner and Goodman 
(1947). A lens focused the light from a 60-watt 
incandescent light bulb upon a ground glass screen 
615" X 614". An iris diaphragm located between the 
light bulb and the lens produced a circular patch 
of light which varied in diameter from 1.50 to 6.00 
cm. by turning a knob on the front end of the box. 

Stimuli. The stimuli consisted of United States
coins of denominations 1¢, 5¢, 10¢, 25¢, and 50¢
and medium gray discs of 39% reflectance, matched
with the coin for size. Each stimulus was pasted
in the center of a rectangular piece of cardboard
9” × 11” × 6”. Two sets of cards were employed,
a homogeneous black and a homogeneous white.

Procedure. The study consisted of the manipula- 
tion of four variables: sound versus no sound, black 
versus white background, coins versus discs, and 
high versus low economic status. Independent 
groups of subjects were run under each combination 
of the above variables. The distribution of subjects 
and the various conditions are shown in Table 1. 
Within each condition, measures were obtained on 
all five stimulus sizes.

Each child was seated before the tachistoscope 
and the matching apparatus which were placed 


adjacent to each other on a table. The child was 
told that he was participating in a game, and that 
it was his job to make the circle of light on the 
screen the same size as the various objects which 
he observed when looking in the viewer of the 
tachistoscope. Both the stimuli viewed tachistoscop- 
ically and the circle of light were slightly below 
eye level. Before testing, the child was given a 
series of practice trials to acquaint him with the 
matching apparatus and the tachistoscope. No feed- 
back as to accuracy was given. During the test, 
each stimulus was presented tachistoscopically for a 
duration of .5 second and after offset of the stimu- 
lus, the child attempted to adjust the circle of 
light to the size of the object he had just seen.
Thus, matchings were made from immediate
memory. In the sound condition, a 1000 cps tone
(68 db., re .0002 dynes/cm²) was administered .5
second before onset of the coin or disc, with both
the sound and the latter offsetting simultaneously.
Each stimulus was presented twice in a random
order for a total of 10 matchings or trials for each
child. Intertrial interval was approximately 60
seconds.
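
For readers unfamiliar with the level notation used for the tone, the following sketch converts a sound pressure level to a pressure by the standard relation p = p_ref × 10^(L/20). The function and variable names are hypothetical; only the 68 db. level and the .0002 dynes/cm² reference come from the text.

```python
P_REF_DYN_CM2 = 0.0002  # reference pressure stated in the text, dynes/cm^2


def spl_to_pressure(level_db, p_ref=P_REF_DYN_CM2):
    """Sound pressure for a level in dB re p_ref: p = p_ref * 10 ** (L / 20)."""
    return p_ref * 10 ** (level_db / 20.0)


# The 68 db. tone used in the sound condition: roughly 0.5 dynes/cm^2.
tone_pressure = spl_to_pressure(68.0)
```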


TABLE 2

ANALYSIS OF VARIANCE OF APPARENT SIZE (cm.) OF
COINS AND DISCS UNDER VARIOUS CONDITIONS

Source                     SS     df      MS         F
Sound (S)                 .452     1     .452       1.59
Material (M)              .686     1     .686       2.41
Background (B)           1.235     1    1.235       4.33*
Wealth (W)               1.897     1    1.897       6.66*
S × M                     .007     1               <1.00
S × B                     .023     1               <1.00
S × W                     .191     1               <1.00
M × B                     .415     1               <1.00
M × W                     .002     1               <1.00
B × W                     .494     1     .494       1.73
S × M × B                 .058     1               <1.00
S × M × W                 .004     1               <1.00
M × B × W                 .419     1     .419       1.47
S × B × W                 .067     1               <1.00
S × M × B × W             .056     1               <1.00
Error (b)               29.623   104     .285
Size (Si)              124.087     4   31.022     705.045
Si × S                    .114     4               <1.00
Si × M                   1.736     4     .434      13.409**
Si × B                    .446     4               <1.00
Si × W                    .236     4     .059       1.341
Si × S × M                .072     4               <1.00
Si × S × B                .170     4               <1.00
Si × S × W                .113     4               <1.00
Si × M × B                .219     4     .055       1.250
Si × M × W                .036     4               <1.00
Si × B × W                .440     4               <1.00
Si × S × M × B            .081     4               <1.00
Si × S × M × W            .063     4               <1.00
Si × M × B × W            .004     4               <1.00
Si × S × B × W            .409     4     .102       2.318
Si × S × M × B × W        .026     4               <1.00
Error (w)               18.343   416     .044

* p < .05.
** p < .001.
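
As a consistency check on Table 2, the error degrees of freedom and two of the F ratios can be recomputed from the design described under Procedure. The sketch below is illustrative and assumes the usual mixed-design partition: four two-level between-subject factors and stimulus size as a five-level within-subject factor. The variable names are hypothetical.

```python
from math import prod

# Between-subject factors, each with two levels, as listed under Procedure.
between_levels = {"Sound": 2, "Material": 2, "Background": 2, "Wealth": 2}
n_subjects = 120
n_sizes = 5  # stimulus sizes, measured within each subject

n_groups = prod(between_levels.values())       # 16 independent groups
df_error_between = n_subjects - n_groups       # 120 - 16 = 104
df_size = n_sizes - 1                          # 4
df_error_within = df_error_between * df_size   # 104 * 4 = 416


def f_ratio(ms_effect, ms_error):
    """F is the effect mean square divided by the matching error mean square."""
    return ms_effect / ms_error


# Between-subject effects are tested against Error (b), MS = .285.
f_sound = f_ratio(0.452, 0.285)        # Table 2 reports 1.59
f_background = f_ratio(1.235, 0.285)   # Table 2 reports 4.33
```

Both error terms agree with the degrees of freedom printed in the table (104 and 416), which is some reassurance that the table was set up as a split-plot analysis of this kind.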


RESULTS AND DISCUSSION 


Contrary to the predictions proposed earlier, 
analysis of variance (see Table 2) revealed no 
significant Sound effect or Sound × Back-
ground interaction, although there was a small
tendency for the sound to decrease the apparent
size of all stimuli (see Figure 1). However,
the findings do show a reliable Background
effect (F = 4.33, p < .05, df = 1/104) (see Fig-
ure 1). As can be seen in Figure 2, both coins
and discs were estimated as larger when exposed
on a black background, and no interaction be-
tween material (coin versus disc) and back-
ground was obtained. Findings on the relation 

FIG. 1. Perceived size (diameter) of stimuli as a
function of sound level and background brightness.
(The diagonal line with slope of one represents per-
fect accuracy.)


of apparent size to background brightness have 
been reported for neutral stimuli (Oyama & 
Nanri, 1960), and are in accordance with the 
data reported here. These data suggest one con-
founding source of variance in that the con-
textual conditions within which the stimuli are
viewed are not the same among the different
studies. For instance, in some cases the objects
were placed on a table (Carter & Schooler,
1949) with background reasonably well-specified,
and in other cases in the child's hand (Bruner &
Goodman, 1947; Tajfel & Cawasjee, 1959) with
background unspecified. The present study in- 
dicates some need for future concern and control 
of background. 

In summary, the above findings do not sup- 
port the hypothesis proposed earlier that any 
stimulation will enhance apparent size ir-
respective of its relevance to the object to be 
judged. Although some data (Ide, 1919; 
Kravkov, 1933; Newhall, 1923) do suggest such 
an effect, the conditions under which it occurs 
are not presently specifiable. If these conditions 
can be determined, the above proposal may 
merit some future consideration. 

Most relevant to the motivational studies re- 
viewed earlier is the significant Size X Material 
interaction (F=13.41, p< .001, df= 4/416). 

TABLE 3

ANALYSIS OF VARIANCE OF APPARENT SIZE OF LARGE
AND SMALL STIMULI

                    Small stimuli               Large stimuli
                 (10¢, 1¢, 5¢ size)            (25¢, 50¢ size)
Source           df     MS       F            df     MS       F
Material (M)      1    2.10    15.63**         1     .22      .95
Error (b)       118     .13                  118     .23
Size (Si)         2    2.64    54.11**         1   23.41   601.85**
M × Si            2     .04    <1.00           1     .02    <1.00
Error (w)       236     .05                  118     .04

** p < .001.


As can be seen in Figure 3, the findings are in 
sharp contrast to those reported by Bruner and 
Goodman (1947) whereas they do in general 
confirm the data of Carter and Schooler (1949). 
The large coins (50¢, 25¢) were estimated as 
larger than the discs of comparable size, and the 
small coins (10¢, 1¢, 5¢) smaller than the cor- 
responding discs. Statistical tests revealed that 
the estimates of the small coins were signifi- 
cantly less than the small discs of the same size 
(F= 15.63, p< .001, df = 1/236); the large 
stimulus comparisons were in the opposite direc- 
tion, but not significant (see Table 3). Also 
relevant to the Bruner-Goodman study is the 
significant Wealth effect (F= 6.66, p< .05, 
df = 1/104). The estimates of coins as well as 
discs by the rich children were smaller than 
those of the poor children. Although poor 
children did estimate coins as larger than rich 
children, the absence of a Wealth X Material 
interaction is also in accordance with the findings
of Carter and Schooler in that no differential 
effect on coins versus discs was obtained as a 
function of economic status. 

In general, the findings obtained in the present 
study are in contradiction to the data reported 
by Bruner and Goodman (1947), but do pro- 
vide good support of those of Carter and 
Schooler (1949). The fact that estimations in 
the present study were made from immediate 
memory renders inadequate one explanation of 
the divergent findings in this area. It has been 
proposed that the Bruner-Goodman phenomenon 
is obtainable when the stimulus object is equivo- 
cal as in the case of judgments made from 
memory (Brown, 1961; Carter & Schooler, 
1949). In this study, the coins were present for 
only .5 second and judgments were made in the 
absence of the stimulus. If equivocality of the 
stimulus is the critical variable, it does seem 
reasonable that it should have yielded findings
similar to those of Bruner and Goodman in the 
present study. 

120 elementary school pupils in a lower-class neighborhood were tested in a
situation providing for measures of the occurrence and magnitude of a form of 
psychopathic behavior (stealing) and of temporal orientation. The 49 Ss who 
stole money from the E during her brief absence chose fewer temporal concepts 
and told stories with shorter time durations than the nonstealers. This relation- 
ship between stealing and time orientation was uninfluenced by race, sex, age, 
IQ, academic achievement, or school and home behavior problems. 


The finding that the time spans of stories by 
middle-class children were longer than those of
lower class² led LeShan (1952) to speculate con-
cerning a positive relation between limited time 
orientation and psychopathy and delinquency. 
His suggestion has produced uncertain evidence. 
Barndt and Johnson (1955) found that 17-year- 
old delinquent boys told stories with shorter 
temporal duration than matched nondelinquents 
but Davids and Parenti (1958), in a comparable 
study, obtained a difference in the opposite direc- 
tion. In the latter study, employing 11-year-olds, 
it was concluded that better adjusted children 
tended to be more present oriented than those 
judged as less well-adjusted emotionally. Davids 
and Parenti attempted to resolve their difference 
with Barndt and Johnson by suggesting that dif- 
ferent time orientations are “normal” at different 
developmental levels. A failure to demonstrate a 
relationship between frustration tolerance, an in- 
tervening explanatory construct proposed by Le- 
Shan, and time orientation was reported by Ellis,
Ellis, Mandel, Schaeffer, Sommer, and Sommer
(1955). Ambiguous results were obtained in stud-
ies comparing the time perspectives of adult
schizophrenics and matched controls (Wallace,
1956) and of high and low achieving preadoles- 
cent boys (Teahan, 1958). In both studies sta- 
tistically significant differences in the "predicted" 
direction were obtained for only half the story 
situations. 

In order to provide a clearer test of the hy-
pothesized relation between temporal orientation


¹ The invaluable suggestions and comments of
W. J. Meyer are gratefully acknowledged. The re-
search was conducted while the first author was at
the University of Pittsburgh.

² However, from their recent reanalysis of Le-
Shan’s data, Greene and Roberts (1961) concluded
that, “LeShan’s findings are equivocal and should
not be interpreted as clearly showing a difference in
time perspective between middle and lower class
children” (p. 141).

and psychopathy, an attempt was made to elimi-
nate some of the sources of ambiguity in previous 
research. A situation was designed to provide 
unequivocal measures of the occurrence and 
magnitude of a form of psychopathic behavior, 
stealing, and of the use of temporal concepts. 


METHOD 
Subjects 


Subjects were 120 children attending an elementary 
school in a lower-class neighborhood in Pittsburgh. 
The community consists of 40 city blocks between 
produce yards and a commercial area; tenements 
are located among small factories, warehouses, and 
freight yards. Typical occupations range from chronic 
unemployment to unskilled laboring jobs, the mean 
annual income being less than $2,000. Living is sub- 
standard with overcrowding and poor sanitation. 

In each of three grade levels (2-3, 4-5, 6-8) 40 
subjects were recruited, half of whom were males, 
and half of whom were Negroes. An analysis of 
variance of IQ and academic achievement ratings in 
this twelvefold classification yielded Fs less than 
unity. Within this design, subjects were selected on 
a random basis except that: children presenting disci- 
pline or behavior problems, e.g., truancy, of any kind
were excluded; no more than one subject was re- 
cruited from the same classroom during the same 
class period. 


Procedure 


An opportunity for stealing was devised in the fol- 
lowing fashion. The experimenter, officially present
in the school with facilities to conduct hearing ex-
aminations, sent a message to the teacher of the sub-
ject with a note explaining that the subject was 
needed by the experimenter. Upon entering the room, 
the subject saw the experimenter standing beside her 
desk in an apparently flustered fashion. Beside the 
experimenter and near the desk on the floor was 
a woman’s handbag, overturned, with its contents 
spilled out. Cards, a comb, assorted papers, and $.65 
(2 dimes, 2 nickels, and 35 pennies) were scattered 
about and nearby was an opened small change purse.
The experimenter said: 

TABLE 1
NUMBER OF SUBJECTS SELECTING VARIOUS TEMPORAL WORDS
AND NOTES

                        Stealers                          Nonstealers
Grade:        2-3         4-5         6-8         2-3         4-5        6-8
            Boys Girls  Boys Girls  Boys Girls  Boys Girls  Boys Girls  Boys
N:             9     7     9     3    16     5    11    13    11    17     4
MINUTE         3ᵃ    2     1     1     8     1     4     7     8    10     2
YESTERDAY      6     3     4     1     6     0     9    11    11    13     3
WEEK           6     2     3     1     4     2     8     7     6    11     3
TOMORROW       7     6     4     1     3     2     6    10     4    11     3
SECOND         6     1     5     0     3     2     3     4     7     9     2
HOUR           6     2     4     2     8     2     6     6     6    14     2
DAY            5     2     2     0     5     3     5     7     4    10     1
MONTH          5     3     2     0     4     2     3     8     4    10     2
TIME           0     1     3     0     5     2     8     9     8    12     3
YEAR           2     4     2     0     3     1     4     7     8     6     3
M           5.11  3.71  3.33  2.00  3.06  3.40  5.09  5.85  6.00  6.24  6.00

Note.—The higher the mean, the greater the number of temporal words chosen.
ᵃ Three of the nine boys in the second or third grade who stole money picked MINUTE in the word choice situation.


Look at this mess. I spilled my purse and every- 
thing fell out. I have to go up to the kindergarten 
now [the room most remote from the experimental 
room] and I don’t have time to clean this up. 
Would you mind getting these things together? I 
don’t care about the change. I don’t know how 
much it is but I think it is only pennies. Most of 
it rolled away I guess. Just try to get the papers 
together. I'll be back in about five minutes. 


In 5 minutes, the experimenter returned and asked 
the subject to sit down at the desk opposite her. 

A measure of the use of temporal concepts was 
obtained. The experimenter presented the subject 
with 20 index cards (3" X 5”) containing the fol- 
lowing words, boldly printed: SCHOOL, MINUTE, DOOR, 
YESTERDAY, HOUSE, WEEK, SAND, TOMORROW, WINDOW, 
SECOND, SHOE, HOUR, HOME, DAY, ROOM, MONTH, 
CHAIR, TIME, DOG, YEAR. Ten of the words, the even 
numbered in order of presentation, were considered 
time related concepts and the others, nontemporal. 
All 20 were matched for Thorndike-Lorge frequency.
The experimenter explained:


Now I want you to do something for me. Let’s 
pretend that you are going to tell a story to your 
teacher or to your mother. What are the most 
important things you would need to do this? 
Words, of course. Now, some words you will want 
more than others, some words will make your story 
better and easier to understand. I want you to 
choose ten words that you think are the very best 
words to tell your story with. Choose them care- 
fully. Let's read them together first to be sure you 
know them all.


The subject read the words aloud to the experimenter
who recorded the subject’s choices and put the cards
away. 


A second measure of temporal orientation was the
same as one employed by Barndt and Johnson
(1955), Davids and Parenti (1958), Wallace (1956),
and Teahan (1958). The experimenter said:


Now I want to see what kind of a story you
can tell me. I'll start a story and then you finish
it any way you want to. Do you understand?
Here is the beginning. About four o'clock one
bright sunny afternoon in May, two boys [girls]
are walking together down 39th street . . .


The experimenter recorded the story inconspicuously
and when the subject finished, inquired, “What time
is it now?” or depending on the story material, “How
long did this take?” The experimenter recorded the
child's estimate and her own. Twelve subjects were
either unable or unwilling to tell stories. However,
since they did not differ from the others on all
the obtained measures, this loss was not expected to
bias the results.
After the subject was dismissed, the experimenter
noted the amount of money stolen, if any.


RESULTS 


Forty-nine subjects stole; the mean amount
was 7.3 cents with a range from 1-55 cents. Ne-
groes and whites were equally represented among
stealers and nonstealers and racial background
was not related to amount stolen or to differences
on the temporal measures. The racial variable
was not further considered. The number of tem-
poral words selected is presented in Table 1. The
overall average for stealers was 3.63 and for non-
stealers, 5.82. Results for the second measure of
temporal orientation are presented in Table 2. In
six instances where the child's estimate of story
time span was different from the experimenter's,
the experimenter's estimate was employed. The 
overall average for stealers was 1.51 hours and 
for nonstealers, 2.53 hours. The results presented 
in Tables 1 and 2 were evaluated by analyses of 
variance (Table 3). The only significant outcome 
was a main effect of stealing versus nonstealing 
on both measures of temporal orientation. Sub- 
jects who stole selected fewer temporal concepts 
than nonstealers (p < .001) and told stories with 
shorter time spans (p < .01).

It was evident from the cell frequencies in 
Tables 1 and 2 that the proportion of males to 
females who stole tended to increase with age 
(grade). However, the same highly significant 
differences were obtained when separate analyses 
were made for males and females in the two 
higher grade categories. 

Differences between stealers and nonstealers in 
their selection of individual temporal concepts 
were evaluated by computing chi squares for the 
difference between proportions. For example, 
from Table 1, MINUTE was chosen by 33% of
the stealers and 58% of the nonstealers. For this
comparison, χ² = 6.35, p < .02. For all tempo-
ral concepts, the proportion for nonstealers was
greater than that for stealers. Chi square p values
for the other words were: YESTERDAY, p < .001;
WEEK, p < .02; TOMORROW, p < .30; SECOND, p
< .50; HOUR, p < .50; DAY, p < .30; MONTH,
p < .10; TIME, p < .001; YEAR, p < .001. These
results were used to examine two post hoc hy- 
potheses: nonstealers are more future oriented 
than stealers, i.e., prefer TOMORROW more than 
they prefer YESTERDAY; nonstealers, more than 
stealers, emphasize words denoting long duration 
(YEAR, MONTH, TIME) rather than short dura- 
tion (SECOND, MINUTE). The difference between
proportions for TOMORROW was not significant
but the corresponding difference for YESTERDAY
was reliable and it was more than three times
that for TOMORROW. Thus, to the extent that
YESTERDAY and TOMORROW reflect direction of 
orientation, differences in past rather than future 


TABLE 2

MEAN TIME SPANS OF STORIES TOLD IN
NUMBER OF HOURS

                 Stealers                Nonstealers
Grade        Boys        Girls        Boys        Girls
2-3        1.66(8)     1.20(5)      2.19(7)     3.79(12)
4-5        2.28(9)     0.35(2)      2.15(10)    2.28(17)
6-8        1.29(15)    1.38(4)      2.38(4)     2.27(15)

Note.—Cell Ns are in parentheses.



TABLE 3

ANALYSES OF VARIANCE OF TEMPORAL WORDS SELECTED
AND TIME SPANS OF STORIES

                     Temporal wordsᵃ               Story time
Source            df      MS        F           df      MS        F
Stealing (A)       1    16.845    23.47**        1    39.669     9.27*
Sex (B)            1      .242    <1             1      .376    <1
Grade (C)          2      .325    <1             2      .233    <1
A × B              1      .784     1.09          1    12.885     3.01
A × C              2     1.475     2.05          2      .127    <1
B × C              2      .081    <1             2     5.476     1.28
A × B × C          2      .550    <1             2      .426    <1
Error            108     .7177                  96    4.2774

Note.—Because subclass frequencies were unequal, an
approximation by Walker and Lev (1953, p. 381) was employed.
ᵃ The sum of squares for each cell was computed across
subjects, not across words.
* p < .01.
** p < .001.



orientation were more clearly associated with re- 
sponse to the temptation. Evidence on the second 
hypothesis was equivocal: nonstealers, in com- 
parison with stealers, did choose TIME and YEAR 
more than SECOND but differential selection of
MONTH and MINUTE was about the same for the 
two groups. 
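
The chi-square reported earlier for MINUTE (χ² = 6.35) can be approximated with the sketch below. Two assumptions are involved. The stealer count of 16 is the sum of the six stealer cells for MINUTE in Table 1; the nonstealer count of 41 is inferred from the reported 58% of the 71 nonstealers (120 subjects less 49 stealers), because the table as reproduced here does not show every nonstealer column. The function name is hypothetical, and the Yates continuity correction is used because it reproduces the reported value, which suggests but does not confirm that the authors applied it.

```python
def yates_chi_square_2x2(a, b, c, d):
    """Chi-square with Yates continuity correction for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2.0) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Rows: stealers, nonstealers. Columns: chose MINUTE, did not choose MINUTE.
# 16 of 49 stealers; 41 of 71 nonstealers (the latter inferred from 58%).
chi2_minute = yates_chi_square_2x2(16, 49 - 16, 41, 71 - 41)
```

Without the correction the same counts give a noticeably larger value, so the choice of formula matters when checking the other words against their reported p levels.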

An interesting question concerned magnitude of 
psychopathic behavior and temporal orientation. 
Product-moment correlations between amount 
stolen and the two temporal measures yielded 
coefficients close to zero (—.04 and —.10). 
Hence, the fact of stealing, rather than amount 
stolen, was a predictor of differences in tempo- 
ral orientation. 

The product-moment correlation between tem- 
poral words and story time spans for stealers 
was —.08 (df = 42) and for nonstealers, —.21
(df = 64). Thus, the two variables, need for 
temporal concepts in fantasy production and 
the time span of such productions, though 
conceptually similar, were empirically unrelated.

Thirty-three of the 49 stealers made no gesture 
of redressment to their victim (the experimenter) 
while 16, on her return, said that the money was 
all in the purse or that not all the money could 
be found. The 16 “redressers” differed from the
other stealers only in choosing fewer temporal
words (t = 1.81, p < .10, two-tailed). It is unclear
how this tendency might be interpreted.
Greater concern or guilt over the theft may have 
motivated relevant verbalization to the returning 
victim. However, a facile explanation for the 
money loss would not be unexpected from
persons with psychopathic predispositions. 

In sum, strong support was found for a posi- 
tive relation between stealing and limited tempo- 
ral orientation. This relation was not influenced 
by race, sex, age, IQ, or academic achievement, 
and it was obtained for two unrelated measures
of time orientation. More importantly, the pres- 
ent subjects were all normal in that they were 
unknown to have behavior or discipline problems. 


DISCUSSION 


An empirical generalization linking psycho- 
pathic tendency and temporal foreshortening 
would seem to require few restrictions or qualifications.
It might be alleged that, for the present
lower-class subjects, proficiency in stealing was a 
"realistic" mode of adjustment; at least one 
cannot assume that such behavior elicits the same 
severity of disapproval and sanction as in middle- 
and upper-class strata. However, the relation was 
obtained within lower-class subjects and there 
was no evidence that magnitude of pressure to 
steal, as measured by amount stolen, was dis- 
proportionately associated with any of the 
variables studied. 

A definitive explanation for the relation be- 
tween stealing and time orientation is undoubt- 
edly dependent on further experimentation. At 
present, it may be suggested that stealers cannot 
take account of the negative consequences of 
their actions since, to do so, implies an elaborated 
temporal perspective. When an impulse to violate 
some norm is aroused, as in the present tempta- 
tion situation, it is readily acted upon because 
a foreshortened temporal perspective prevents 
consideration of sanctions or other negative 
reinforcements which might occur in the future. 

A final issue concerns ambiguity in definition 
of the dependent variable. In their recent review 
of research on temporal experience, Wallace and 
Rabin (1960) emphasized the lack of a con- 
sistent theoretical and methodological approach 
to time perspective and time orientation, “with
the meaning of neither concept being explicitly
specified.” Suggestive psychoanalytic discussions 
(e.g., Fenichel, 1945, pp. 282, 587; Greenacre, 
1945) might be considered in subsequent research 
focused on experimental manipulation of time 
orientation in fantasy. 

A COMPARISON OF AUGMENTATION AND REDUCTION
AS MODES OF INFLUENCE ¹


KENNETH RING 


University of Connecticut


AND 


HAROLD H. KELLEY 


University of California, Los Angeles 


A 2 × 2 factorial experiment was conducted which attempted to influence
the opinion responses of Ss (introductory psychology students) through the
use of 1 of 2 methods, augmentation or reduction. Also varied was the
plausibility of the standard employed by the influencer to assess the
“correctness” of the Ss’ opinions. It was found that reduction (giving mild
praise for correct answers, but extreme criticism for wrong ones) when
coupled with an implausible standard produced the greatest concealment of
opinions, the least learning of the influencer’s standard, and the least
acceptance of it. With the plausible standard, reduction was not noticeably
different in its effects from augmentation (giving high praise for correct
answers and mild reproof for wrong ones).


If a teacher or trainer is to be effective in
a social learning situation, he must provide
a consistent schedule of reinforcements, rewarding
the behavior he considers correct
and/or punishing that which he considers incorrect.
Being consistent is sometimes made
difficult for him by the fact that the student
or trainee has ways of preventing his responses
from coming under the trainer’s
surveillance. For this reason, it is necessary
for the trainer to consider the effects of his
reinforcement schedule not only upon the
particular response discrimination he is trying
to teach but also upon the trainee’s tendency
to conceal or reveal his responses.

A factor that seems likely to be important
in affecting concealment (and subsequent
learning) is the relative magnitude of the
rewards and punishments the trainer delivers
when he does have information about the
trainee’s choices. We consider two cases: The
trainer delivers large rewards for the trainee’s
performance of the desired behavior but only
very small punishments for “errors,” or he
delivers only small rewards for correct
responses but massive punishments for incorrect
ones.

¹ This study was performed while the authors
were in the Laboratory for Research in Social
Relations at the University of Minnesota, under
Grant G-5553 from the National Science Foundation.

These two procedures correspond closely to
two modes of influence described by McGregor
(1948). The first is similar to the
method of augmentation wherein a powerful 
person (the trainer, to whom we will refer 
as A) provides rewards in order to induce the 
trainee, B, to perform the desired behavior, 
while the second is similar to the method of 
reduction under which, for the same purpose, 
A applies negative sanctions whenever B 
performs a behavior other than the desired 
one. McGregor was concerned with the 
relative effectiveness of these two influence 
methods, arguing that the judicious use of 
augmentation would be more likely to bring 
about the desired behavior in a subordinate 
than would the reliance upon reductive tech- 
niques. He did not, however, mention the 
implications of augmentation versus reduc- 
tion for response concealment, a factor which 
we believe cannot be disregarded in accounting
for the differences in learning and influence
attributable to these two techniques.

With respect to the effect of these methods 
upon B’s concealment of his responses, 
Thibaut and Kelley (1959) point out that in 
the case of augmentation B will, if necessary, 
present evidence to A of his compliance 
with A’s wishes in order to insure receiving 
his reward. If, however, A employs reduction 
as his technique, B may find it advantageous 
to conceal his behavior or in other ways to 
make it difficult for A to monitor it. The 
reason, of course, is that B can get the best 
outcomes the relationship has to offer (i.e.,
he can avoid punishment) through either
compliance or concealment. He will be especially
likely to choose the latter if he has difficulty
in complying as when, for example, he
simply does not know how to meet A’s be- 
havioral standards or prefers not to conform 
to them. In sum, augmentation should lead 
to showing which in turn will insure con- 
sistent feedback from A with the result that B 
should soon learn A’s behavioral standards. 
In contrast, reduction should, when com- 
pliance is difficult, lead to concealment and, 
hence, continued ignorance of A’s behavioral 
standards. 

These two modes of influence not only 
have different implications for self-monitor- 
ing and the learning of A's behavioral stand- 
ards, but also for B’s acceptance of the 
standards. Where augmentation is used B 
should, in time, show a tendency to accept 
A's standards and to adopt them as his 
own. This would be expected on the grounds 
that as a consistent source of reward to B, 
A would take on positive value for B. Hence 
B will come to value what A values and to 
disvalue what A disvalues. Where reduction 
is used, however, one would expect that 
while public conformity might be elicited, B 
will not accept privately A’s standards—at 
least not to the same extent that he would 
if A were employing the mode of augmenta- 
tion. Thus, under reduction training, even 
though B might eventually learn what A's 
standards are (despite his concealment 
tendencies), he would be less likely to accept 
them. 

Let us consider further the question of 
how the differences predicted above will be 
affected by the difficulty B experiences in 
complying with the behavioral standard A 
regards as correct. Let us assume that, at 
first, B has a bias to reveal rather than con- 
ceal his responses. Such tendencies would 
be expected in the numerous everyday re- 
lationships which are affected by norms which 
encourage commitment and candidness. Per- 
son B will, then, tend to reveal his responses 
to A unless actively discouraged from doing 
so. The augmentation procedure will not in- 
crease this incidence of "showing," but the 
reduction technique should depress it. 
Furthermore, the extent to which B will 
be discouraged from “showing” will depend
on how often he makes mistakes and is 


punished. If he makes them infrequently (as 
when A’s standard or criterion—as we shall 
henceforth call it—is an "easy" one in the 
sense that B already tends to share it or 
for some other reason prefers to make “cor- 
rect” responses), B will rarely be punished 
and will tend to maintain his initial bias to 
“show.” The augmentation procedure will 
yield much positive feedback to B as com- 
pared to the relatively neutral reactions 
obtained under the reduction procedure, and 
while this may make for more favorable
attitudes toward A, B should be able to learn 
A’s criterion as readily in the one case as in 
the other. 

On the other hand, if A’s criterion is a 
difficult or implausible one for B and he 
makes many mistakes, he will receive a good 
deal of punishment under reduction in con- 
trast to fairly neutral treatment under 
augmentation. The former will encourage B's 
tendency to conceal and thereby produce a 
large difference between the two procedures
with respect to the degree of learning what 
A's criterion is. The punishment which 
derives from the conjunction of the reduction 
technique and the implausible criterion will 
also be expected to produce disliking of the 
trainer and rejection of his criterion, follow- 
ing the argument given earlier. 

In the present study we have evaluated 
the effects of augmentation versus reduction 
as they interact with a behavioral criterion 
of high versus low intuitive plausibility. The 
plausibility of the criterion influences the 
probability that B will select it of his own 
accord without A's intervention. We have 
also used an interaction situation in which 
it would be natural for B to reveal his 
responses to A, at least at first. According to
the reasoning outlined above, we would expect
that the reduction procedure would have
a particularly pronounced effect when it is
combined with the low plausibility criterion.


METHOD 
Subjects 


Eighty males, all enrolled in introductory psychology
classes at the University of Minnesota,
were used in this experiment. 




Procedure 


Each experimental session involved one subject 
as a trainee and a confederate serving as the 
trainer.² Each subject was greeted by the experimenter
(KR) who informed him that he was
conducting a study of conceptions of mental ill- 
ness. The subject was asked to participate, he 
was told, in order that the experimenter would be 
able to teach more effectively about the subject 
in introductory psychology classes such as that in 
which the subject himself was enrolled. The ex- 
perimenter said that his assistant was actually 
carrying out most of the work on the project and 
that the subject would be interviewed by him (the
confederate, of course). This person was repre- 
sented as being a graduate student in clinical psy- 
chology who was, naturally, interested in the prob- 
lem of mental illness himself. He was also de- 
scribed as someone who had pretty strong opinions 
on the subject of mental illness and who would not 
be hesitant to express them. This behavior was 
justified on the grounds that it “tends to stimulate 
persons to think more critically about the subject 
of mental illness than perhaps they’ve done in the 
past.” At this point, the subject was told that 
afterwards he would be rated by the interviewer 
on either one of two characteristics: half the sub- 
jects were told the rating would be based on “how 
likable you seemed to be, how cooperative and 
agreeable and so on” while the other half were 
told that they would be rated on “how frankly and 
honestly you express your own opinions.” Everyone 
was encouraged to get as high a rating as possible. 
This manipulation was intended to affect the sub- 
ject’s willingness to express his opinions and his 
responsiveness to the confederate’s praise or reproof. 
However, subsequent analysis showed that it did 
not have the desired effect; indeed, it seemed to 
exert no effect whatever. Consequently, we shall 
discuss the results of this experiment without 
considering it further. 

The subject was then taken to another room 
where he was introduced to the confederate. The 
latter explained the experimental task to the subject. 
He was asked simply to go through a series of 
cards on each of which were printed two state- 
ments describing symptoms of maladjustment, 
MMPI-style. For example, the statements on Card 
Number 5 read: 

1. “I become pretty depressed when I fail at
something important.”

2. “I have nightmares every few nights.”

For each pair of items on each of the cards, the
subject was asked to indicate which item indicated
the greater degree of mental illness. Where he
could not decide or did not know, he was encouraged
to say so. (The trainer always responded
to a “don’t know” response with the noncommittal
comment, “Go on to the next card, please.”)
Hence it was not necessary for the subject to
commit himself on every pair of statements. There
were 30 cards in all.

² We are again indebted to John Hatton for
serving in this capacity.

As the subject proceeded through the series,
making choices, the trainer responded in one of
two ways. If a subject had been assigned to an
augmentation condition, the trainer responded very
generously when the subject’s choice was “correct”
(“That certainly shows a lot of insight, Mr. ____;
most persons don’t get that one” or “For a
beginning psychology student, Mr. ____, you’re
certainly doing very well”). Naturally, all these
comments and others were made at a time when
they would seem reasonable to the subject and
would have a good effect. When the choice was
“incorrect,” the trainer would pass it off lightly
(“Well, I guess I can’t go along with you there” or
“No, I’m inclined to choose the other one.”), the
comment being made in a friendly, corrective fashion
without a trace of irritation. If a subject had
been assigned to a reduction condition, he would
get token praise for being correct such as a simple
nodding of the trainer’s head or a passive “um
hum.” If, however, the subject did not agree with
the trainer, he was made to know it (“We have
been conducting this survey for 3 months and in
all that time I think you’re only the second person
who made that choice” or “Well, I’m sure you
must have a reason for that choice, Mr. ____,
but I’m darned if I can see what it could possibly
be.”). In such instances, the trainer would be
sarcastic, critical, and, toward the end, take on
an attitude of resigned exasperation.

When the subject had finished going through the 
stack of (30) cards, he was taken back to the 
experimenter’s room by the confederate and the 
experimenter, presumably interrupting himself from 
his other work, asked the subject if he would be 
willing to do several further tasks. When he in- 
variably consented, the experimenter had him 
complete the following tasks in this order: 

1. Go through a series of 42 cards (30 were 
ones he had already seen before—although in a 
scrambled order—and 12 were new, but similar) 
and indicate which item of each pair the trainer 
himself would select as representing the greater 
degree of mental illness. 

2. Go through the original set of items—again 
in a new order—and indicate his own opinions (both 
Tasks 1 and 2 were done silently in writing). 

3. Fill out a post-experimental questionnaire. 
These chores completed, the subject was given a
full explanation of the experiment, and before he 
departed was requested not to talk about it with 
other students. 

The statements printed on the cards were con- 
structed in such a way that for any given pair, 
one could be characterized as an anxiety statement 
while the other was a depression statement. Previous
research (Kelley & Ring, 1961) using these items
has shown that the probability of a subject’s selecting
the depression item on any given card is substantially
greater than that of his choosing the
anxiety item (depression items received about 65%
of the choices, anxiety items, 35%).


TABLE 1

TESTS OF EFFECTIVENESS OF THE MANIPULATIONS

                                   Experimental condition
Dependent variable                 Plaus-Aug   Plaus-Red   Implaus-Aug   Implaus-Red
Agreement with trainer
  during interaction               19.4        19.8        15.1          12.6
Ratings of trainer's
  criticalness                     -1.8        +0.7        -1.6          +0.8

In this experiment, the confederate regarded as 
correct the anxiety statements for half the subjects; 
for the other half, he deemed the depression items 
correct. Thus, in the first case, he was choosing 
items in accordance with a criterion of mental 
illness which had been viewed by previous subjects 
as relatively implausible. In the second case, his 
criterion was a relatively plausible one. 

Altogether, then, there were four conditions 
which may be labeled as follows, giving first the 
trainer’s criterion and then his method of influence: 
plausible-augmentation (Plaus-Aug),  plausible-re- 
duction (Plaus-Red), implausible-augmentation (Im- 
plaus-Aug), implausible-reduction (Implaus-Red). 
The N in each condition was 20. 
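
As a quick illustration of the design just described, the four cells can be enumerated by crossing the two factors. This is a minimal sketch; the variable names are hypothetical, and only the labels and the cell size of 20 come from the text.

```python
from itertools import product

# The two factors of the 2 x 2 design, using the paper's abbreviations.
criteria = ["Plaus", "Implaus"]   # plausibility of the trainer's criterion
methods = ["Aug", "Red"]          # augmentation vs. reduction
N_PER_CELL = 20                   # stated N in each condition

# Criterion is named first, then method of influence, as in the text.
conditions = [f"{criterion}-{method}" for criterion, method in product(criteria, methods)]
total_n = N_PER_CELL * len(conditions)

print(conditions)
print(total_n)
```

The total of 80 matches the number of subjects reported in the Method section.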


RESULTS
Effectiveness of the Manipulations 


Plausibility of the criteria. If what we
have asserted about the relative plausibility
of the criteria is true, we should expect to
find that agreement with the depression
criterion is significantly higher during the
trainer-trainee interaction. The first row of
Table 1 presents the relevant data giving
the average number of agreements with the
criterion for each condition.³ (On trials when
a subject failed to express an opinion he was
given half-credit for agreement.) By analysis
of variance, the criterion difference is significant
at better than .001. The interaction
term is not significant. (Two-tailed tests are
used throughout this paper.)

³ The difference between the Plaus-Red and
Implaus-Red conditions in number of agreements
is clear and unmistakable even by the end of the
first 10 trials.

Augmentation-reduction. If our manipulation
of this variable has been successful, we
should expect to find that the subject perceives
the augmentative trainer as more
generous and the reductive trainer as more
critical. On the post-experimental questionnaire,
we included items to elicit the subject’s
perception of the trainer with respect
to these characteristics. For each subject the
difference between his ratings of the trainer’s
criticism and generosity was determined. The
averages of these differences are given in the
second row of Table 1, a positive score indicating
that the trainer was judged relatively
more critical than generous. The only significant
effect is the augmentation-reduction
difference which is significant at better than
the .0001 level. Also relevant to this manipulation
is the item on the post-experimental
questionnaire which asked the subject to
indicate how much he liked the trainer. The
ratings made by the augmentation subjects
were significantly higher (p < .001) on this
dimension and there were no other significant
effects.
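
The agreement measure in the first row of Table 1, with its half-credit rule for trials on which no opinion was expressed, can be sketched as follows. The response labels and the function name are assumptions made for illustration; the paper states only the scoring rule itself.

```python
# Credit per trial: full credit for agreeing with the trainer's criterion,
# half credit when no opinion was expressed, none for disagreeing.
CREDIT = {"agree": 1.0, "no_opinion": 0.5, "disagree": 0.0}


def agreement_score(responses):
    """Sum the per-trial credit over one subject's interaction trials."""
    return sum(CREDIT[response] for response in responses)


# Hypothetical subject: 18 agreements, 4 refusals, 8 disagreements over 30 trials.
example = ["agree"] * 18 + ["no_opinion"] * 4 + ["disagree"] * 8
print(agreement_score(example))
```

Under this rule a subject who refused on every trial would score 15, the same as a subject who agreed on exactly half of the cards.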


Experimental Hypotheses 


Concealment of opinion. The frequency
with which the subjects refused to express
their opinions is indicated in the first row
of Table 2 which presents the mean number
of trials on which this occurred for the subjects
in each condition. Since there were 30
trials altogether, it can be seen that the
absolute level of failure to disclose opinions
was very low (less than 10%) for both the
plausible groups and the Implaus-Aug group.
For the Implaus-Red group, however, the
percentage is substantially higher, close to
25%. An analysis of variance shows both a
significant criterion and trainer orientation
effect, but, as is apparent, this is largely due
to the high mean of the Implaus-Red group.
The interaction sum of squares is, in fact,
greater than that of either main effect and
is significant at the .01 level. As anticipated,
the method of influence makes the greatest
difference when the trainer uses a criterion
of low plausibility.


TABLE 2

TESTS OF THE EXPERIMENTAL HYPOTHESES

                                   Experimental condition
Dependent variable                 Plaus-Aug   Plaus-Red   Implaus-Aug   Implaus-Red
Refusals to state opinion          2.65        2.25        2.45          6.95
Knowledge of the trainer's
  choices                          33.0        32.6        32.8          26.7
Adoption of the trainer's
  opinions                         +.30        +.23        +.27          -.14
Disagreement with opinions
  attributed to trainer            6.90        5.68        9.45          15.56
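
The shape of this interaction can be seen directly from the cell means for refusals in Table 2. The sketch below converts each mean to a percentage of the 30 trials and compares the effect of method (reduction minus augmentation) under each criterion. It works from the tabled means only, reading the Implaus-Aug entry as 2.45, and is not a substitute for the analysis of variance; all names are illustrative.

```python
N_TRIALS = 30

# Mean number of refusals per condition (first row of Table 2).
refusals = {
    ("Plaus", "Aug"): 2.65,
    ("Plaus", "Red"): 2.25,
    ("Implaus", "Aug"): 2.45,
    ("Implaus", "Red"): 6.95,
}


def percent_of_trials(mean_refusals, n_trials=N_TRIALS):
    """Express a mean refusal count as a percentage of all trials."""
    return 100.0 * mean_refusals / n_trials


def method_effect(criterion):
    """Reduction minus augmentation, within one level of criterion plausibility."""
    return refusals[(criterion, "Red")] - refusals[(criterion, "Aug")]


# Difference of differences: how much larger the method effect is
# under the implausible criterion than under the plausible one.
interaction_contrast = method_effect("Implaus") - method_effect("Plaus")

for cell, mean in refusals.items():
    print(cell, f"{percent_of_trials(mean):.1f}% of trials")
print(f"method effect (Plaus):   {method_effect('Plaus'):+.2f}")
print(f"method effect (Implaus): {method_effect('Implaus'):+.2f}")
print(f"interaction contrast:    {interaction_contrast:+.2f}")
```

The method effect is slightly negative under the plausible criterion and about four and a half refusals under the implausible one, which is the pattern the text describes.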

Learning the trainer’s criterion. The reader 
will remember that after the “interview” 
had concluded, the subject was returned to 
the experimenter who asked him to state 
which item of each pair he thought the 
trainer would select. In some cases he was 
reporting on items about which the trainer 
might already have revealed his opinion; in 
other cases, he was “guessing” how the trainer 
would answer a “new” item. (The two sets 
of items give identical results and are, there- 
fore, combined.) The index of knowledge of 
the trainer’s criterion (average number of
correct statements about how the trainer 
would have made his choices) is presented 
for each condition in the second row of 
Table 2. Inspection of the table reveals that 
all groups but the Implaus-Red have learned 
pretty well (a perfect score would be 42). 
Unfortunately, an analysis of variance 
showed neither main effect to be significant 
(the criterion effect reaches a level of sig- 
nificance between .05 and .10), nor was the 
interaction significant (.10 < p < .20), con- 
trary to our predictions. Nevertheless, the 
trend of the data is as expected, even though 
the significance levels leave much to be 
desired. 

In support of our assumption that con- 
cealment acts to retard learning, there was 
obtained, over all conditions, a significant 
negative correlation between frequency of
concealment and the index of learning
(r = -.33, p < .01).
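
For readers who wish to check this value, a product-moment correlation can be converted to a t statistic with n - 2 degrees of freedom. The sketch below assumes that the correlation was computed over all 80 subjects, which the phrase “over all conditions” suggests but the text does not state outright; the function name is illustrative.

```python
import math


def t_from_r(r: float, n: int) -> float:
    """t statistic for testing a product-moment correlation against zero (df = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


# Assumption: n = 80, i.e., every subject in the four conditions.
t_value = t_from_r(-0.33, 80)
print(f"t({80 - 2}) = {t_value:.2f}")
```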

Acceptance of trainer’s criterion. Another
of the post-experimental chores imposed on
the subject after his interview was again to
indicate which item of each pair on the
original 30 cards he regarded as representing
the greater degree of mental illness. When
compared with the answers he gave during
the interaction, these data give us some idea
of the degree of adoption of the trainer’s
opinions. An index of this was derived by
computing the difference between the number
of times the subject was in agreement with
the trainer on this task and the number of
times he had actually voiced agreement while
being interviewed, and dividing this difference
by the extent of possible change in the
direction of the difference. For example, if
a subject agreed on 20 items afterward, but
on only 10 during his interview with the
trainer, his adoption score would be
(20 - 10)/(30 - 10) = .50. A positive score
indicates the subject is closer to the trainer in
his opinions after the interview than he was
during it; a negative score means the reverse.
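
The adoption index just described can be written as a small function. This is a sketch under two assumptions that go beyond the text: when agreement falls, the “extent of possible change” is taken to be the number of agreements voiced during the interview (the most it could drop), and a subject with no change is given a score of zero.

```python
def adoption_score(agree_after: float, agree_during: float, n_items: int = 30) -> float:
    """Change in agreement with the trainer, relative to the change that was possible.

    agree_after  -- agreements on the written post-interview task
    agree_during -- agreements voiced during the interview
    n_items      -- number of cards in the original set (30 in this study)
    """
    difference = agree_after - agree_during
    if difference == 0:
        return 0.0  # assumption: no change scores zero
    if difference > 0:
        possible_change = n_items - agree_during  # room left to move toward the trainer
    else:
        possible_change = agree_during            # assumption: room to move away
    return difference / possible_change


# The worked example from the text: 20 agreements afterward, 10 during the interview.
print(adoption_score(20, 10))
```

Dividing by the possible change keeps scores comparable between subjects who began near the trainer's position and those who began far from it.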

The pattern of results yielded by the 
average adoption scores, shown in the third 
row of Table 2, is by now familiar. All groups 
except the Implaus-Red are influenced in a 
favorable direction. An analysis of variance 
results in a significant criterion effect (p
< .025), a significant effect due to method
of influence (p < .01), and a borderline
interaction (p = .06, approximately). In
summary, the pattern of results is very much 
like what one would anticipate from the data 
presented previously. 

Another analysis was performed which 
sought to get at the matter of acceptance 
from a different angle. It could be argued 
that the preceding analysis unfairly penalizes 
subjects in the Implaus-Red condition, yielding
low adoption scores simply as a reflection
of their lack of knowledge of the trainer's
criterion and not as any indication of their 
tendency to reject it. To eliminate any pos- 
sible effect of amount of knowledge of the 
trainer's criterion, actual agreement with it 
was disregarded and instead each person was 
scored in terms of his agreement with the 
opinions he imputed to the trainer, whether 
correct or not. Accordingly, the subject's own 
post-experimental choices on the original 30 
items were compared with those attributed to
the trainer and a count made of the number 
of instances of disagreement. The results are 
given in the last row of Table 2.⁴ As can
be seen, rejection is much greater when the
criterion of low plausibility is being employed,
as one would expect. However, the
greatest amount of rejection occurs in the
Implaus-Red group (p < .01 for the comparison
with the Implaus-Aug group). The
results of this analysis are therefore consistent
with the preceding ones which show the
powerful effect of the reduction procedure
when coupled with an implausible criterion.

⁴ For this analysis, the few subjects who concealed
their opinions very frequently during the
“interview” were eliminated because the rejection
indices computed for them would have been highly
unreliable, being based on few instances. One subject
had to be dropped from the Plaus-Red group
and four from the Implaus-Red group. It should
also be noted that inasmuch as the subjects were
forced to indicate what they thought the trainer’s
opinions were, to the extent that the Implaus-Red
subjects were less confident of their “guesses” than
the subjects in other conditions, this method of
analysis does not adequately control for differences
in lack of knowledge of the trainer’s criterion.


DISCUSSION


This study is the second in a series investigating
the effects of certain training
schedules on the performance of trainees. We
wish, in this section, not only to consider the
findings of the current study, but to relate
them to those of the first experiment. In that
experiment (Kelley & Ring, 1961), we compared
“trusting” and “suspicious” training
schedules in terms of their effects on the
trainee’s tendency to reveal his responses and
his learning to conform to the trainer’s behavior
standard. On each occasion during the
training when the trainee made a choice between
the two response categories with
respect to which the trainer was trying to
teach him a particular preference, he also had
the option of showing or hiding his choice.
The two schedules differed in the actions the
trainer took when he knew that the trainee
had concealed his response (but, of course,
did not know what the response was). A
suspicious trainer who assumed the trainee
had made the incorrect choice when he hid
was compared with a trusting one who under
similar circumstances assumed the trainee
had made the correct response. The former
procedure was found to be more effective in
encouraging showing and produced a higher
final rate of adherence to the trainer’s behavior
criterion.

These results can probably be generalized
to a number of real-life situations. Particularly
important are their implications about
the disadvantages of a trainer's providing
uncontingently positive feedback (as in the 
trusting condition) when the trainee makes 
some response analogous to hiding. Some 
cases likely to involve such feedback would 
include the winsome child who says something 
disarming just before the parent's evaluative 
comment is forthcoming, the boss who in 
presenting his ideas to a subordinate un- 
wittingly emphasizes his status so that the 
subordinate tends to become a “yes man,” 
the disabled person who emphasizes his 
handicap at appropriate times in a way that 
makes it difficult to make any but favorable 
comments about his performance, and the 
insecure person who by his extreme self-deprecation
tends to extract a perfunctory
compliment no matter what his performance.
Under circumstances such as these, the
trainer (parent, subordinate, etc.) may be
tempted to act in a manner similar to the
trusting trainer, giving positive feedback
regardless of the performance as long as it
is accompanied by the appropriate auxiliary
response (hiding in the experiment; the
charming comment, allusion to status, etc.,
in the examples). The results of the first
experiment suggest that if the trainer succumbs
to this temptation, the auxiliary response
may be learned and this will operate
to the detriment of the trainee's learning the
true evaluations others place upon his behavior.

Both the earlier study and the present one 
show that it is desirable for a trainer to con- 
sider whether his trainee has opportunities 
to make relevant choices while not under 
the former's surveillance. If so, he must take 
account of possible effects of his training 
schedule on encouraging the trainee to attempt
such concealment. The first study
deals with the case where the trainer knows
about all the occasions on which the trainee
has made a relevant choice. Under these
conditions the suspicious schedule, which
assumes the worst of the trainee, is equivalent
to punishing all instances of hiding and dealing
with instances of showing in terms of
their merits. This method is more effective,
we have said, both in encouraging showing
and learning of the trainer's criterion, than
the trusting procedure which tends to
reward hiding.

The present study has application to a
broader range of training situations because
it involves schedules which permit the
trainer to make no response or merely a
noncommittal one when he fails to detect a
relevant choice. Thus, he need not be aware
of many of the instances on which the
trainee confronts such choice situations. Our
evidence suggests that as long as he uses an
augmentation technique, providing large rewards
for revealed compliance and being
tolerant of revealed noncompliance, the
trainee will tend to maintain an adequate rate
of showing his responses and will, then,
satisfactorily learn the trainer’s criterion. On
the other hand, a reduction method with its
emphasis on punishment for errors will discourage
showing and interfere with learning
the criterion.

What are the implications of these two
studies considered together? The first sug-
gests the value of using punishment to dis-
courage hiding and the second, the value of
reward to encourage showing. Perhaps the
best overall strategy which would combine
the advantages of the suspicious and augmen-
tation schedules would be one that provides
punishment for detected instances of con-
cealment, is only very mildly punishing
(merely to let the trainee know he is wrong)
for revealed errors, and is strongly approving
for revealed correct behaviors. When it is not
clear that a relevant response has been made,
a neutral reaction from the trainer is prob-
ably best. To attempt any other treatment is
to run the risk of providing a reinforcement
schedule that affords no basis for learning
either to show or to make the desired choice.

Perhaps the most important finding of
the present study relates to the interaction
between the trainer’s influence method and
the plausibility of his criterion. Specifically


the reduction mode of influence with the
implausible criterion to depress considerably
the level of revealing one's opinions, learning
the trainer’s criterion, and accepting the
trainer’s criterion. We would explain this out-

trainer was made deliberately small by using
an implausible criterion. Finding himself 
often in error and being rather harshly 
criticized for it, the subject comes to feel 
rather anxious and confused about having to 
make any kind of choice at all—his expected 
outcome is not very enticing. If he refuses 
to make a choice he can avoid the censure 
of the trainer. It is true that in so doing he 
passes up the opportunity for a small reward 
(in the form of mild praise), but the prob- 
ability of his receiving this small reward is 
substantially lower than that of his receiving 
condemnation from the trainer. So he begins 
to conceal his opinions (at least, relatively 
speaking, more than subjects in the other 
conditions do). This results, of course, in a 
noncommittal response from the trainer. 
Thus, to the extent the subject refuses to 
commit himself, he fails to find out anything 
from the trainer concerning his (the trainer’s) 
opinions. So the trainee’s learning is impaired. 
There are two reasons why the trainee does 
not adopt the trainer’s opinions. One obvi- 
ously derives from the fact that, because of 
his refusal to express his own opinions, the 
trainee simply does not know what the 
trainer’s opinions are. The second factor is 
the negative affect that is generated by the 
reduction technique. The trainee just does 
not care very much what the trainer's 
opinions are and when he does have some 
ideas about them, he has no motivation what- 
ever to accept them. It is quite possible, of 
course, that a lowered motivation to learn 
(as well as the anxiety associated with mak- 
ing a choice) might in part account for the 
low level of learning manifested by this 
group of subjects. 

The interaction between the trainer's 
method and his criterion which our results 
so clearly indicate has one possible implica- 
tion that is of considerable interest, especially 
as concerns practical applications of these 
results. Reduction is least effective, indeed 
most detrimental, when the trainer’s standard 
is such that the trainee makes many errors 
at the outset of their interaction. Consider 
now a real trainer—not one who is programed 
to follow consistently one method or another, 
but one who is free to change his technique
when he feels like it. Is it not likely that
whatever method he starts with, a high rate
of mistakes by the trainee will tend to frus- 
trate him and cause him to turn to reduction? 
The point is not that this will happen every 
time but simply that the spontaneous adop- 
tion of reduction as an influence technique 
may be most likely under those very con- 
ditions where our results show it to be least 
effective. 

A finding of some interest in this experi- 
ment is that relating to the performance of 
the Implaus-Aug subjects. This group learned 
the criterion of the trainer as well as any 
group and they used it (i.e., accepted it)
virtually to the same extent that subjects in 
the plausible conditions used that criterion. 
This was so despite the fact that the plausi- 
bility of the anxiety criterion was decidedly 
lower than that of the depression criterion. 
Apparently, after the augmentation mode of 
influence these subjects received when they 
used this criterion, they found that it was 
not so implausible after all. This result seems 
similar to that described by Kelman (1956,
unpublished*) as following from the process
of identification. This process requires that
the influencing agent be attractive to the
influencee. Kelman presents evidence that the
resulting influence is manifested only in situa-
tions where the influencing agent is salient


* Privately circulated monograph entitled, “Social
Influence and Personal Belief: A Theoretical and
Experimental Approach to the Study of Behavior
Change.”

in the thinking of the influencee. The present
evidence is entirely consistent with this 
view though we have no evidence that the 
influence exhibited by the Implaus-Aug 
group would appear only under conditions 
of “salience.” In general, the results of this 
experiment are consistent with the assertion 
by Thibaut and Kelley (1959) that the dis- 
tinction between augmentation and reduc- 
tion parallels two of Kelman’s influence 
processes, identification and compliance. The 
former is thought to yield conformity when- 
ever the relationship with the influencer is 
brought to mind, as is probably the case 
with our augmentation data. The latter is 
thought to yield conformity, if at all, only 
under the influencer’s surveillance. In the 
present case, the reductive influencer did not 
insist on compliance. He in fact permitted 
the influencee to avoid the consequences of 
his surveillance by not taking a stand. The 
main purpose of this study has been to 
demonstrate the consequences of the trainer's 
so doing.


THE ANALYSIS OF PROBLEMS AND PREDICTION
OF GROUP PROBLEM SOLVING¹


JAMES H. DAVIS² AND FRANK RESTLE³
Michigan State University 


The time to solve a multiple-stage problem is the sum of the times consumed by 
separate stages. If each stage is a random all-or-none process, then time to com- 
plete a stage is an exponentially distributed random variable, and time to com- 
plete the problem will have a gamma distribution (under suitable simplifying 
assumptions). From this theory, the number of stages in a problem can be 
estimated and goodness-of-fit tested. Results on 3 word puzzles, administered 
to 178 individual college students, agreed with theory, and the estimates of 
number of stages in each problem agreed well with independent judgments of 
number of stages. The same problems also were administered to groups of 4 Ss. 
The data suggest that all Ss progress at their usual pace toward solution, except 
that an S who made a mistake in interpreting the problem consumes and 
wastes his share of the group’s time. This result agrees with the additional ob- 
servation that the apparent social structure of the groups, as determined from
analyses of sociometric choices, was equalitarian.


The question of whether groups can solve 
problems better than individuals has, in recent 
years, evolved into the question of predicting 
the performance of small groups from known 
performance of comparable individuals work- 
ing separately on the same problems. The 
result of any experiment which compares 
groups and individuals depends upon the 
problems to be solved and the social organiza- 
tion of the groups. 

The data usually considered consist of the 
proportions of individual and group solutions 
within some fixed experimental time limit. 
The times to solution of individuals and groups 
can also be recorded, however. In previous 
discussions Davis (1961), Restle and Davis 
(1962), and Restle (1961) have put forward a 
mathematical theory which is intended to 
explain the distribution of solution times ob- 
tained for individuals, and to predict the 
distribution of solution times of groups of 
size r given the distribution for individuals. 
According to that theory, the distribution of 
solution times depends upon the number of 
independent stages (k) of the problem solving 


1 Developed from part of a doctoral dissertation 
submitted to Michigan State University by James H. 
Davis (1961). The second author directed the dis-
sertation problem.

* During part of the work on this problem, Summer
Miami University (Ohio).

process as well as the probability-per-unit-time
of passing a given stage (λ). It was assumed
that completing each stage is an all-or-nothing 
process and yields an exponential distribution 
of time. Considerations of simple waiting-time 
theory led to the hypothesis that solving a 
complex problem could be represented as 
passing all of k stages successively. Assuming 
that the stages are equally difficult, this leads 
to a gamma distribution; the density of 
solution of the whole problem at time t, given 
that a stage is passed with density λ and there
are k stages, is:

$$g(t; \lambda, k) = \frac{\lambda^{k}\, t^{k-1}\, e^{-\lambda t}}{(k-1)!} \qquad [1]$$

See Pearson (1922), Parzen (1960), and Bush
and Mosteller (1955, pp. 316-320). This
gamma distribution has mean μ = k/λ and
variance σ² = k/λ². Using the obtained mean
t̄ and variance s² from data on a large number
of subjects, one can estimate the number of
stages in a problem, k, as:

$$\hat{k} = \frac{\bar{t}^{\,2}}{s^{2}} \qquad [2]$$

by the method of moments (Mood, 1950, p. 
161). Given a problem with an unknown 
number of stages, one can estimate the number 
of stages using Equation 2. One purpose of the 
present experiment is to test problems which 
are designed to involve different numbers of
stages. If gamma distributions of individual
problem solving time are obtained and k̂ varies
with the intended number of stages in various
problems, then this Distribution of Solution 
Times Model can be employed as a basis for a 
theory of group problem solving. 
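
As a rough illustration of Equations 1 and 2 (not part of the original analysis), the following Python sketch simulates a k-stage waiting-time process and then recovers k and λ by the method of moments. The function names, the random seed, and the parameter values (k = 3, λ = 0.01 per second, 20,000 simulated solvers) are assumptions chosen only for illustration.

```python
import random
import statistics


def simulate_solution_times(k, lam, n, seed=0):
    """Draw n solution times, each the sum of k exponential stage times with rate lam."""
    rng = random.Random(seed)
    return [sum(rng.expovariate(lam) for _ in range(k)) for _ in range(n)]


def estimate_stages(times):
    """Method-of-moments estimates: k_hat = mean^2 / variance, lam_hat = mean / variance."""
    mean = statistics.mean(times)
    var = statistics.variance(times)  # unbiased sample variance
    return mean ** 2 / var, mean / var


# Hypothetical three-stage problem; these values are illustrative, not taken from the data.
times = simulate_solution_times(k=3, lam=0.01, n=20000)
k_hat, lam_hat = estimate_stages(times)
print(f"k_hat = {k_hat:.2f}, lam_hat = {lam_hat:.4f}")
```

With a sample this large the estimates should fall close to the generating values; with the few hundred subjects of a real experiment the estimate of k would be noticeably noisier.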

A number of experimental studies have 
compared individuals with groups, in solving 
the same problem (Lorge, Fox, Davitz, & 
Brenner, 1958). A classic study (Shaw, 1932) 
showed that groups attained a higher propor- 
tion of solutions than individuals, and provided 
evidence for the hypothesis that group inter- 
action facilitates individual cognitive processes. 
Shaw's position was that groups could correct 
errors committed by individual members, and 
otherwise improve performance of individuals. 
Lorge et al. and Marquart (1955) have sug- 
gested that the group advantage results from a 
statistical pooling of the abilities or accomplish- 
ments of their members, and some experimental 
evidence seems to support this position (Faust, 
1959; Husband, 1940; Taylor & Faust, 1952; 
Watson, 1928). 

Lorge and Solomon (1955) formulated the 
statistical pooling theory in mathematical 
form. Their assumption is that the population 
of subjects can be divided into “solvers” and 
“nonsolvers” differing in ability. The propor- 
tion of solvers in the population is estimated 
directly from the proportion of solutions by 
individuals. The expected proportion of solu- 
tions by groups is given by the probability that 
a group of size r will contain at least one solver. 
If $P_I$ is the proportion of individuals who solve,
then $P_G$, the proportion of groups of size r
which will solve, is given by:

$$P_G = 1 - (1 - P_I)^{r}. \qquad [3]$$


This is the Lorge-Solomon “Model A." An 
alternative model supposes that the problem 
involves several stages or parts, and solution 
of the problem requires solution of all parts. 
“Model B” includes the possibility that a group 
may pool “part-solutions” and solve a problem
even though no individual in the group could 
solve the problem alone, and permits group 
performance to be greatly superior to in- 
dividual performance. In the event that all 
parts are equally difficult and the abilities to 
solve the separate stages are distributed 
independently over members of the population, 
a simple special case of Model B arises. In a
problem with k stages, and a proportion $P_I$ of
individual solvers, the probability that a group
of size r will solve is:

$$P_G = \left[1 - \left(1 - P_I^{1/k}\right)^{r}\right]^{k}. \qquad [4]$$


The Lorge-Solomon models do not take ex-
plicit account of the time to solution, but it is
possible to suppose that their models apply
irrespective of the amount of time given the
group. If time to solution is recorded, the
cumulative distribution of solutions by time t
can be taken as $P_I(t)$ for individuals, and $P_G(t)$
for groups of size r. It can thus be hypothesized
that $P_G(t)$ will be related to $P_I(t)$ by Equation
3 or 4, independent of the time t considered.
In this case, a distribution of group solution
times can be calculated from the distribution
of individual solution times.
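
The two Lorge-Solomon predictions can be sketched in a few lines of Python. This is only an illustration under the stated assumptions (equally difficult, independent stages for Model B); the function names and the example cumulative proportions are hypothetical and do not come from the experiment.

```python
def model_a(p_individual, r):
    """Equation 3: probability that a group of size r contains at least one solver."""
    return 1.0 - (1.0 - p_individual) ** r


def model_b(p_individual, r, k):
    """Equation 4 (special case of Model B): k equally difficult, independent stages."""
    p_stage = p_individual ** (1.0 / k)       # individual probability of passing one stage
    group_stage = 1.0 - (1.0 - p_stage) ** r  # at least one member passes that stage
    return group_stage ** k                   # the group must pass all k stages


def group_curve(individual_curve, r, k=1):
    """Apply Equation 4 pointwise to a cumulative individual distribution P_I(t).

    With k = 1 this reduces to Equation 3 (Model A).
    """
    return [model_b(p, r, k) for p in individual_curve]


# Hypothetical cumulative proportions of individual solvers at three successive times.
individual_curve = [0.10, 0.30, 0.50]
print(group_curve(individual_curve, r=4))        # Model A prediction
print(group_curve(individual_curve, r=4, k=2))   # Model B prediction with two stages
```

For any fixed proportion of individual solvers below 1, the Model B prediction with k greater than 1 is higher than the Model A prediction, which is the sense in which pooling of part-solutions lets groups do much better than their members.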

Though the notion of “stages” of the prob- 
lem appears in the writers’ waiting-time 
analysis, and also in the Lorge-Solomon Model 
B, the two approaches cannot be integrated 
directly. The reason is that the Lorge-Solomon 
models refer to pooling of abilities, and imply 
that differences between subjects are due to 
individual differences in (fixed) ability. The 
strictest interpretation would say that if a 
given subject takes 10 minutes to solve a 
problem, he would again take just about 10 
minutes if he could be retested without 
memory. The waiting-time theory says that 
differences in time to solution arise through 
intrinsic variability; solution of a stage is taken
to be an all-or-nothing event which either
occurs at time t or does not, and which can be
characterized as having a fixed probability λ
of occurring given that it has not. As formu- 
lated above, the waiting-time theory has no 
individual differences. 

A pooling theory, analogous to the Lorge-
Solomon models, can be constructed on the
basis of waiting-time, and has been called the
“Combination of Contributions” Model
(Restle & Davis, 1962). This theory supposes
that any subject in the group may master a
stage, each subject independently having
density λ. In one variation of the Combination
of Contributions Model, the Hierarchical
Model, it is assumed that subjects who make
wrong decisions or interpretations, and would
arrive at wrong answers to the problem, are
effectively excluded from the group discussion.
Then if some number A of the r subjects in the
group will solve correctly, each of the A has
his original density of solution, λ, and the group
as a whole has density Aλ. In the Equalitarian
Model, a second variation of the Combination
of Contributions Model, it is assumed that non-
solvers enter fully into group discussion though
they do not contribute to success. They are
assumed to consume their share of the group’s
time, so that each of the A solvers has a
reduced density, (A/r)λ. The group therefore
has density A(A/r)λ = A²λ/r.

Both the Hierarchical and the Equalitarian
Models suppose that solving the problem re-
quires completing the same k stages required
by an individual, and merely change the
density of solution of stages. The density
function describing time to solve, in a group
with A solvers of r subjects, is given by the
Hierarchical Model as:

$$f_H(t; A, r, k) = g(t; A\lambda, k) \qquad [5]$$

and by the Equalitarian Model as:

$$f_E(t; A, r, k) = g(t; A^{2}\lambda/r, k). \qquad [6]$$


It is assumed that “nonsolution,” i.e., making
a decision or interpretation which leads to a
wrong answer, is a random event. Hence, the
proportion of groups with exactly A solvers
can be predicted from the assumption that
with probability 1 − a, a subject makes a non-
solving decision. Then the probability of
exactly A solvers in a group of size r is:

$$P(A) = \binom{r}{A} a^{A} (1 - a)^{r - A} \qquad [7]$$

the binomial distribution. Then it follows that
the probability of success by time t for a col-
lection of r-man groups, when each person has
probability a of success, is given by the Hier-
archical Model as

$$f_H(t) = \sum_{A} P(A)\, g(t; A\lambda, k) \qquad [8]$$

and by the Equalitarian Model as

$$f_E(t) = \sum_{A} P(A)\, g(t; A^{2}\lambda/r, k). \qquad [9]$$


Individual performance permits estimation of the three parameters
necessary to evaluate Equations 8 and 9. Some
proportion of individuals arrive at a wrong
answer; this proportion can be taken as an
estimate of 1 − a, the probability of nonsolu-
tion. The remaining subjects, who solve, yield
a distribution of solution times which can be
used to estimate k and λ, using Equation 2 as
an estimate of k and then the formula:

$$\hat{\lambda} = \frac{\bar{t}}{s^{2}} \qquad [10]$$

derived from the method of moments. These
estimates of a, k, and λ from individual data
are sufficient to lead to a predicted distribu-
tion of solution times, and a proportion of
solvers, for groups of size r. The parameter r,
of course, is fixed by the experimenter. With
this theoretical apparatus, a priori predictions 
of group performance can be computed and 
compared with the data obtained. 
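
A minimal Python sketch of how Equations 7 through 9 might be evaluated is given here for illustration. It assumes the gamma density of Equation 1 (written with the gamma function so that a non-integer estimate of k can be used), sums over A from 1 to r (a group with no solvers contributes no solutions), and uses hypothetical parameter values; none of the function names or numbers come from the original paper.

```python
import math


def gamma_density(t, lam, k):
    """Equation 1 for t > 0; math.gamma(k) equals (k - 1)! when k is an integer."""
    if t <= 0 or lam <= 0:
        return 0.0
    return lam ** k * t ** (k - 1) * math.exp(-lam * t) / math.gamma(k)


def prob_solvers(A, r, a):
    """Equation 7: binomial probability of exactly A solvers among r members."""
    return math.comb(r, A) * a ** A * (1 - a) ** (r - A)


def hierarchical_density(t, r, a, lam, k):
    """Equation 8: the A solvers keep their full rate, so the group rate is A * lam."""
    return sum(prob_solvers(A, r, a) * gamma_density(t, A * lam, k)
               for A in range(1, r + 1))


def equalitarian_density(t, r, a, lam, k):
    """Equation 9: nonsolvers use their share of time, so the group rate is A^2 * lam / r."""
    return sum(prob_solvers(A, r, a) * gamma_density(t, A * A * lam / r, k)
               for A in range(1, r + 1))


# Hypothetical parameters: groups of 4, a = 0.5, lam = 0.01 per second, k = 3 stages.
for t in (100, 200, 400):
    print(t, hierarchical_density(t, 4, 0.5, 0.01, 3), equalitarian_density(t, 4, 0.5, 0.01, 3))
```

The two models coincide when every member is a solver (A = r), since A²λ/r then equals Aλ; they differ only in how much of the group's time is taken up by nonsolvers.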

This theoretical method is more efficient 
than merely counting the proportion of in- 
dividuals and groups who solve, because there 
is no difficulty deciding on the amount of time 
to give, and more of the data are used, since 
the mean and standard deviation of times to 
solution are added to the conventional pro- 
portion of solvers. 

The Hierarchical and Equalitarian Models 
give different pictures of the internal social 
organization of the problem solving group. 
The Hierarchical Model suggests that able and 
efficient subjects will dominate the group and 
less efficient members will be excluded—it 
corresponds to an intellectual hierarchy. The
Equalitarian Model suggests, at the opposite 
extreme, that all group members contribute 
about the same amount of talk even though 
some are not contributing to solution. Inspec- 
tion of the solution-time data may permit a 
decision between these models, but it is also 
possible to collect sociometric choices and 
observational data which may add to the 
plausibility of one or the other model. 

The experiment reported below provides 
data on three questions: first, the Distribution 
of Solution Times Model is investigated by 
giving three problems, intended to have differ-
ent numbers of stages (k), to a large number of
individual subjects. The question is whether 
the gamma distribution yields a reasonable 
description of the data and whether the 
parameter k varies with what appears to be the 
number of stages in the problem. Second, a 
sizable number of groups of subjects are given 
the same problems and compared with the
individuals, to determine whether any of the
pooling models (Lorge and Solomon or Restle
and Davis) agrees with the obtained data.
Third, the results of the pooling experiment
are compared with social-psychological anal- 
yses of the group process to see if these two 
very different sorts of data yield compati- 
ble pictures of the group interaction. 


METHOD 
Subjects 


The subjects were 267 volunteers from an introduc- 
tory psychology course at Michigan State University. 
When a group of four subjects of the same sex (Group 
condition) could not be formed the students present 
were tested as individuals (Individual condition). This 
method of assignment was not random, but the assign- 
ment of any particular subject who appeared on time 
at the laboratory depended solely on the behavior of 
other students who did or did not sign up, or who did 
or did not report on time. Subjects in the two experi- 
mental conditions were considered to be comparable 
samples of the population. There were 12 groups of men
and 10 groups of women; 98 men and 81 women were
tested as individuals.


Materials 


The experiment was conducted in a 40 foot X 22 foot 
seminar-type classroom, containing one table at which 
the subjects were seated and a table, 6-8 feet away, 
behind which the experimenter was seated during 
experimental sessions. A large-faced clock, reading
“Time Remaining,” was on the experimenter’s table in
view of the subjects. The experimenter had four cumu- 
lative frequency counters and an electric timer which 
emitted a soft click every 6 seconds. 

Each of four eureka-type problems, described below, 
was mimeographed on a separate sheet of paper. A 
sociometric questionnaire, developed for the Group 
condition, was as follows: 


Your answers to the following questions will be 
kept in the strictest confidence. Consider each 
question carefully before you answer. Your response 
should refer only to the immediately preceding problem! 
Problems differ a great deal. 

In a few weeks, new groups may be assembled for 
work on similar problems; participants would be paid 
and considerable prize money would be awarded to 
the best-performing group. Which members of this
group would you like to have with you, if you were 
to decide to participate in such a contest? Consider 
performance on the immediately preceding problem only! 
Indicate your choice(s) by a check mark(s) before 
the appropriate letter. You may decide to check none,
one, two, or three of the other members of the group. 
Think carefully. 


___ Person A        ___ Person C
___ Person B        ___ Person D

A second question, requesting the group members
to rank each other, was not used as data and is not
given here.


Procedure


Subjects, tested individually or in small groups, sat 
about a small table with identifying cards (reading 
A, B, C, D) standing in front of them. A practice 
problem, with a 5-minute time limit, was administered 
first in each session. This sample problem was: 


A drunk is walking down the sidewalk; his condi- 
tion is such that he is unable to proceed without 
stumbling and staggering. He decides that he wants 
to cross to the other side of the street. Unfortunately, 
he is so inebriated that he staggers back and forth, 
from one side of the street to the other—wasting a 
great deal of effort. Finally, he is able to stay on the 
side of the street he had been aiming for. He con- 
tinues his journey on that side of the street. 

Among the following, which is the best guess as to 
the number of times he crossed the street in either 
direction? (1) 4; (2) 5; (3) 2; (4) 8; (5) 6. 


Subjects’ understanding of the procedure was 
checked at the end of the sample problem. Following 
this introduction and any necessary discussion, the 
following three problems were presented, with a 12- 
minute time limit for each. Each of the six possible 
orders of these three problems was used with ap- 
proximately equal frequency, to counterbalance 
practice or fatigue effects. 


Rope problem (modified from Dudeney, 1958) 


My dungeon did not lie beneath the moat, but was 
in one of the most high parts of the castle. So stout 
was the door, and so well secured withal, that escape 
that way was not to be found. By hard work I did 
remove one of the bars from the narrow window, and 
was able to crush my body through the opening; but 
the distance to the courtyard below was so exceed-
ingly great that it was certain death to drop thereto.
I did find in the corner of the cell a rope that had been
left there and lay hid in the darkness. But this rope
had not length enough, and to drop in safety from
the end was nowise possible. So I made haste to
divide the rope in half and to tie the two parts 
thereof together again. It was then full long and did 
reach the ground, and I went down in safety. How 
could this have been? 


Word Tangle problem (modified from Kaufman, 
1954) 


Here is a simple question. It happens to be couched 
in complex phraseology, but that shouldn't disturb 
you if you can find a way to reduce it to the funda- 
mentals. It can be done by rephrasing the question 
in such simple terms that the answer is immediately 
apparent. Now go ahead: “If the puzzle you solved 
before you solved this one, was harder than the 
puzzle you solved after you solved the puzzle you 
solved before you solved this one, was the puzzle you 
solved before you solved this one harder than this 
one?” Answer: Yes ___ No ___ Why?


Gold Dust problem (adapted from the Water Jug
problem)


As a prospector you have been very successful.
You have a container that holds exactly 163 ounces
of gold dust. You must discharge a debt by paying
exactly 77 ounces of gold dust (no more no less) to
a friend. You have containers that hold exactly 14 
ounces, 25 ounces, and 11 ounces. How would you 
remove the exact amount from the safe, using only 
these four containers? You must use all four con- 
tainers. 


The three problems were intended to involve different 
numbers of stages. The Rope problem requires only that 
the subject realize that the rope must have been divided 
lengthwise and not across. This single idea leads to the 
solution with little supplementary thought, hence, the 
problem was judged to be essentially a one-stage 
problem. The Word Tangle requires the subject to note
that the problem states that the previous problem was
harder; and he must realize that the subsequent 
verbiage refers only to the present puzzle which is less 
difficult. There are two pairs of mutually canceling 
phrases to identify. Thus it seems that solving the 
problem should require from two to perhaps five stages. 
The Gold Dust problem requires several successive 
calculations or “pourings,” and since the answer 
requires either five or six manipulations (there are two 
methods used by the subjects) it was judged that this 
problem required from four to perhaps eight stages. 

After the experimenter had distributed a copy of the 
problem to each subject present, he retired to his ap- 
paratus table and gave the signal to begin. Subjects 
wrote out their answers and indicated when they were 
finished, on each problem. 

Group condition. Only one four-person group was 
tested in an experimental session. Following the pre- 
liminaries, a group was instructed that they were to 
receive some word puzzles or problems on which they 
were to work as rapidly as was consistent with good 
performance, talking freely among themselves, and 
"thinking out loud" as much as possible. The ex- 
perimenter stressed the necessity of an accurate record 
of solution time; hence, the group was not to hold on 
to an answer believed correct while the allotted time
expired. Any member was allowed to record the 
answer and elapsed time. 

The problems were passed out separately, and fol- 
lowing each problem group members were given 
approximately 3 minutes in which to complete the 
sociometric choice questionnaire. Every experimental
session was closed with a request for secrecy concerning 
the problems and, as an incentive in this connection, 
the statement was made that the best performing group 
would receive a prize amounting to $2.00 per individual. 

In addition to solution-time and sociometric choice 
data, the frequency of intermember communication, 
weighted for length of speech, was tabulated. The
recording was accomplished by means of four hand-tally
counters, one for each group member. The clock and
timer shielded this activity from the subjects and no
subject gave any indication he was aware of being
observed in this way. Each time a member addressed a
communication to another member or to the group he
was tallied as having “talked.” If the same subject (or
subjects) was still speaking at the end of 6 seconds,
indicated by the soft click of a timing device, another
tally was registered on one (or more) of the counters.
This yielded a systematic time sample of verbal activity 
sufficient to indicate which subjects talked most often
and longest, prerequisite to the description of a com-
munication structure.


TABLE 1

INDIVIDUALS VERSUS GROUPS ON EACH EXPERIMENTAL PROBLEM

                 Individual condition             Group condition
Problem          Nᵃ    Solvers   Proportion       N    Solvers   Proportion
                                 of solvers                      of solvers
Rope             178   133       .735             22   21        .955*
Word Tangle      177    89       .503             22   17        .773**
Gold Dust        163    71       .436             22   17        .773***

Note.—All statistical tests make use of normal curve theory
where z = (P_G − P_I)/√[P(1 − P)(N_G + N_I)/(N_G N_I)].

ᵃ Relative frequency of correct answers for Individual con-
dition was based on different numbers of subjects, because some
few subjects were discarded for failing to follow instructions,
leaving early, etc. No systematic bias was noted.

* p < .05, significant difference between individuals and groups.
** p < .01, significant difference between individuals and groups.
*** p < .001, significant difference between individuals and groups.

Individual condition. The one, two, or three subjects 
tested individually at an experimental session were 
required to work silently and alone. Otherwise, in- 
structions were as similar as practicable to those given 
to groups. Following each problem, the experimenter 
requested the subjects to write on a sheet of paper the 
names of three acquaintances they believed would do 
well on such problems. The listing of names was de- 
signed solely as a counterbalancing activity to the 
questionnaire completion of the Group condition; the 
names were not used as data. Each session was closed 
with the request for problem security and the announce- 
ment of a $2.00 prize for the best performing individual. 


Reading Time Sample 

Sixty-six students in a regular class meeting of 
elementary psychology at Miami University (Ohio) 
were given the same three problems. The experimenter
gave the same instructions as for the Individual condi- 
tion, except that the subjects were directed to record 
the time when they finished reading the problem, as 
well as when they had written an answer. On a sub- 
sequent class meeting, the same subjects were asked 
to judge the number of stages in the experimental 
problems.


RESULTS 


In direct comparison, group problem solving 
was found to be clearly superior to individual 
problem solving. As shown in Table 1, a higher 
proportion of groups than individuals solved 
each of the three problems correctly. Although 
more groups solve the problems, groups and 
individuals do not seem to differ systematically 
on mean time to a right or wrong answer; see 
Table 2. In fact standard t tests applied to the
differences between individual and group mean
times on each problem were not significant.


TABLE 2

MEANS AND STANDARD DEVIATIONS OF SOLUTION TIMES IN SECONDS
(reading times subtracted) FOR GROUPS AND INDIVIDUALS

                                      Individuals          Groups
Outcome              Problem          M       SD           M       SD
Correct solution     Rope             131.3   115.5         94.0   124.7
                     Word Tangle      264.8   154.1        269.0   216.2
                     Gold Dust        373.0   166.7        377.5   219.2
Wrong or no          Rope             186.4   151.5        (Only one case)
solution             Word Tangle      215.8   169.0        315.0   227.7
                     Gold Dust        420.6   346.2        444.1   217.2
All subjects         Rope             145.3   127.4        109.6   142.0
                     Word Tangle      240.5   163.0        279.4   214.1
                     Gold Dust        399.9   282.6        392.6   215.0
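
The comparison of proportions behind Table 1 can be illustrated with a short Python sketch of the normal-curve z statistic given in the table note. It assumes that P in that formula is the pooled proportion of solvers across both conditions, which the note does not state explicitly; the function name is hypothetical, and the counts are the Word Tangle entries from Table 1.

```python
import math


def two_proportion_z(solvers_g, n_g, solvers_i, n_i):
    """z = (P_G - P_I) / sqrt(P(1 - P)(N_G + N_I) / (N_G * N_I)), with P pooled."""
    p_g = solvers_g / n_g
    p_i = solvers_i / n_i
    p = (solvers_g + solvers_i) / (n_g + n_i)  # pooled proportion (an assumption)
    se = math.sqrt(p * (1 - p) * (n_g + n_i) / (n_g * n_i))
    return (p_g - p_i) / se


# Word Tangle counts from Table 1: 17 of 22 groups and 89 of 177 individuals solved.
z_word_tangle = two_proportion_z(17, 22, 89, 177)
print(f"z = {z_word_tangle:.2f}")
```

Whether the resulting z is referred to a one-tailed or a two-tailed criterion is not stated in the table note, so the sketch stops at the statistic itself.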


Distribution of Solution Times Model 


Hypothetically, the distribution of indi- 
vidual solution times is a gamma distribution. 
The parameter k, number of stages, and the 
rate parameter, λ, were estimated by the
method of moments, Equation 2 and Equation 
10. Only the distribution of solvers was used 
for this analysis; nonsolvers (those who failed 
to solve or gave a wrong answer) were excluded 
from this analysis, since a person who failed 
to solve the problem may have blundered on 
any one of the k stages. It is not possible to
determine from the data the stage on which
this might have occurred. Consequently, data
composed of persons who, during the 12-minute 
experimental interval, failed different and un- 
known stages cannot be expected to provide an 
estimate of the number of stages in the 
problem. 
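
As a minimal sketch of the estimation step, the block below applies the usual method-of-moments estimates for a gamma distribution, k = (mean/SD)^2 and λ = mean/SD^2. Equations 2 and 10 are not reproduced in this section, so treating them as this standard form is an assumption; the function name is hypothetical. The inputs are the means and standard deviations for individual solvers reported in Table 2.

```python
def gamma_moment_estimates(mean_time, sd_time):
    """Method-of-moments estimates for a gamma distribution.

    Assumes the standard relations mean = k / rate and variance = k / rate**2,
    which give k = (mean / sd)**2 and rate = mean / sd**2.
    """
    k_hat = (mean_time / sd_time) ** 2
    rate_hat = mean_time / sd_time ** 2
    return k_hat, rate_hat


# Mean and SD of net solution time (seconds) for individual solvers, from Table 2.
individual_solvers = {
    "Rope": (131.3, 115.5),
    "Word Tangle": (264.8, 154.1),
    "Gold Dust": (373.0, 166.7),
}

for problem, (mean_time, sd_time) in individual_solvers.items():
    k_hat, rate_hat = gamma_moment_estimates(mean_time, sd_time)
    print(f"{problem}: k = {k_hat:.1f}, rate = {rate_hat:.3f} per second")
```

Under this assumption the estimates round to k = 1.3, 3.0, and 5.0, the values quoted for the fitted curves in Figure 1.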

The effects of adding subjects who obtained 
a wrong answer may, perhaps, be best sum- 
marized by supposing that a subject either 
enters a process which leads to solution, or 
some other process which leads to failure. The 
solution times then are a gamma distribution, 
given that a correct solution and not an error 
is obtained. This conditional distribution might 
be expected when a subject misreads the 
problem initially and hence enters into one or 
the other of the problem solving processes. 
(The nature of eureka problems and the large 
variances associated with solution time on all
three problems suggest a “misreading model” 
is plausible here.*) 

The experimental time interval for any one 
problem has heretofore been treated as a 
uniform block of time. However, it is theo- 
retically possible to distinguish between the 
relatively invariant time required to read a
problem and the highly variable time to solve 
it. The mean times for the Rope, Word Tangle, 
and Gold Dust problems, as recorded by the 
Reading Time Sample, were 62.7, 61.0, and 
41.9 seconds, respectively. Hereafter, "solution 
time" will refer to net time, with the mean 
reading time subtracted. 

Table 3 shows the estimated number of
stages, k̂, and the estimates of the rate of
solution of each stage, λ̂, for each problem. As
might be expected, k̂ did not turn out to be an
integer for all three problems. Theoretically, 
k should assume an integral value, but it is not 
surprising that at least one of the estimates, 
which are subject to error, does not happen to 
be an integer. 

The estimates of k were similar to the
pre-experimental intuitive estimates, described 
earlier, in regard to magnitude as well as 
relative order. Mean judgments of number of 
stages from the Reading Time Sample were 
2.8, 4.0, and 6.3 for the Rope, Word Tangle, 
and Gold Dust problems, respectively. These 
mean judgments differ from the estimates 
using the gamma distribution theory by 1.5, 
1.0, and 1.3 stages, with the students' judg- 
ments being higher than the theoretical esti- 
mates, a very striking correlation. Observe that 
the theoretical estimates came from the 
distribution of solution times, which were en- 
tirely unknown to the judges; and the theo- 
retical estimates in no way used the judgments 
of the students. Hence, the two estimates were
entirely independent and could have been
completely unrelated.


* See Restle and Davis (1962) for a detailed discussion
of nonsolving subjects.


TABLE 3


ESTIMATES OF k̂ AND λ̂ FOR EACH PROBLEM,
READING TIME EXCLUDED


Problem          k̂ = (t̄/s)²     λ̂ = t̄/s²

Rope                1.3            .010
Word Tangle         3.0            .011
Gold Dust           5.0            .013


Note: Individual solvers.


FIG. 1. Cumulative relative frequency of individual solvers (solid points) and
fitted gamma distributions with k = 1.3 for the Rope problem, k = 3.0 for the
Word Tangle problem, and k = 5.0 for the Gold Dust problem. (Axes: cumulative
relative frequency against time in seconds, 0 to 700.)


FIG. 2. The solid and broken lines represent cumulative relative frequency, at
50-second intervals, predicted by the Lorge-Solomon model (k = 1, k = 2, re-
spectively) for groups of size 4. (Observed group performance, for the same intervals,
is given by solid points.) Panels: 2a, Rope problem; 2b, Word Tangle problem;
2c, Gold Dust problem. (Axes: cumulative relative frequency against time in
seconds, 0 to 700.)

The sampling distribution of k̂ (and λ̂) is
unknown under the estimation method used. 
To provide a check on the estimates, the 
theoretical curve, determined by a particular 
k̂, was fitted to the observed distribution of
solutions for each problem. As can be seen in 
Figure 1, the obtained distributions are strik- 
ingly similar to gamma distributions as hy- 
pothesized. The theoretical curves, taken from 
Pearson’s (1922) tables of the incomplete 
gamma function, provided acceptable fits to 
the data for all three problems. Goodness-of-fit 
was tested by the Kolmogorov-Smirnov one- 
sample test. 
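
The goodness-of-fit check can be sketched as follows. The fitted curves in the study were taken from Pearson's tables of the incomplete gamma function; the sketch below instead uses the closed-form distribution function that exists when k is an integer (the Erlang case), so it does not cover a fractional estimate such as k = 1.3. The function names and the sample of solution times are hypothetical and serve only to illustrate the computation of the Kolmogorov-Smirnov statistic.

```python
import math


def erlang_cdf(t, k, rate):
    """Distribution function of a gamma distribution with integer shape k."""
    if t <= 0:
        return 0.0
    x = rate * t
    term = 1.0   # (x ** n) / n! for n = 0
    total = 1.0  # running sum of the terms for n = 0 .. k - 1
    for n in range(1, k):
        term *= x / n
        total += term
    return 1.0 - math.exp(-x) * total


def ks_statistic(sample, cdf):
    """Largest vertical gap between the empirical and theoretical distributions."""
    ordered = sorted(sample)
    n = len(ordered)
    largest_gap = 0.0
    for i, value in enumerate(ordered):
        theoretical = cdf(value)
        # The empirical step function jumps from i/n to (i + 1)/n at each point.
        largest_gap = max(largest_gap,
                          abs((i + 1) / n - theoretical),
                          abs(theoretical - i / n))
    return largest_gap


# Hypothetical net solution times in seconds (not data from the study).
times = [95, 140, 180, 210, 250, 265, 300, 340, 410, 520]
d = ks_statistic(times, lambda t: erlang_cdf(t, k=3, rate=0.011))
print(f"D = {d:.3f}")
```

The statistic D is then compared with a tabled critical value for the sample size; for large samples the .05 critical value is approximately 1.36 divided by the square root of n.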


Prediction: Lorge-Solomon Model 


The Lorge and Solomon Model does not deal 
with the question of the time interval allowed. 
In assessing the predictive usefulness of their 
pooling model it was, therefore, necessary to 
apply it at various arbitrarily selected points in 
time. The sample estimate of the probability 
of an individual solution, P'r, was taken to be 
the cumulative relative frequency of solution 
by t = 100, 150, ..., 700, 720 seconds.
Applying Equation 3 and Equation 4 at each 
of these points, the probability of a "con- 
cocted” group solution, P', was obtained. 
Predictions thus generated are shown in Fig- 
ures 2a, 2b, and 2c for k = 1, 2. Since the 
Lorge-Solomon Model does not utilize the 
time variable, the question of reading time was 
irrelevant. 

Predictions of group performance are poor
on each problem, and the predictions become
worse with the larger value of k; clearly the
estimates of the number of stages from the
Distribution of Solution Times Model are
unsuitable here. Only for the Rope problem
would one regard as tenable the hypothesis
that the observed distribution of group solu-
tions was drawn from the theoretical distribu-
tions represented by the Lorge-Solomon pre-
dictions. (For the other two problems,
observation was significantly different from
prediction, p < .01, as tested by the Kol-
mogorov-Smirnov one-sample goodness-of-fit
test.) In the Rope problem the null hypothesis
was tenable for both k = 1 and k = 2, but the
predictions were everywhere uncomfortably
larger than the observed proportions of group
solutions.

Inspection of predicted and observed curves
(Figures 2a, 2b, and 2c) suggests that k must
be taken less than one to provide good agree-
ment between the two throughout the experi-
mental interval. Any k < 1 is uninterpretable
psychologically; therefore, it is concluded that
the Lorge-Solomon Model as applied here is
unsuitable.

In essence, the Lorge-Solomon Model pre- 
dicts the probability of a group solution to be 
the probability of solution by any one of four 
persons working separately. Groups in this 
study were found to fall below the statistical 
baseline established by pooling individual 
accomplishments.
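
The pooling baseline described above can be illustrated with a short sketch. With one stage, the group succeeds if at least one of its members would have succeeded alone. Equations 3 and 4 are not reproduced in this section, so the staged form used here (each of k stages solved independently, an individual's per-stage probability being the k-th root of the overall probability) is an assumption, and the function name is hypothetical.

```python
def pooled_group_probability(p_individual, group_size=4, stages=1):
    """Pooling prediction for a group, assuming independent members.

    stages=1: probability that at least one of group_size members solves.
    stages>1: assumed staged form, in which every stage must be solved by
    at least one member and the stages are equally difficult.
    """
    p_stage = p_individual ** (1.0 / stages)
    p_group_stage = 1.0 - (1.0 - p_stage) ** group_size
    return p_group_stage ** stages


# Illustrative value only: half of the individuals have solved by time t.
for k in (1, 2):
    print(k, round(pooled_group_probability(0.5, group_size=4, stages=k), 4))
```

Under these assumptions the predicted group probability rises as the number of stages increases, which agrees with the observation that predictions already too high at k = 1 become worse at k = 2.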


Prediction: Combination-of-Contributions Model 


The Hierarchical Model assumes that non- 
solvers are nonfunctional and are suppressed 
in hierarchical fashion by the more successful 
members. Using Equation 8, predictions were 
generated for each of the three problems. 
Figures 3a, 3b, and 3c show observed group 
solutions and predictions from the Hierarchical 
Model. Large discrepancies between observed 
and predicted probabilities are evident. Good- 
ness-of-fit was again tested by the Kolmogorov- 
Smirnov one-sample test; predictions were 
statistically tenable for the Rope problem, but 
could be rejected confidently for the other two 
problems (p < .01). These results indicate the 
Hierarchical Model to be unacceptable—at 
least with regard to the Word Tangle and 
Gold Dust problems. Predictions under the 
Hierarchical Model appear to be close to, and 
about as good as, those attained with the 
Lorge-Solomon Model A (k = 1). 

The Equalitarian Model assumes that non- 
solvers consume their share of the group's 
working time, though they do not contribute 
to the solution, i.e., one may expect equali- 
tarian not hierarchical interaction. Predictions 
from the Equalitarian Model were also com- 
pared with the data from problem solving 
groups, and appear in Figures 3a, 3b, and 3c. 
On all three problems the correspondence 
between prediction and observation is striking. 
Kolmogorov-Smirnov goodness-of-fit tests
provide no evidence against the hypothesis 
that the sample observations have been drawn 
from a theoretical population having the 
predicted distribution. 

The successful Equalitarian Model has a
variety of implications about group structure.

1. All members of the group contribute
about equally to the discussion, whether they
are helpful or not.

2. The group does not form a hierarchical
structure with a particular member or members
taking over, but is equalitarian.

3. The superiority of groups which solve
over groups which fail to solve is explained
merely on the basis of accidents of individual
successes, not on the basis of the evolution of
superior social organization.


FIG. 3. The solid lines represent cumulative relative frequency predicted by the
Hierarchical and Equalitarian Models for groups of size 4. (Observed group per-
formance is given by solid points.) Panels: 3a, Rope problem; 3b, Word Tangle
problem; 3c, Gold Dust problem.


Emergent Social Structure 


The first observation supporting the Equali-
tarian Model is that the experimenter was
usually unable to tell who had and who had
not contributed to solution, despite close
attention to the discussions. It seems reason-
able that subjects in the groups, being busy
with the problem itself, would have at least
equal difficulties deciding who was useful in the
discussion and, hence, would not have the in-
formation needed to set up a valid intellectual
hierarchy.


The companion-choice data support this
interpretation. For each group, the number of 
choices of each member after each problem 
was counted. On the hypothesis that the four
subjects had the same probability of being 
chosen (equalitarian choice structure) it 
follows that the true uncertainty of this dis- 
tribution for any group would be 2 bits. 
Estimates of uncertainty, Ĥ, from sample 
frequencies cannot be higher than 2 with only 
4 equiprobable categories, and will almost 
always be smaller than 2 for small numbers of 
observations. Rogers and Green (1955) give 
tables of the expectation of estimates of H, 
E(Ĥ), for given numbers of events
(choices) and categories (number of subjects 
in a group), assuming uniform distributions. 
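
The uncertainty measure can be sketched in a few lines. The first function computes the plug-in estimate of H in bits from category counts; with four equally chosen members it equals 2. The second computes the expectation of that estimate under uniform choice by brute-force enumeration, which is offered only as an illustration of why the expectation falls below 2 for small numbers of choices; it is not the Rogers and Green tables, and the function names and the counts are hypothetical.

```python
import math
from itertools import product


def uncertainty_bits(counts):
    """Plug-in estimate of uncertainty H, in bits, from category counts."""
    total = sum(counts)
    h = 0.0
    for count in counts:
        if count > 0:
            p = count / total
            h -= p * math.log2(p)
    return h


def expected_uncertainty(n_events, n_categories=4):
    """Exact mean of the plug-in estimate when every event is equally likely
    to fall in any category (enumerates all n_categories ** n_events outcomes)."""
    total = 0.0
    for outcome in product(range(n_categories), repeat=n_events):
        counts = [outcome.count(c) for c in range(n_categories)]
        total += uncertainty_bits(counts)
    return total / n_categories ** n_events


print(uncertainty_bits([3, 3, 3, 3]))   # equal choices of four members
print(uncertainty_bits([6, 3, 2, 1]))   # choices concentrated on one member
print(expected_uncertainty(6))          # six choices, uniform hypothesis
```

Enumeration is practical only for small numbers of events, which is the case for companion choices within a four-person group.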


The mean uncertainty estimates, Ĥ (averaged
over groups separately for each problem), were
very close to the mean expectations, E(Ĥ),
based on uniform distributions, as shown in
Table 4 (upper half). The companion-choice
data show no distinguishable tendency to
concentrate on one subject per group, even if
each problem is considered separately. Since an
estimate of H is calculated for each group, the
distribution of estimates can be compared with
the theoretical E(Ĥ). None of the t tests per-
formed, for groups which solved, showed
significant discrepancies.


TABLE 4


MEAN UNCERTAINTY, Ĥ, AND STANDARD DEVIATIONS FOR GROUP COMPANION CHOICES
AND COMMUNICATIONS ON ALL THREE PROBLEMS


                            Solvers                    Solvers and Nonsolvers
Problem           n     Ĥ       s_Ĥ     E(Ĥ)        n     Ĥ       s_Ĥ     E(Ĥ)

Choices
  Rope           21   1.6094   .3505   1.5482       22   1.6178   .3444   1.5487
  Word Tangle    17   1.5542   .3991   1.5720       22   1.5417   .4042   1.5719
  Gold Dust      17   1.4403   .4628   1.5457       22   1.4161   .4633   1.5388

Communication
  Rope           21   1.8085   .1601   1.9945*      22   1.8145   .1586   1.9946
  Word Tangle    17   1.7951   .4755   1.9949*      22   1.8134   .1774   1.9952
  Gold Dust      17   1.7848   .1585   1.9956*      22   1.7714   .1685   1.9957


Note: For both choice and communication data, t tests were employed to test whether Ĥ was significantly different from the
hypothesized value, E(Ĥ), for solving groups on each problem.

* Significance of deviation of obtained Ĥ from theoretical E(Ĥ), assuming H = 2: p < .001.


The systematic time sample frequencies of 
talking were analyzed the same way. Since a 
large number of events were recorded in the 
12-minute sessions, relatively accurate esti- 
mates of relative frequency of talking were 
obtained. Different members of groups tended 
to talk with different frequency; as can be seen
from the lower half of Table 4, the mean 
estimated uncertainties were near 1.8 whereas 
E(Ĥ), assuming uniform distributions, would
be over 1.99, and the discrepancy is highly 
significant. However, this result merely shows 
that talking was not equally distributed across 
group members, and does not demonstrate 
that a definite intellectual hierarchy was 
established. The obtained mean uncertainty 
estimate of about 1.8 is in fact quite high and 
indicates that the various subjects talked with 
nearly, though not exactly, equal frequency. 
Some individual differences in frequency and 
length of speeches would seem probable and 
might have no strong bearing on emergent 
group structure. 


DISCUSSION 


Distribution of Solution Times Model 


The mathematical model takes advantage of 
a great deal of information—the distribution 
of individual solution times as well as the 
proportion of right and wrong answers. The 
model had two empirical hurdles to surmount. 
First, it claimed that times to solution would 
have a gamma distribution. The obtained 
distributions were close to the fitted gamma 
and did not differ in any significant way.
Second, it was possible to estimate k, the
number of equally difficult stages, for each of
the three problems. The stage estimates agree
with the writers’ intention and intuition, and
were similar to the judgments of naive subjects.
Only further investigations of stages in
problem solving will determine whether this
correspondence is an accident.

While the concept of “number of stages,”
k, has some value in beginning an analysis of
problem solving, it should be emphasized that
the number of stages estimated by the present
method is only an approximation. One would
expect that in most problems with several
stages the stages differ in difficulty and have
different values of λ. Given any set of stages
and their λ values, one can construct the
distribution of total times. However, any
empirically determined distribution of total
times can be closely approximated by various
combinations of stages and λ values. The
present model must therefore be considered an
idealized description of the problem solving
process which is similar in some respects to the
abstract description of a communication
system achieved through the application of
information theory. In this study, rather
similar problems with a small number of
stages were investigated and the idealized
description of the solution process seemed
satisfactory. Further research is needed to
determine when such a description is adequate
for many-staged processes.

Experimental separation of stages is needed 
before conclusions can be drawn about the
exact nature of a stage. The necessary data 
were not collected in the present experiment. 
Possibly, by the use of protocols, it would be 
possible to determine when the subject over- 
comes various stages of the problem, such as 
“understanding” the problem, hypothesizing, 
etc. Such logical or psychological analyses may 
lead to an understanding of the stages of 
problem solving, but each such analysis should 
be adopted with caution. The data of this 
study suggest that some problems involve 
compounds of “part problems,” and that each 
part (even though it may involve formulation 
of the problem, hypothesizing, etc.) is mastered 
in a single, all-or-nothing stage.

The Distribution of Solution Times Model 
can, in principle, be applied to other than 
eureka problems. For example, a geometrical 
proof might involve roughly as many stages
as there are steps in the proof. A task yielding
relatively invariant times (e.g., routine ma-
nipulation of many blocks) would presumably
have a large number of stages. In fact, such
applications suggest additional means of in-
vestigating the stage concept, viz., the study
of tasks which, prima facie, differ widely in
number of stages.

The classification or ordering of problems 
(especially “nebulous” problems such as word 
puzzles) is a promising potential application
of the model. The number of stages to solution
may be useful for distinguishing between other-
wise similar problems, since a particular
problem may be expected to yield a character-
istic distribution of solution times under
standard conditions. However, the psycho- 
logical utility of such classification, attained 
by estimating k and fitting the function thus 
determined, remains to be demonstrated. 


Groups and Individuals 


In terms of the proportion of correct solu- 
tions, the collection of groups outperformed the 
collection of individuals, as has been almost 
universally true in earlier studies. Further- 
more, groups were less efficient than would be 
expected by pooling individual accomplish- 
ments. Problem solving groups in this study 
failed to reach the level predicted by the Lorge- 
Solomon Pooling Model, or by the Hier- 
archical Model which is the continuous time 
analog of the pooling model. The interpretation 
offered, which is embedded in the assumptions 
of the Equalitarian Model, is that group 
members who are on the wrong track consume 
as much group time as do members who are 
approaching solution. The simple pooling 
models pool the accomplishments but do not 
pool the disabilities and errors of the members 
in the group, i.e., the Lorge-Solomon Model
combines the wheat and leaves behind the
chaff. The Equalitarian Model, which fits the
data, pools both wheat and chaff. Before
solution there is no mechanism within the
group for singling out members progressing
toward a solution. In this sense, the Lorge-
Solomon and Hierarchical Models both set an
unrealistic standard for groups. 

The data and Equalitarian Model indicate 
that in terms of man-hours, it is cheaper to 
have individuals solve problems separately 
than to use groups. Individuals in separate 
rooms will, on the average, solve a problem
sooner than the same four individuals in a 
face-to-face group. Using only this simple 
criterion, one might leap to the conclusion that 
groups are inefficient problem solvers in 
general. However, the Equalitarian differs 
from the Hierarchical Model only when the 
probability of solution is less than one. If all 
subjects would solve the problem, there would 
be no difference between the models. What is 
more, in that case a group of size r would have
a rate of rλ, if individuals have a rate of λ. It
is only the probability of an individual failing
to solve the problem which slows down the
group average.

Though individuals will, on the average, 
obtain a correct answer before a group, they 
may also obtain some wrong answers. Working 
separately, four individuals would not separate 
the correct from the wrong answers, and as a 
result the first individual solution (which on 
the average will be before the group solution) 
cannot be accepted with any confidence. The 
group solution has a probability 1 - (1 - a)^r
of being correct, when a is the probability that 
an individual is correct and the group is of 
size r. The first individual solution has only 
probability a of being correct. In this sense the 
group is slower, but surer. 

Obviously, the present model makes sense 
only if the problem is one of searching for a 
unique correct answer. In a “human relations” 
problem (Lorge et al., 1958) the task may be to 
assemble a large number of ideas none of which 
is the "answer." In such a task, mutual
stimulation may be an important advantage 
whereas the drawback of groups (time wasting) 
may not be found if virtually all ideas are 
accepted as valuable. 

Groups may have “real-life” advantages
which are not reflected in the experimental 
conditions studied here. For example, groups 
may persevere longer than individuals, thus 
enhancing the probability of a solution; group 
work may be conducive to higher or more 
appropriate motivation than work alone, etc. 
The social benefits of group interaction are too 
obvious to be mentioned here. The preceding 
positive factors, which are important but 
tangential to the actual problem solving process,
are probably more characteristic of stable than 
of the ad hoc groups discussed in this study. 
Certainly there is no reason to believe that the 
simple models employed here would apply to 
very different situations. 

However, within the context of this study, 
any assertion that intermember interaction 
results in a “super pooling” effect or the 
assertion that group problem solving derives 
from a simple pooling effect are both seriously 
in question. 

Social psychological data in this study agree 
quite well with the supposition that groups 
were organized in an equalitarian structure.
The success of the Equalitarian Model suggests
that members who are on the wrong track con-
tribute their share to the discussions. The data
on choice of future partners failed to reveal 
any noticeable trace of stable leadership 
within groups. 
Schizophrenic patients leaving a VA hospital for independent life in the
community were followed up for a year after departure. They had been 
rated on ward and at activities, had been interviewed, and had been extensively 
tested. By 1 year 33 of 78 had returned, but there was little difference in the 
predeparture measures or in a 1-month home visit between these and those 
who stayed out. For those staying out, 35 of 111 predictors correlated sig- 
nificantly with at least 1 of 2 year-end adjustment criteria. Social assets 
predicted no better positively (and may have predicted worse) than psy- 
chopathology predicted negatively. Demographic variables, such as time since 
1st mental hospitalization, predicted at least as well as tests; and factorially 
pure test scales predicted no better than empirical scales. 


Accurate foresight could solve many of the 
problems faced by those who deal with mental 
disturbance. Yet unpredictability is one of 
the most salient characteristics of that dis- 
turbance. Just as there are many ways of 
spoiling an omelet, there are many ways in 
which adjustment can be disturbed. Less 
remarkable, perhaps, than the occurrence of 
mental disturbance is the fact that good 
adjustment itself can be maintained. This is 
particularly true of the precarious adjustment 
of the ex-patient from the mental hospital. 

The patient leaves the hospital full of hopes 
and forebodings. He carries with him the 
blessings of the staff, who stand behind him, 
their fingers gingerly crossed. Both patient 
and staff wish they could be sure of how well 
he will be doing in the coming year or two. 
But prediction is difficult; we make an in- 
formed guess and hope for the best. 

The prognosis of schizophrenic patients has 
been amply studied in the hospital. One 
bibliography (New York State Department of 
Mental Health, 1959) lists close to 850 titles. 
Chase and Silverman (1941) evaluate some 
of the older researches. Almost all these stud-
ies address themselves to questions concern-
ing prognosis viewed from the time of admis-
sion: How long will the patient have to stay
in the hospital? What are the possibilities 
of his improvement under different forms of 
hospital treatment? 

There have been fewer studies of the pa- 
tient's course once he has left the hospital, 
and these studies have been marred by several 
deficiencies inherent in the investigation of 
an elusive long-term dysfunction. In some 
there are simply not enough cases in the 
sample upon which to base conclusions. Some 
are simply statistical compendia. In many 
the criterion of recovery or improvement in 
schizophrenia depends upon records, cor- 
respondence, or responses to mail question- 
naires. It is the unusual study which brings 
the patient whose adjustment is being judged 
face to face with a professionally trained 
rater. 

Even when the criterion of outcome is 
adequate, some of the published studies have 
difficulties in getting more than retrospective 
measures of the prognostic factors. Thus, in 
a study by Schofield, Hathaway, Hastings, 
and Bell (1954), the predictors were derived 
solely from ratings of case history material. 
The ratings were reliable and uncontaminated 
by knowledge of outcome, but their scope 
was limited by the nature of the case history 
material. Thus, it is not surprising that the 
authors found that only 17 of their 200 rated 
items differentiated at the 5% level between 
patients who subsequently made a “good”
adjustment and those who made a “poor” 
one. Although the Pittsburg studies (Bayard 
& Pascal, 1954; Cole, Swensen, & Pascal, 
1954; Feldman, Pascal, & Swensen, 1954; 
Pascal, Swensen, Feldman, Cole, & Bayard, 
1953; Swensen & Pascal, 1954a, 1954b) 
show a higher percentage of “hits,” their 
predictive data, too, were gathered retro- 
spectively from case files. Several of the 
other published follow-up studies show this 
same deficiency in predictor variables. A 
number are deficient both in gathering predic- 
tive data before the fact of discharge and in 
providing an adequate measure of outcome. 

Ideally a longitudinal study should in- 
clude a sufficient number of subjects on which 
to base conclusions and should combine meas- 
urements taken with the specific aim of pre- 
dicting posthospital adjustment with ratings 
of that adjustment by a trained person having 
access to interview and other relevant data. 
Two studies which satisfactorily do this are 
concerned with special aspects of the problem. 
Ullmann, Berkman, and Hamister (1958) 
are specifically concerned with the problem of 
prediction of behavior when the ex-patient is 
placed in a private home. Wirt and Simon’s 
(1959) study compares follow-up outcomes of 
different chemotherapies and other forms of 
treatment in the hospital. 

It seems generally agreed that outcome in 
follow-up studies is best estimated through 
personal contact with the patient and with 
those about him. There is less agreement 
about which predictor measures are likely to 
be useful in making an informed guess about 
prognosis. Ellsworth and Clayton (1959) 
suggest that we have been too much bemused 
by psychopathology. In judging patients we 
should look for their possible assets. In their 
view we are much more likely to obtain suc- 
cessful predictions from estimates of social 
adequacy; and they have, in fact, produced 
a scale, the MACC, to estimate this social 
adequacy by predicting from assets rather 
than from liabilities. They found that nurse 
and aide ratings of behavioral adjustment on 
the MACC predicted both discharge from the 
hospital and adjustment at 3 months following 
discharge better than did psychiatrists’ ratings
of psychopathology. They suggest that per-
sons whom we characteristically see as grossly
abnormal can, given adequate social skills,
make quite satisfactory adjustments. It is
these skills, rather than psychopathology, 
which are likely to be predictive.

Pishkin and Bradshaw (1960) suggest
that a weighted index, derived from demo- 
graphic and case history data, can successfully 
predict which patients will return to the hos- 
pital within a year. For them success in 
readaptation to community life is a function 
of group identifications (such as being mar- 
ried or single) and of hospital experiences 
(such as being a member of a therapy group). 
This is in contradiction to Stotsky (1952), 
who, approaching prognosis in schizophrenia 
by contrasting patients who were soon dis- 
charged with patients whose hospitalization 
was prolonged, found that demographic vari- 
ables showed little differences while the 
Rorschach and Wechsler revealed a number of 
contrasts between the groups.

Still another set of predictor variables may
be taken into account. These lie not in the 
patient himself but rather in the situation to 
which he returns. The specific environment 
and the stresses he meets there may determine 
whether he is to remain out of the hospital or 
must return. Freeman and Simmons (1958) 
have collected data which they interpret as 
showing that patients returning to a home where
they occupy the role of son need deploy a
much smaller range of social skills than those
going to homes where they must function as
husbands. For this reason, husbands are
more likely to return to the hospital than are
sons of similar levels of social competency.
They would suggest that it is the interaction
between the patient's social skills and the
stress or support of the environment which
determines his final success.

The kinds of predictor variables which
most effectively predict outcome are of theo-
retical interest since they may shed light on
the nature of schizophrenia. If psychopath-
ology is most predictive, then schizophrenia
might be comparable to an illness such as
liver disease where the prognosis is directly
related to the amount of tissue damage (the
malignancy of the symptoms). If social ad-
justment is most predictive, then we might
compare the disturbance to a complex of bad
habits. It would be like the poor study skills
and habits of life which prevent some college 
students of adequate ability from receiving 
passing grades. If demographic indices are 
most predictive, then schizophrenia might be 
akin to some symptoms of societal dysfunc- 
tion. We would compare it then to suicide or 
to juvenile delinquency. Finally, if it is the 
role which the ex-patient is called upon to 
play and the setting in which he is called
upon to play it which best foretell his suc- 
cess, then schizophrenia would resemble some 
other cases of faulty fit of person and role. 
It would be comparable to some cases of 
marital discord or of vocational maladjust- 
ment. 

The present study attempted systematically 
to evaluate prognosis for schizophrenics leav- 
ing the hospital. Several criteria of outcome 
were used and measures were taken pro- 
spectively rather than retrospectively. Pa-
tients leaving the hospital in the 13 months
following June 1, 1957, were measured in a 
number of ways before their departure, and 
then they were followed in the community for 
a full year. Indications of adjustment were 
obtained by a trained psychiatric social 
worker through interviews with the patient
himself and with the family or others with 
whom he was living. The predictive measures 
included measures of psychopathology, of 
social assets, and of demographic variables.


METHOD
Subjects 


The subjects of this study were schizophrenic 
male veterans who were separated from inpatient 
status at American Lake Veterans Hospital between 
June 1, 1957, and July 1, 1958. Initially all had gone 
to residences in western Washington. All had con-
sented to be interviewed at regular intervals outside 
the hospital during their first posthospital year. 

Voluntary participation was not an important 
limiting factor, since only one person of all those 
who might otherwise have been included refused 
to participate. Two others who might have been 
eligible indicated that, although they were going 
to addresses in western Washington, they were not 
going to reside there very long. Even assuming 
that these persons may have avoided participation 
in the study through a convenient fiction rather 
than by outright refusal, the total loss through 
refusals and avoidance was less than 3% of all 
persons who met the other selective criteria. 

The total sample consisted of 115 veterans. Of 
these, 37 were placed through the hospital’s foster 
home care program. The results on this home care 
group have not been included in the present report. 

Of the 78 who were not placed in foster home 
care, 1 was lost prior to the initial follow-up inter- 
view; no data regarding his posthospital adjust- 
ment were available, except that it was ascertained 
from his veteran’s claim file that he had not entered 
another hospital during the year following his dis- 
charge. Six others had moved and were not available 
for year-end interviews. In each of these instances 
partial information was obtained through cor- 
respondence with Veterans Administration offices 
or other hospitals. In all instances it was ascer- 
tained whether the veteran had been hospitalized 
for mental illness within 1 year of departure from 
American Lake. Thus, of the total sample of 115, 37 
went to home care, and of the 78 independents, 7 
had incomplete data. 

Nearly two-thirds of the patients in the study 
were in their 30s and 40s. Less than 15% were 
over 60. Only one was under 20 at the time of his 
departure from the hospital. Nearly three-fifths 
were single. Slightly more were divorced than were 
currently married, excluding four who were separated 
and in process of divorce. All had been diagnosed 
as schizophrenic following a complete psychiatric 
evaluation. Nearly half were described as paranoid, 
and approximately one-fourth were classified as 
chronic, undifferentiated (see Table 1). 


TABLE 1 
NUMBERS OF PATIENTS RECEIVING VARIOUS DIAGNOSES 

                           Total      Independent              Home care
Diagnosis                 sample   Total   Out  Returned   Total   Out  Returned

Simple                         9       2     1         1       7     2         5
Hebephrenic                    8       2     0         2       6     6         0
Catatonic                      6       4     3         1       2     1         1
Paranoid                      54      43    26        17      11    11         0
Acute undifferentiated         4       2     2         0       2     1         1
Chronic undifferentiated      30      21     9        12       9     8         1
Schizo-affective               4       4     4         0       0     0         0

Total                        115      78    45        33      37    29         8


TABLE 2 
NUMBER OF PATIENTS BY YEARS SINCE FIRST HOSPITAL ADMISSION 

                           Total      Independent              Home care
Years                     sample   Total   Out  Returned   Total   Out  Returned

Less than 1                    8       8     6         2       0     0         0
1-2                            7       7     1         6       0     0         0
2-5                           18      18     7        11       0     0         0
5-10                          21      16    11         5       5     4         1
10-20                         40      25    17         8      15    11         4
Over 20                       21       4     3         1      17    14         3

Total                        115      78    45        33      37    29         8


The first hospitalization for mental illness had 
occurred more than 10 years before this present 
release for more than half the subjects. Only 10% 
had first been hospitalized less than 2 years prior 
to this departure. Between 15 and 20% had entered 
upon their first hospitalization more than 20 years 
ago (see Table 2). 

Chronicity tends to be characteristic of the 
schizophrenic population in Veterans Administration 
hospitals. Only those who have rendered military 
service in time of war are ordinarily eligible for 
admission. The majority of eligibles and potential 
eligibles are veterans of World War II, which ended 
13 years before this research began. The next largest 
group of eligibles, the veterans of World War I, 
gained their potential eligibility about 40 years ago. 

All except four of these persons had been hospitalized 
more than once. Approximately 40% had 
been hospitalized more than three times (see 
Table 3). 

More than one-third had remained in the hospital 
less than 6 months on their current admissions. 
Less than one-fifth had remained more than 10 
years. Including those who stayed less than 6 months, 
51% of the total sample had remained in the hospital 
less than one year (see Table 4). 

Nearly 60% were considered incompetent to manage 
their own finances at the time of separation 


TABLE 3 
NUMBER OF PATIENTS BY NUMBER OF HOSPITALIZATIONS FOR MENTAL ILLNESS* 

                           Total      Independent              Home care
Number                    sample   Total   Out  Returned   Total   Out  Returned

1                              4       2     1         1       2     2         0
2                             33      21     9        12      12    10         2
3                             31      21    14         7      10     9         1
More than 3                   47      34    21        13      13     8         5

Total                        115      78    45        33      37    29         8


* In cases where patient was transferred from a state, military, or Veterans Administration hospital to another hospital for 
continuing care, both hospital admissions were counted. Where admission was temporary, such as for commitment and transfer 
to an appropriate hospital, the temporary hospitalization was not counted. 


TABLE 4 
NUMBER OF PATIENTS BY LENGTH OF PRESENT HOSPITALIZATION 

                           Total      Independent              Home care
Length                    sample   Total   Out  Returned   Total   Out  Returned

Less than 6 months            41      40    26        14       1     1         0
6 months-1 year               17      15     7         8       2     1         1
1-2 years                     12      12     5         7       0     0         0
2-5 years                     11       8     5         3       3     3         0
5-10 years                    13       1     1         0      12     8         4
More than 10 years            21       2     1         1      19    16         3

Total                        115      78    45        33      37    29         8


from the hospital. Nearly two-thirds had entered 
the hospital under legal commitment, and approximately 
the same proportion were veterans whose 
initial and subsequent illnesses were “service connected,” 
for which they receive compensation from 
the Veterans Administration. Seventy percent left 
the hospital on trial visit status, while all except 
one of the others were discharged as having received 
maximum benefit from hospital treatment. 
One man had left the hospital against medical advice. 


Measures 


The measures which were taken can be divided 
into the predictors and the criteria. The predictors 
were selected so as to give measures both of 
psychopathology and of social functioning, and they 
also included demographic indices (background vari- 
ables such as commitment status). Test measures 
were included as well as rating and interview 
measures. In selecting the test scores which were 
to be used, scales derived from factor analytic 
studies were contrasted both with scales derived 
from empirical criterion correlations and with a 
priori scales. 


Predictor Measures 


Tests. The tests used included the MMPI, the 
IPAT 16 PF Test, a specially devised Sentence 
Completion Test, and the Rotter Level of Aspiration 
Test. 

On the MMPI all the standard validity and K 
corrected clinical scales were used. In addition the 
“subtle” and “obvious” scales derived for D, Hy, 
Pd, Pa, and Ma (Wiener, 1948) were used as well 
as the following criterion derived scales: Si, Cn 
(Cuadra, 1956), Lb (Hanvik, 1951), ECs (an unpublished 
scale to measure ego control developed by 
Jack Block), and Es (Barron, 1953). The a priori 
scales used were Social Desirability (SD) (Edwards, 
1957), the Manifest Anxiety (MA) scale (Taylor, 
1953), the Dependency (Dy) scale (Navran, 1954), 
IEs, a measure of internalized reaction to stress 
(Rozynko, 1959), the Manifest Hostility (MH) 
scale (Siegel, 1956), and AFR, a scale of atypical 
false responses, i.e., all the false responses to items 
which the majority of the normative population 
answered “True.” In choosing these nonclinical 
scales it was desired to use as many as possible 
which would be relatively independent of the 
clinical scales. Lb, ECs, IEs, and AFR all showed 
this comparative independence. Four factor scales 
were used: three, M1, M2, and M3, derived from a 
factor analysis of MMPI items (Mees, 1959) 
and the Welsh (1956) R scale which measures much 
the same factor, apparently, as does M3. Finally, 
those items of the Welsh R scale which were 
scored in the X or infrequent direction were combined 
to make an R-X scale. Thus, in all, there 
were 40 different scores derived for each subject 
from his MMPI performance. 

The IPAT 16 PF Test Form C was included because 
it provides a number of factor scales. It 
should be noted that this is the shorter, less reliable 
form of the test. The burden of tests which were 
to be given to each subject gave a premium to 
brevity. All 16 scales were used, and the raw 
scores were converted to stanines. 

The Sentence Completion Test had responses 
scored in terms of a “Coping” scale which has been 
described more fully elsewhere (Mainord, 1956). 
Completions of any of the 40 stems which acknowl- 
edge the emotional demand character of the stem 
are given a score of two. Completions which ignore 
this emotional demand are scored zero, while com- 
pletions which are intermediate in this respect are 
scored one. Thus, to the stem “I cannot control 
myself when . . .” the response “I’m mad” would 
receive a score of two, the response “I’m asleep” 
would receive a score of zero, while the response 
“I’m confused” would receive a score of one. 
Reliabilities of coping scores independently arrived 
at by different raters have ranged from .68 to .90 
with a median around .80. 

The Rotter Board has been described elsewhere 
(Rotter, 1942, 1945). Nine scores were derived 
from this test performance: 

1. The number of times the subject reached his 
bid (his estimate of what score he was likely to 
make). 

2. The mean bid. 

3. The mean weighted score (computed by giving 
credit only for the amount bid when the subject 
reaches or exceeds his bid and by penalizing over- 
estimates by subtracting the difference from the 
amount actually made). 

4. The mean difference between actual performance 
on one trial and the bid made preceding the next 
trial. 

5. The number of shifts in bidding. 

6. The number of typical shifts (upward after suc- 
cess and downward after failure). 

7. The number of unusual shifts (downward after 
success and upward after failure). 

8. The number of different levels of bids. 

9. The mean discrepancy between bids and per- 
formance. 
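
The nine scores listed above can be sketched in code. This is only an illustration under stated assumptions: the function name, the dictionary keys, and the sample bids and performances are hypothetical, the sign convention for Scores 4 and 9 (bid minus performance) is assumed rather than taken from the text, and a "success" is taken to mean reaching or exceeding the bid.

```python
from statistics import mean


def rotter_scores(bids, performances):
    """Compute the nine level-of-aspiration scores for one subject.

    bids[i] is the estimate stated before trial i; performances[i] is the
    score actually made on trial i. Names and sign conventions are assumed.
    """
    if len(bids) != len(performances) or len(bids) < 2:
        raise ValueError("need at least two trials, one bid per performance")

    # A trial is a success when the subject reaches or exceeds his bid.
    reached = [p >= b for b, p in zip(bids, performances)]

    # Score 3: credit only the bid on a success; on an overestimate,
    # subtract the shortfall from the amount actually made.
    weighted = [b if ok else p - (b - p)
                for b, p, ok in zip(bids, performances, reached)]

    shifts = typical = unusual = 0
    for i in range(1, len(bids)):
        change = bids[i] - bids[i - 1]
        if change == 0:
            continue  # same bid as before: not a shift
        shifts += 1
        # Typical: upward after success or downward after failure.
        if (change > 0) == reached[i - 1]:
            typical += 1
        else:
            unusual += 1

    return {
        "times_bid_reached": sum(reached),                      # Score 1
        "mean_bid": mean(bids),                                 # Score 2
        "mean_weighted_score": mean(weighted),                  # Score 3
        "mean_next_bid_minus_performance": mean(                # Score 4
            bids[i] - performances[i - 1] for i in range(1, len(bids))),
        "number_of_shifts": shifts,                             # Score 5
        "typical_shifts": typical,                              # Score 6
        "unusual_shifts": unusual,                              # Score 7
        "different_bid_levels": len(set(bids)),                 # Score 8
        "mean_bid_minus_performance": mean(                     # Score 9
            b - p for b, p in zip(bids, performances)),
    }


# Hypothetical five-trial record, for illustration only.
example = rotter_scores([10, 12, 12, 9, 11], [11, 8, 12, 10, 7])
```

Every shift is counted as either typical or unusual, so Scores 6 and 7 always sum to Score 5 in this sketch.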

Interview ratings. Each subject was interviewed 
in the week immediately preceding his departure from 
the hospital. Two psychologists (JCS and JM) 
served as interviewers. From these interviews ratings 
were made on the Symptom Rating Sheet which 
has been reported elsewhere (Jenkins, Stauffacher, 
& Hester, 1959). Ratings were made of 20 different 
symptoms (e.g., anxiety, depression, orientation) 
on the basis of the patient's behavior in the 
semistructured interview. 

Behavior ratings. To assess the subject's level 
of behavioral adjustment, the MACC scale (Ells- 
worth & Clayton, 1959) was completed by observers 
who had an opportunity to evaluate his adjustment 
in two situations. His behavior in the ward living 
situation was rated by a psychiatric aide, while his 
behavior at his detail or in occupational therapy was 
rated by the appropriate therapist. In addition to 
the four standard MACC scales, Motility, Affect, 
Cooperation, and Communication, four items were 
added to measure Sociability. These included such 
items as “Do other patients seek him for companion- 
ship?” and “Does he reply when others talk to 
him?” A total adjustment score was computed by 
summing scores for Affect, Cooperation, and Com- 
munication. Thus, there were, for each subject, six 
behavior scores derived from each of two raters, 
and these scores measured social skills rather than 
psychopathology. 

A preliminary study on 30 patients rated on the 
ward on two occasions a week apart by the same 
aide showed that the Motility variable was unreliable 
(probably because of limited range) but the 
others showed adequate but not spectacular reliability. 
The r's between the two occasions were 
Motility, .35; Affect, .68; Cooperation, .79; Com- 
munication, .78; and Sociability, .79. There was rela- 
tively little communality of the ward and activity 
therapist ratings (r for Total MACC for 57 of our 
study patients was only .28 between the two situa- 
tions). This may have occurred because somewhat 
different types of behavior are required in work 
and living situations. Because of this relative in- 
dependence of the two behavioral ratings, they were 
obtained from both sources. 
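
The reliability figures above are correlations between two sets of ratings of the same patients. As a rough sketch of that computation, assuming an ordinary product-moment r and using made-up ratings (the function name and the numbers are hypothetical, not the study's data):

```python
import math


def pearson_r(first, second):
    """Product-moment correlation between two equally long rating lists."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two lists of equal length, two or more pairs")
    mean_x = sum(first) / len(first)
    mean_y = sum(second) / len(second)
    # Sums of cross-products and squares of deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(first, second))
    sxx = sum((x - mean_x) ** 2 for x in first)
    syy = sum((y - mean_y) ** 2 for y in second)
    if sxx == 0 or syy == 0:
        raise ValueError("ratings with no variation have no defined r")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical ratings of five patients by the same aide, a week apart.
week_one = [1, 2, 3, 4, 5]
week_two = [2, 1, 4, 3, 5]
retest_r = pearson_r(week_one, week_two)
```

The guard against zero variance bears on the Motility result: when nearly every patient receives the same rating, the range is restricted and r is unstable or undefined.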

Demographic data. From the hospital clinical 
records it was possible to extract 13 demographic 
variables such as marital status, number of dependents, 
age, length of hospitalization, diagnosis, and so 
forth. (Many of these have been discussed in the 
section under Subjects.) Since several of these were 
classificatory variables rather than scalar ones, subjects 
were assigned to a category rather than given 
a score. 

Field observations. A psychiatric social worker 
(CL) visited the patient at his residence at one 
month following departure from the hospital. In 
these visits both the patient and those he was living 
with were interviewed. An attempt was made to 
conduct these two interviews separately. The raw 
data from the interview were entered on a work 
sheet. From the interviews themselves the visiting 
social worker made 20 ratings along five-point scales. 
There were 7 different ratings of his present adjust- 
ment, 1 in his work, 1 at school (where this was 
applicable), 1 in the area of money management, 1 
in his family relations, 1 in his interpersonal relations 
outside the family, 1 in his recreation, and 1 which 
was the average adjustment observed in all these 
areas. The trend, for better or for worse, in his 
adjustment was rated in the same 7 areas. Finally 
there were 6 ratings of the amount of support or 
lack of stress being afforded the patient in five 
areas: work, school, family, outside relations, “other,” 
and an average over all these areas. These scales were 
a revision and extension of those constructed by 
Jenkins and his group for the Veterans Administration 
Psychiatric Evaluation Project.² 


2 Copies of the work sheet which were filled out 
on these visits, of the rating sheet itself, and of the 
procedure used in making the ratings have been de- 


J. MARKS, J. C. STAUFFACHER, AND C. LYLE 


The field worker filled out a modified MACC 
scale on the patient following each visit. Six scores 
were available from this: Motility, Affect, Cooperation, 
Communication, Sociability, and Total 
Adjustment. 

At this time a number of follow-up observations 
were also made of factors which might be thought 
to influence the subject’s eventual success in independent 
life. The social worker determined whether 
he was continuing his outpatient psychotherapy at 
least on a once-weekly basis. In addition he recorded 
whether the patient was working, whether this 
work was enough to support him, whether the 
patient was going to school, and whether or not he 
was having difficulties with alcohol or with the 
police. Thus, from the follow-up visits there were 
20 ratings and 7 observations. 

Two other home visits were made by the same 
worker at 6 months and a year following the 
patient's departure from the hospital. On these 
occasions the same 20 ratings were made and the 
MACC scales were filled out. 

These home visit ratings were not originally intended 
as predictor measures. In the thought that 
the one-month measures in the community might, 
however, prove useful in predicting rehospitalization, 
they were included among the predictor measures 
for that criterion. Since the same social worker made 
both the one-month and the one-year ratings which 
were to be used as other criteria, one-month ratings 
were not used in predicting year-end adjustment. 


Criterion Measures 


A number of measures were made of the subject's 
adjustment after he had left the hospital. The 
simplest measure was whether or not he had returned 
to a hospital within one year after his initial 
departure. 

For the patients who remained out of the hospital 
for the full year following separation, other criteria 
were possible. Two scores were selected for this from 
those the social worker made at his one-year visit: 
the Average Adjustment Score and the MACC Total 
Adjustment. 

It should be noted that, although the rating 
social worker knew the demographic measures as 
part of the social history, he was not aware of the 
interview ratings, the test measures, or the behavior 
ratings which were made at the time of 
departure. Therefore, on these last three types of 
measure there could be no contamination of criterion. 


posited with the American Documentation Institute: 
Order Document No. 7234 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library 
of Congress; Washington 25, D. C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. 
Make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 
Also available from the authors. 


Approximately a week before the subject was 
scheduled to leave the hospital the ward notified 
the Research Service. Each subject was then called 
for an interview from which the Symptom Rating 
Sheet judgments were made, and the Rotter was 
administered individually. The group tests were 
given during the regular weekly hospital testing 
programs. The MACC scales were sent to the appropriate 
aides and to the therapists. Only selected 
aides and therapists who had received instructions 
in the use of the MACC scales were used. Following 
departure from the hospital, the regular 1-month, 
6-month, and 1-year visits were made as scheduled 
by the research social worker (CL). 


After the data had been collected, it was coded 
and punched on IBM cards. The statistical analysis 
was accomplished on an electronic computer.³ 


³ We are indebted to Edward F. Gocka for the 
machine analysis and for later machine calculation 
of many of the multiple Rs. 


RESULTS 


Conceptually the analysis of data was a 
simple one: the comparison of the predictor 
measures for the outcome groups as classified by 
the criteria. It will be recalled that this 
analysis is being done only for the group 
leaving the hospital to live independently and 
not in foster home care. Returnees are compared 
in terms of predictors with those who 
stayed out. For those who stayed out a full 
year, the relation is explored between predictors 
and the year-end criteria of adjustment. 


These two groups were compared on 144 
different predictor measurements, including 
all the tests, the interview ratings, the behavior 
ratings, the one-month follow-up observations, 
and the 13 demographic variables. 
Because of missing data, the Ns for the 
groups to be compared could not always be 
the same, but the returnees varied from 28 to 
[illegible] individuals while the stay-outs ranged 
from 40 to 46 in the various comparisons. 
In only 1 of all 144 comparisons did the 
difference in measures between the two groups 
reach beyond the .05 level of probability. The 
returnees showed a lower MACC rating on 
[illegible]tation from their activity therapist (t = 
2.76). Among this number of comparisons 
such a difference could well have occurred 
by chance. 
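
The arithmetic behind this remark can be sketched as follows. The sketch assumes, purely for illustration, that the 144 comparisons are independent; the predictors in this study are intercorrelated, so the figures are only a rough guide, and the variable names are hypothetical.

```python
# How many "significant" differences would chance alone produce?
N_COMPARISONS = 144   # predictor measurements compared between the groups
ALPHA = 0.05          # significance level used in the text

# Expected number of comparisons passing the .05 level if no predictor
# truly distinguished returnees from stay-outs.
expected_by_chance = N_COMPARISONS * ALPHA

# Probability of at least one such result, under the (assumed)
# independence of the comparisons.
prob_at_least_one = 1.0 - (1.0 - ALPHA) ** N_COMPARISONS

print(f"expected by chance: {expected_by_chance:.1f}")
print(f"P(at least one):    {prob_at_least_one:.4f}")
```

With about seven significant differences expected by chance alone, a single observed difference gives no support for any of the predictors.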


TABLE 5 

PREDICTOR VARIABLES SHOWING SIGNIFICANT 
RELATIONS TO THE ONE-YEAR 
ADJUSTMENT CRITERIA 

                                          Average          MACC
Predictorᵃ                           N   Adjustment        Total

MMPI (40)
  F                                 36       -.57**       -.69**
  D                                 36       -.36*   [illegible]
  Pa                                36       -.50**  [illegible]
  Pt                                36       -.27    [illegible]
  Sc                                36       -.58**  [illegible]
  Si                                36       -.44**  [illegible]
  SD                                36  [illegible]  [illegible]
  Es                                36  [illegible]  [illegible]
  MA                                36       -.34*   [illegible]
  Dy                                36       -.30    [illegible]
  Pa (Subtle)                       36       -.35*   [illegible]
  D (Obvious)                       36       -.39*   [illegible]
  Pd (Obvious)                      36       -.37*   [illegible]
  Pa (Obvious)                      36       -.39*   [illegible]
  Ma (Obvious)                      36       -.38*   [illegible]
  Mees 1                            36       -.39*   [illegible]
16 PF (16)
  Sophistication (N)                35       -.15         -.34*
  Independence (Q2)                 35        .35*   [illegible]
Rotter (9)
  Number of shifts (E)              37       -.41*        -.33*
  Unusual shifts (G)                37       -.48**       -.49**
  Different levels (H)              37       -.43**       -.29*
Sentence Completion
  Coping                            37        .24    [illegible]
Interview Ratings (20)
  Withdrawal                        37       -.44**       -.35*
  Uncooperativeness                 37       -.35*        -.28
  Bizarre positions or movement     37       -.32    [illegible]
  Apathy                            37       -.52*        -.37*
Behavioral Skill Ratings (12)
  Aide MACC
    Sociability                     37        .46**        .32
  Activity therapist MACC
    Motility                        32        .35*         .25
    Cooperation                     32        .36*         .08
    Sociability                     32        .35*         .17
Demographic Background (13)
  Time since first hospitalization  32       -.47**       -.24
  Number of hospitalizations        32       -.35*        -.15
  Commitment status                 32        .29          .38*
  Legal competency                  32        .45*         .18
  Amount of compensation            41       5.91***


a Numbers in parentheses are numbers of predictors in that 
classification. 
* r significant at .05 level. 
** r significant at .01 level. 
*** χ² significant at .05 level, 1 df. 


Relation of Predictors to Year-end Adjustment 
among Independent Stay-outs 


All the 111 predictors (including 13 back- 
ground data items but excluding the one- 
month measures) were correlated with the 
two year-end measures, rated average adjust- 
ment and MACC total adjustment score. Of 
the 111 predictors, 35 showed significant cor- 
relations with at least one of the two criteria. 
These are shown in Table 5.* 
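
For readers who want to check an individual coefficient, the usual test of a correlation against zero converts r to a t ratio with N − 2 degrees of freedom. The sketch below assumes that conventional test (the text does not say how significance was evaluated), and the function name is hypothetical.

```python
import math


def t_for_r(r, n):
    """t ratio for testing a product-moment r against zero (df = n - 2)."""
    if n < 3 or not -1.0 < r < 1.0:
        raise ValueError("need n >= 3 and -1 < r < 1")
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


# MMPI F scale against Average Adjustment, as reported in Table 5.
t_f_scale = t_for_r(-0.57, 36)

# MMPI Pt scale against Average Adjustment, as reported in Table 5.
t_pt_scale = t_for_r(-0.27, 36)
```

With 34 degrees of freedom the two-tailed .05 critical value of t is about 2.03, so the first coefficient is well beyond it and the second falls short, in line with the asterisks in the table.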


DISCUSSION 
Problem of Returns 


The failure to find any predictors which 
distinguished those who are most likely to 
stay out is disappointing. The payoff on re- 
search such as this is to find factors which 
induce rehospitalization. Apparently none of 
a series of extensive measures taken at the 
time the patient departs can tell us whether 
or not he is likely to come back. In fact, not 
even the one-month observations made at 
the patient’s residence were even remotely 


*For comparison, the remaining 76 predictors 
showing no significant criterion correlations were 
(numbers in parentheses represent the total number 
of predictors of that classification): MMPI (40)—K, 
Hs, Hy, Pd, Mf, Ma, ?, L, D (Subtle), Hy (Subtle), 
Pd (Subtle), Ma (Subtle), Hy (Obvious), Cn, R, 
Mees 2, Mees 3, IEs, ECs, MH, AFR, R-X, Lb. 

16 PF (16)—Cyclothymia (A), Intelligence (B), 
Emotional Stability (C), Dominance (E), Surgency 
(F), Responsibility (G), Adventurousness (H), Sensitivity 
(I), Suspicion (L), Bohemianism (M), Anxiety 
(O), Radicalism (Q1), Will control (Q3), Tension 
(Q4). 

Rotter (9)—Times bid reached (A), Mean bid 
(B), Mean weighted score (C), Mean difference (D), 
Number of typical shifts (F), Mean aspiration dis- 
crepancy (I). 

Interview Ratings (20)—Evasiveness, Lack of rap- 
port, Disorientation, Disorganization of thought, 
Hallucinations, Suspicion, Depression (observed), 
Depression (admitted), Memory defects, Anxiety 
(observed), Anxiety (admitted), Physical complaints, 
Motivation, Specificity of goals, Desire for release, 
Hostility. 

Behavioral Skill Ratings (12)—Aide MACC, Mo- 
tility, Affect, Cooperation, Communication, Total, 
Activity therapist MACC, Affect, Communication, 
Total. 

Demographic Background (13)—Age at release, 
Length this hospitalization, Number of dependents, 
Diagnosis, Reason for release, Type of release, 
Guardianship, Marital status. 

predictive. Return is not only unpredictable 
from our observation of the patients themselves 
but also from our observation of the 
stresses of the environment. Larger samples, 
up in the hundreds, might have pushed some 
of the differences found into the region of 
statistical significance. But any predictive 
significance which was of practical use should 
have shown up in this smaller sample. 

In spite of the failure of one-month ratings 
of support to predict, it may be that the 
factors which lead to rehospitalization are, as 
Freeman and Simmons (1958) suggest, largely 
exterior to the patient. Two men may have similar 
competence in coping with the same tasks, 
but one of the two may be called on to support 
a family of 10 while the other may be living 
with supportive parents on a farm where he 
is adequate to all the chores he is called upon 
to do. The first man may be brought back 
to the hospital by a family which sees him 
as failing to meet their demands, while the 
second man may be tolerated in a situation 
where almost no demands need to be made 
upon him. The relations are probably a good 
bit more complex than this: living situations 
which make high demands upon the returning 
patient are also situations in which the 
patient is needed, while those which require 
little of the patient are ones which can do 
without him. A man’s return to the hospital 
probably depends not only on how adequately 
he responds to the demand of his environment 
but also upon how much the environment 
needs what responses he can give. 
More detailed and more subtly informed 
study is required not only of the ex-patient 
but also of the social situation to which he is 
returning. 

The failure to predict return contrasts with 
the relative success with which year-end 
adjustment could be predicted for those who 
stayed out on their own. Even though we 
cannot tell, as the patients leave the hospital 
or even a month later, which ones are 
likely to return, we apparently can, from 
tests, interviews, behavior ratings, and demographic 
measures, make some good guesses 
about how well they will be doing at year end 
if they manage to stay out. 


Psychopathology versus Social Adjustment 


The data provide some basis for judging 
Ellsworth and Clayton’s (1959) contention 
that social adjustment is a better indicator of 
whether or not a patient is likely to make a 
good posthospital adjustment than are observations 
of psychopathology. The MACC made 
out on the basis of ward behavior shows that 
only the added Sociability scale correlates 
significantly with the criteria. Of the MACC 
scores derived from the activity therapist only 
3 of a possible 12 correlations with criteria 
are significant. (One of these was with the 
Sociability scale which was not included with 
the original MACC.) There is, on the other 
hand, a somewhat greater proportion of significant 
relationships derived from the interview 
and test data, particularly from the 
MMPI. Of the interview data it is precisely 
the ones which are most indicative of psychopathology 
(e.g., withdrawal, bizarre positions 
or movements, apathy) which are most predictive 
of poor adjustment. On the MMPI the 
evidence is even more striking. The highest 
correlations in the entire study are with F and 
Sc, the two scores most indicative of bizarre 
symptomatology. 

The predictive efficiency of measures of 
psychopathology can be compared with that 
of measures of social skills by means of the 
multiple correlation technique. Year-end 
Average Adjustment is significantly related 
to six measures of psychopathology from the 
MMPI and the interview ratings: F, Sc, Pa, 
and rated Withdrawal, Bizarreness of movements, 
and Apathy. Using the method of 
Guilford (1956) these measures were multiply 
correlated with Average Adjustment. The 
measures showing the highest criterion correlations, 
F and Sc, were first used as predictors, 
and the other measures were added 
until the multiple correlation coefficient (corrected 
for degrees of freedom in small samples) 
showed no increment. Thus, only those 
predictors having some degree of independent 
contribution to the R were retained. Among 
the psychopathology measures F, Sc, Withdrawal, 
and Apathy survived this predictor 
selection procedure. Their multiple correlation 
with year-end Average Adjustment comes 
to .76 (N = 41). Four of the MACC scores 
are significantly related to Average Adjustment. 
The predictor procedure selects the 
Activity Therapist Cooperation score and the 
Ward Aide Sociability score. The multiple 
correlation of these two measures with Average 
Adjustment is .53 (N = 36). Using 
Meredith’s method of evaluating differences in 
multiple correlations (Talbott, 1960), the 
difference between the multiple prediction 
from psychopathology and the multiple prediction 
from social skills has a probability 
just over .05. 
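
The stopping rule in this selection procedure depends on a multiple R that has been shrunk for the number of predictors. A minimal sketch of such a correction follows. It assumes the familiar degrees-of-freedom adjustment, 1 − (1 − R²)(N − 1)/(N − k − 1) for k predictors; the text does not state which formula was applied or whether the reported values are already corrected, and the function name is hypothetical.

```python
import math


def shrunken_r(multiple_r, n_cases, n_predictors):
    """Multiple R corrected for degrees of freedom (assumed formula).

    Uses 1 - (1 - R^2) * (N - 1) / (N - k - 1); a negative corrected
    R-squared is reported as zero.
    """
    df_residual = n_cases - n_predictors - 1
    if df_residual <= 0:
        raise ValueError("more cases than predictors plus one are required")
    corrected_r2 = 1.0 - (1.0 - multiple_r ** 2) * (n_cases - 1) / df_residual
    return math.sqrt(max(corrected_r2, 0.0))


# Treating the reported values as uncorrected, purely for illustration:
psychopathology = shrunken_r(0.76, n_cases=41, n_predictors=4)
social_skills = shrunken_r(0.53, n_cases=36, n_predictors=2)
```

Because the penalty grows with each added predictor, a measure that raises the raw R only slightly can lower the corrected value, which is the point at which the procedure described above stops adding predictors.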

One apparently strange result of the pre- 
dictive power of these psychopathology meas- 
ures is shown in the correlations with the 
year-end MACC Total Adjustment score. 
None of the MACC ratings at departure suc- 
cessfully predict this, but a number of the 
psychopathology measures do. The MACC 
does not predict the MACC. Part of this 
may be due to unreliability in the MACC 
(though the significant correlations of pre- 
dictor MACCs with the Average Adjustment 
would belie this notion). It is simply more 
likely that year-end social skills are less 
adequately predicted by social skills at dis- 
charge than they are by discharge measures 
of psychopathology. 


Demographic Measures—Tests and Interviews 


Although measures of pathology may be 
more predictive than measures of social skills, 
it may be that demographic measures are 
even more effective. Of the demographic data 
selected for study, a good proportion (5 
variables out of 13) show significant criteria 
prediction. Patients showing poor year-end 
adjustment are more likely to have been dis- 
turbed for a number of years and to have been 
hospitalized a number of times. They are 
more likely, too, to have been committed, to 
be receiving 100% service connected dis- 
ability compensation, and to have been de- 
clared incompetent. In other words, they have 
been previously judged to have shown inade- 
quate adjustment to society for a longer 
time and more frequently. It would seem, 
then, that demographic information is likely 
to be predictive of adjustment among patients 
who stay out even if it does not predict who 
will return. All of these demographic measures 
are ones which indicate poor adjustment in 
the past and are probably related to present 
psychopathology. 

The combined demographic measures pre- 
dict remarkably well in this sample. Using 
the three predictors which survive the selec- 
tion process—time since first mental hospitali- 
zation, whether committed by a court, and 
competency at the time of leaving the hos- 
pital—the multiple correlation in this sample 
of 41 cases goes all the way up to .91. This 
high value may not be found in other studies, 
but from this sample it looks very much as 
though these do even better than the com- 
bined test and interview measures of psycho- 
pathology. The difference between the two 
multiple correlations has a chance probability 
of less than .01. It should be noted that the 
three demographic measures which show the 
highest correlations with criteria and which 
are picked by the selection process as part 
of the multiple correlation are not, strictly 
speaking, measures of social role as marital 
status or number of dependents might be. 
They seem rather to be the distillation of society's 
long-term estimate of the patient's behavior 
pathology. 

The tests vary greatly in their effectiveness. 
The one score from the Sentence Completion 
Test predicts one of the criteria. Better 
adjusted patients are more “coping.” Out 
of the possible 40 variables from the MMPI, 
17 show significant prediction while 3 of a 
possible 9 variables from the Rotter Board 
are significant. On the other hand, only 2 of 
the 16 PF scores show any significant prediction. 

The interview ratings seem as a whole to 
have a relatively poor batting average. Of 20 
different variables scored on the Symptom 
Rating Sheet only 4 showed significant rela- 
tionships with criteria. 

In sum, it would seem that the best predic- 
tors of year-end adjustment are those which 
can be most expeditiously obtained. The demo- 
graphic data can be found from the clinical 
records, and the MMPI and the Sentence 
Completion can be administered as group 
tests. Only the Rotter Board is very extravagant 
of staff time. 


Factor Scales versus Criterion Scales 


In selecting the test scores which were to 
be used, one consideration was to test the 
relative predictive efficiency of scales which 
were intended to be relatively pure measures 
of analytically derived factors and of scales 
which were derived either from criterion 
groups or by a priori considerations. The 16 
PF is a test which is composed largely of 
pure first order factor scales while the M1, 
M2, M3, and R scales are designed to be factor 
scales on the MMPI. 

In general it would seem that factor scales 
do somewhat less well than do the criterion 
scales. The relative inefficiency of the 16 PF 
—admittedly in a short, relatively unreliable 
form—has already been noted. It would seem 
that the most predictive scales of the MMPI 
are those which are heavily weighted with the 
first factor, the one which has been called 
variously “General Complaint” or Social
Desirability. Yet it is not the scales which might
be most purely loaded on this factor, M1 and
SD, which give the best prediction. Instead
of these factorially pure scales it is the old
“impure” F and Sc scales which are most
likely to give good prediction.

The optimal combination of the three factor
scores M1, M2, and M3 for the prediction
of the two year-end criteria can be computed.
The multiple correlation of these three scales
with Average Adjustment comes to .44 while
the correlation with MACC Total is .49 (N
= 36). The single scales F, Pa, and Sc all
exceed these correlations with both criteria.
This suggests that there are probably scale
specifics in these three scales which make for
their relatively efficient prediction. A combination
of F and Si correlates .60 (N = 36)
with the Average Adjustment criterion. This
is greater than the .44 multiple prediction of
the three factor scales but not significantly so.


ALTERNATIVE RESPONSES TO DISSONANCE 1


IVAN D. STEINER AND EVAN D. ROGERS


University of Illinois 


50 male and 50 female Ss were placed in an experimental situation in which
they found their judgments contradicted by a respected associate of the same
sex. Ss were free to resolve the dissonance by conforming to the contrary
judgments of the associate, rejecting the associate as one who was less
competent than he had been thought to be, underrecalling the disagreements, or
devaluating the importance of the topics about which disagreements had
occurred. Female Ss made less use of rejection than did male Ss and were
more inclined to tolerate the conflict. Other findings suggest that individuals
are inclined to employ the 4 responses as alternative means of reducing
dissonance rather than as supplementary means. Finally, correlations relating
the MA scale to conformity, underrecall, and tolerance were significantly
different for the 2 sexes, suggesting that the effect of anxiety upon Ss’ choice of
dissonance reducing response depends upon the sex of the Ss.


In earlier papers Steiner and Peters 
(1958) and Steiner (1960) have suggested 
that conforming behavior may profitably be 
viewed as one of several responses to dis- 
sonance. The individual who finds himself in 
disagreement with a trusted associate may: 
conform; reject the associate as one whose 
competence is lower than it had been 
thought to be; misperceive or underrecall the 
disagreements with the associate; devaluate 
the importance of the topics about which 
disagreements have occurred; communicate 
to the associate in an effort to change his 
opinion; or tolerate the dissonance, at least 
temporarily. It should be noted that all but 
the last of these responses are means of 
reducing dissonance, and that the individual 
who produces one of them to a pronounced 
degree may have no need to produce the 
others.

1 This project was supported by a grant (M-4460)
from the National Institute of Mental Health,

Whether or not individuals actually tend
to employ only one dissonance reducing
response when confronted by interpersonal
conflicts is a question for empirical research.
Unequivocal evidence in favor of either a
positive or negative answer to this question
is lacking, but at least a few research find- 
ings can be interpreted to imply that the 
simultaneous use of several responses may 
be rather common. Asch (1956) noted that 
some of his subjects not only conformed but 
also underrecalled instances in which they 
had disagreed with the majority. Wiener 
(1959) found that subjects who altered their 
judgments to conform to fictitious norms 
also tended to recall fewer initial disagree- 
ments with those norms. However, this find- 
ing may reflect a rationalization for compli- 
ance rather than the use of underrecall as 
a supplementary means of reducing dissonance.
Positive correlations were obtained
by Pepitone and Wilpizeski (1960) in a
study which analyzed conforming and
rejecting responses. In this study subjects
heard their own opinions contradicted by
accomplices of the experimenter. Those
subjects who subsequently gave low evaluations
of their own judgments tended also
to indicate hostility toward the accomplices.
Other studies have yielded negative
correlations between responses to interpersonal
conflict, and have suggested that individuals
who produce one or two of the responses to
a marked degree tend not to produce the
others. Kelley and Woodruff (1956) obtained
a negative correlation between conformity
scores and measures of the subjects'
tendency to minimize autistically the
discrepancies between the judgments which
they and their associates had expressed.
Zimbardo (1960) used experimental manipu- 
lations to vary the subjects' involvement in 
an issue about which they were subse- 
quently told that their friends held opinions 
which were contrary to their own. Subjects 
who received the high involvement treat- 
ment, and who were presumably not free to 
devaluate the topic, conformed more than 
did other subjects. Steiner (1960) attempted 
to measure conformity, devaluation, rejec- 
tion, and underrecall of disagreements in a 
single study. Most of the resulting correla- 
tions between pairs of these responses were 
negative, but with small samples they did 
not achieve satisfactory levels of significance.
Allyn and Festinger (1961) found that 
experimental manipulations which increased 
the subjects’ propensity to conform to a 
persuasive communication did not increase 
their willingness to reject the communicator. 
Conversely, experimental conditions which 
favored rejection of the communicator did 
not accentuate conformity. 

Because of the paucity and contradictory 
character of the relevant findings, no firm 
conclusions can be reached concerning the 
simultaneous versus alternative use of
dissonance reducing responses. Probably there
are conflict situations which encourage most
individuals to employ a single response, and
other situations in which most people will
use several responses simultaneously. But it
seems reasonable to assume that human
beings learn to “prefer” some dissonance
reducing responses over others, and that one
individual's learned “response hierarchy”
may differ from another’s. Some of the
writings of Horney (1937) and Fromm
(1941) suggest that such individual
differences are very common and constitute
highly important aspects of personality.
Steiner (1960) has reported data which
suggest that there may be marked sex
differences in preferences for dissonance
reducing responses, and that “maladjustment”
among males may involve response
preferences which are indicative of good
adjustment among females.

The study reported in this paper has three
aims. It investigates the simultaneous versus
alternative use of dissonance reducing
responses in a situation where subjects are
relatively free to use either one or several 
means of reducing dissonance. It explores 
sex differences in the use of such responses. 
And it examines relationships between mani- 
fest anxiety (MA scale) scores and reliance 
upon each of four dissonance reducing 
responses. 

Manifest anxiety is regarded as an important
variable in the present study because
anxiety is commonly conceded to have strong 
effects on people's interpersonal behaviors, 
and to be accentuated by interpersonal 
conflicts. In addition, the MA scale has been 
found by Goldberg, Hunt, Cohen, and 
Meadow (1954) to be positively correlated 
with conformity among females and nega- 
tively correlated with conformity among 
males. A subsequent study by Tuddenham 
(1958) obtained moderate but not signifi- 
cant negative correlations for both sexes; 
and Mangan, Quartermain, and Vaughan 
(1960) obtained a significant negative rela- 
tionship between the MA scale and con- 
formity for a sample which included both 
male and female subjects. It is hoped that 
a study which takes explicit account of the 
subjects' nonconforming reactions to dissonance
will provide insight into the possible
meaning of these seemingly contradictory 
findings. 

METHOD 
Laboratory Situation 


Subjects. Fifty male and 50 female students in 
the introductory psychology classes at the Univer- 
sity of Illinois served as experimental subjects in 
this study. They received research credit for their 
participation. 

Each subject came alone to the laboratory at a 
designated time and was joined shortly thereafter 
by another student of the same sex who was intro- 
duced as the second subject in a study of two- 
person groups. Actually the second student was an 
accomplice of the experimenter who had been trained 
to enact a prescribed role. The same two students 
(one male and one female) served as accomplices 
throughout the study. 

Manipulation of the subject’s impression of the 
accomplice. The experimenter explained that the 
study was concerned with “how people form im- 
pressions of one another” and that he would ask 
each subject to tell the other certain facts about 
himself. Specifically, the subject and the accomplice
were asked to reveal their hometown, their major,
and their class level at the university, whether their
academic average was higher or lower than “B,”
and whether they had ever been the president of a 
fraternity or social organization on campus. The 
accomplice always asserted that he was a Junior 
from Chicago majoring in physics with an average 
higher than “B.” Although he said that he had 
never been the president of a campus social organi- 
zation, he volunteered that he had been a vice 
president. These answers were intended to create a 
favorable impression of the accomplice, and to 
convince the subject that the accomplice was a 
highly competent person. 

The subject and the accomplice were next asked 
to rate one another on a series of six-step graphic 
rating scales. Five of these scales called for judg- 
ments about the other person’s knowledge and 
skill in dealing with topics on which the subject 
and the accomplice were subsequently to be tested. 
Nine other scales called for evaluations of the other 
person’s personality along continua labeled patient- 
impatient, careless-careful, undependable-responsible, 
etc. These 14 scales were scored by assigning 
numerical values ranging from 1 (for a least favorable
rating) to 6 (for a most favorable rating).
Male subjects gave the accomplice an average rating 
of 4.8 on these scales, and female subjects gave 
the accomplice an average rating of 4.9. Although 
no control data are available for comparison, these 
ratings are high enough to suggest that the accom- 
plice was generally perceived rather favorably. 

Conformity test. The experimenter next an- 
nounced that as a means of providing the subjects 
with additional information about one another he 
was going to administer a test to which the subjects 
would respond orally. This test was said to have 
been developed at Yale University and to measure 
pertinent characteristics of individuals. However, the 
experimenter asserted that he would not tell the 
subjects what these characteristics might be. Test 
items were of the multiple-choice type and in- 
cluded items from IQ tests (number series and 
figure series), questions concerning animals, geo- 
metric designs, well known personages, and news 
events. Each test item was printed on a card approximately
10 × 12 inches in size. Three answers
to each question were printed on each card, and the 
subject and the accomplice were asked to announce 
their choices orally. In a casual manner the ex- 
perimenter asked the accomplice to always give his 
answers first in order to facilitate the recording 
of replies. Although the accomplice and the subject
were told that the cards would be exposed for 30 
seconds, they usually announced their answers after 
only 10-15 seconds, and the experimenter proceeded 
to expose the next card. Cards were exposed in 
the same predetermined order in all experimental 
sessions. 

The accomplice’s answers to the first 10 cards 
were those which had most often been given by 
pretest subjects during preliminary testing sessions. 
Thirteen of the accomplice’s answers to the remaining
22 cards were those least often given by pretest
subjects. The accomplice's answers were the
same in all experimental sessions, the popular and
unpopular answers to the last 22 cards occurring
in an approximately random but predetermined
order. The conformity score is based on the
subject's responses to these last 22 items. Each time the
subject agreed with the accomplice's popular answer
he was scored as having agreed once. Each agreement
when the accomplice gave a least popular
answer was counted as two agreements. The
subject’s conformity score is the sum of these weighted
scores.
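
The weighted scoring rule can be illustrated with a short Python sketch. The function name, the variable names, and the answer data are hypothetical and invented for illustration; only the weights (one point for agreeing with a popular answer, two points for agreeing with a least popular answer) and the counts (22 scored cards, 13 of them answered unpopularly) come from the procedure described above.

```python
def conformity_score(subject_answers, accomplice_answers, unpopular_items):
    """Sum weighted agreements over the scored cards.

    unpopular_items holds the 0-based positions at which the accomplice
    gave a least popular answer; an agreement there counts double.
    """
    score = 0
    for position, (subject, accomplice) in enumerate(
            zip(subject_answers, accomplice_answers)):
        if subject == accomplice:
            score += 2 if position in unpopular_items else 1
    return score


# Invented example: 22 scored cards, 13 of them answered unpopularly.
accomplice = ["a"] * 22
unpopular = set(range(13))      # hypothetical positions, not the real order
always_agrees = ["a"] * 22
never_agrees = ["b"] * 22

max_score = conformity_score(always_agrees, accomplice, unpopular)
min_score = conformity_score(never_agrees, accomplice, unpopular)
```

Under this rule a subject's score can range from 0 to 35 (9 single-weighted agreements plus 13 double-weighted ones).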

Measurement of devaluation. After the first 10 
test cards had been exposed, the experimenter asked 
the accomplice and the subject to express their 
judgments about what the test items were measur- 
ing. Ten six-step graphic rating scales were em- 
ployed for this purpose. Each scale required a 
judgment of the degree to which people's answers 
to test items indicated their IQ, general knowledge, 
academic ability, sensitivity to the feelings of others, 
etc. The scales were scored by assigning numerical 
values (1-6) to the subject's responses, and sum- 
ming across the 10 graphic rating scales. The mean 
of the male subjects' ratings was 38.8, while for
female subjects the mean was 37.4 (high scores 
indicate high evaluation of the test items). 

After all 32 cards had been exposed, the subject 
and the accomplice again responded to the scales 
which called for evaluations of the importance of 
the test. Because each subject evaluated the impor- 
tance of the test twice, it would have been pos- 
sible to measure devaluation by the difference 
between the first and second ratings. However,
subjects differed considerably from one another in
their first ratings, and the size and direction of the
discrepancies between their first and second ratings
were substantially influenced by their first ratings.
Because difference scores would have been difficult
to interpret, only the subjects’ final ratings were
used in computing devaluation scores. The devaluation
score is the sum of the numerical values of
the responses given by the subject to the 10 questions
concerning what the test items measure. The
mean and standard deviation of the devaluation
scores are reported in Table 2. Low scores indicate
large devaluation.

The mean of the devaluation scores was not
significantly different from the mean of the first
ratings, so one cannot say that there was a general
tendency for subjects to lower their estimates of
the significance of the test. However, for both sexes
the standard deviation of the devaluation scores
was larger than the standard deviation of the first
ratings (p < .05 for males; ns for females). The
correlation between first ratings and devaluation
scores was +.66 for males, +.51 for females, and
+.59 for all subjects. It should be noted that these
correlations, obtained under test-retest conditions
with an interval of approximately 20 minutes
separating the two administrations of the instrument,
are markedly lower than the corrected split-half
reliability coefficients which are .83, .88, and [illegible],
respectively. The fact that the test-retest correlations
are consistently much lower than the split-half
reliability of the instrument suggests that the
experimental situation induced subjects to change
their evaluations to a degree which cannot be
explained entirely in terms of the unreliability of
the instrument.

Measurement of underrecall. After all 32 test
cards had been exposed, the subject was asked to
estimate how many times out of 32 he and the other
student (the accomplice) had disagreed in their
answers. The subject’s estimated number of disagreements
was then subtracted from the number of
actual disagreements to obtain the underrecall score.
The mean of the underrecall scores for all subjects
was +5.08 (significantly different from zero at the
[illegible] level). Thus, it may be concluded that the
experimental situation was conducive to underrecall
of disagreements.
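
The underrecall score is a simple difference, which the following Python sketch makes explicit. The helper names and the answer data are hypothetical; only the definition (actual disagreements minus the subject's estimate, over all 32 cards) is taken from the text.

```python
def count_disagreements(subject_answers, accomplice_answers):
    """Number of cards on which the two spoken answers differed."""
    return sum(1 for s, a in zip(subject_answers, accomplice_answers) if s != a)


def underrecall_score(actual_disagreements, estimated_disagreements):
    """Positive values mean the subject recalled fewer disagreements than occurred."""
    return actual_disagreements - estimated_disagreements


# Invented example: 32 cards, 12 real disagreements, subject recalls only 7.
accomplice = ["a"] * 32
subject = ["a"] * 20 + ["b"] * 12
actual = count_disagreements(subject, accomplice)
score = underrecall_score(actual, estimated_disagreements=7)
```

A subject who overestimates the number of disagreements receives a negative score under this definition.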

It should be noted that underrecall scores and
conformity scores are not completely independent
of one another. A subject who has received a high
conformity score will ordinarily (but not necessarily)
have disagreed few times, and he will therefore
have limited opportunity to underestimate the
number of times he has disagreed with the accomplice.
However, the actual number of disagreements
with the accomplice ranged only from 8 to 19, and
the correlation between subjects' actual
disagreements and their underrecall scores was
only +.12 (not statistically significant). Consequently,
any artifactual correlation between conformity
scores and underrecall scores which might
be generated by this type of bias should not be
greater than -.12.

Measurement of rejection. Three indexes of rejection
were available. At the conclusion of the
experimental session each subject rated the accomplice
with respect to personality characteristics and
knowledge of the subject matter covered by the
test. (These ratings were made on the same
graphic rating scales as were employed at the beginning
of the laboratory session.) Each subject also
estimated how many times he believed he had been
right and how many times the accomplice had been
right in answering the “test” items. Ratings of
personality and subject matter were positively correlated
(phi = .31, p < .001) and were significantly
(p < .001 by t test) less favorable to the accomplice
at the end of the experimental session than
they had been at the beginning of the session.
Subjects’ estimates of the number of times they
and the accomplice had been right were converted
into a measure of rejection by subtracting the latter
estimate from the former. This index of rejection
was positively correlated with the index involving
knowledge of subject matter (phi = .31,
[p value illegible]) but was only minimally correlated with
the index based on ratings of personality characteristics
(phi = .08, ns). In the analysis to be described
in the remainder of this paper each subject
was assigned a single rejection score obtained
by summing his ranks on the three separate indexes
of rejection. (The subject who had the highest
score on a measure was ranked 1 on that index of
rejection. Consequently, high scores on the composite
index indicate little rejection.) The mean
and standard deviation of the composite rejection
scores are reported in Table 2.

Measurement of tolerance. All 100 subjects were 
ranked with respect to their scores on conformity, 
rejection, underrecall, and devaluation. The subject 
whose score indicated least use of a response re- 
ceived the rank of 100 for that response. Each 
subject’s four rank scores (one for each of four 
responses) were then summed to obtain a tolerance 
score. High scores indicate relatively little overall 
use of the four dissonance reducing responses. The 
mean and the standard deviation of the tolerance 
scores are reported in Table 2. 
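
The rank-sum construction of the tolerance score can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' computing procedure: the function names are hypothetical, tied subjects are given the mean of the ranks they span (the text does not say how ties were handled), and devaluation and rejection scores are negated before ranking because, as noted above, high scores on those two measures indicate little use of the response.

```python
def rank_least_use_highest(use_values):
    """Rank subjects so that the least use of a response gets the highest rank.

    The heaviest user receives rank 1 and the lightest user rank N.
    Tied subjects share the mean of the ranks they occupy (an assumption).
    """
    n = len(use_values)
    order = sorted(range(n), key=lambda i: use_values[i], reverse=True)
    ranks = [0.0] * n
    pos = 0
    while pos < n:
        end = pos
        while end + 1 < n and use_values[order[end + 1]] == use_values[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def tolerance_scores(conformity, underrecall, devaluation, rejection):
    """Sum each subject's four ranks; high sums mean little overall use."""
    use_measures = [
        conformity,                      # high score = much conformity
        underrecall,                     # high score = much underrecall
        [-d for d in devaluation],       # low score = large devaluation
        [-r for r in rejection],         # low composite = much rejection
    ]
    per_response = [rank_least_use_highest(m) for m in use_measures]
    return [sum(subject_ranks) for subject_ranks in zip(*per_response)]


# Invented data for three subjects.
scores = tolerance_scores(
    conformity=[10, 12, 8],
    underrecall=[5, 7, 3],
    devaluation=[40, 30, 50],
    rejection=[150, 100, 200],
)
```

In the invented data the second subject makes the heaviest use of every response and therefore receives the lowest tolerance score, while the third subject receives the highest.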


Second Session 


Sometime during the 8-day period which followed 
his participation in the laboratory session, each 
subject reported to a specified classroom where he 
once more answered the questions printed on the 
32 test cards. The experimenter explained that a 
preliminary inspection of the data suggested that 
the laboratory procedures had been faulty and that 
subjects who gave their answers second might in- 
advertently have allowed themselves to be influ- 
enced in one way or another by hearing their 
partners’ answers. In order to obtain valid test 
scores it was therefore desirable that each subject 
write his answers on paper without hearing anyone 
else's replies. Subjects were allowed to look at each 
card for 30 seconds but generally used only a 
fraction of the permitted time. The accomplice was 
not present at the second session, but several sub- 
jects attended the same session. They were seated 
in a manner which prevented communication. 

Finally, subjects responded to the Minnesota 
Multiphasic Personality Inventory. 


Control Group 


Twenty-three male and 24 female students from 
the introductory psychology classes served as con- 
trol subjects. During their first session they wrote 
their answers to the 32 test items on paper. These 
subjects did not receive information about anyone 
else's answers. They were permitted to see each 
card for 30 seconds. After responding to all 32 
cards they also reacted to the graphic rating scales 
which called for judgments about what the test 
measured. 

In a second session, which occurred from 1 to 8 
days later, the control subjects responded once 
more to the test items and the instrument calling 
for evaluations of what the test measured. No 
excuse for requiring them to repeat the tests was 
offered. 

The same scoring system which had been used to 
obtain conformity scores for subjects in the laboratory
sessions was used to obtain conformity scores for
experimental subjects in the second sessions, and for 
control subjects in both sessions. Even though no
accomplice was present during these sessions,
subjects were regarded as having “conformed” once for
each response which agreed with a popular answer
given by the accomplice, and twice for each response
which agreed with an unpopular answer given
by the accomplice.


TABLE 1
CONFORMITY SCORES OF EXPERIMENTAL AND CONTROL SUBJECTS

First session
Subjects           N     M       p (U test)    SD            p (F test)
Males
  Experimental     50    11.74   .01           2.56
  Control          23     9.87                 2.29
Females
  Experimental     50    12.78   [illegible]   3.66          [illegible]
  Control          24    10.04                 [illegible]
All subjects
  Experimental     100   12.26   .01           3.21          .05
  Control          47     9.96                 2.43

Second session
Subjects           N     M             p (U test)    SD      p (F test)
Males
  Experimental     50    11.62         .01           2.79
  Control          23     9.96                       2.24
Females
  Experimental     50    12.50         .01           3.70    [illegible]
  Control          24     9.42                       2.16
All subjects
  Experimental     100   12.06         .01           3.30    [illegible]
  Control          47    [illegible]                 2.23


RESULTS 


Table 1 compares the conformity scores 
of experimental and control subjects. Both 
male and female experimental subjects ob- 
tained significantly higher conformity scores 
than did their counterparts among the con- 
trol subjects. In addition, the scores of fe- 
male experimental subjects, and of all 100 
experimental subjects, have significantly 
larger standard deviations than do the scores 
of the comparable control subjects. These 
data suggest that the experimental situation 
induced a moderate degree of conformity and 
that some experimental subjects were more 
strongly affected than were others. 

It should be noted that results obtained 
for second sessions correspond very closely 
to results for the first sessions. Although the 
correlations between the scores obtained by 
experimental subjects in the first and second 
sessions were only .34 for males and .58 for 
females (among control subjects the cor- 
relations were .45 and .52, respectively), the 
experimental subjects maintained their tend- 
ency to give answers which more often 
agreed with those given by the accomplice. 
Experimental subjects also remained more 
variable in their conformity scores. Although 
the relatively low correlations between ses- 
sions indicate that considerable shifting of 
scores occurred, it is apparent that the overall
effect of the experimental situation was
not reduced during the time which elapsed 
between sessions. 

The length of time elapsing between first 
and second sessions was not found to have 
any significant relationship to changes in 
conformity scores. 

The means and standard deviations of the 
subjects’ nonconforming responses to the experimental
situation are reported in Table 2.
As in the case of the earlier study by Steiner 
and Peters (1958), females are found to be 
less similar to one another (i.e., to obtain 
scores with larger standard deviations) than 
are males, and to be less prone than males 
to reject the accomplice. In the present study 
females are also higher in tolerance than are 
males. (Technical difficulties prevented this
type of comparison in the Steiner and Peters 
study.) 


TABLE 2
MEANS AND STANDARD DEVIATIONS OF THE RESPONSE
SCORES OBTAINED BY MALE AND FEMALE
SUBJECTS

                               Response variable
               Conformity  Underrecall  Devaluation a  Rejection a  Tolerance
M
  Males          11.74        5.28         38.30         127.42     [illegible]
  Females        12.78        4.88         39.78         174.42     [illegible]
  p (U test)                                               .01      [illegible]
SD
  Males           2.56        3.66          7.52          53.06     [illegible]
  Females         3.66        4.44          7.91          62.63     [illegible]
  p (F test)       .01                                               .05

Note.—N = 50.
a High scores on this variable indicate little use of the
response.



Interrelationships of response variables are
reported in Table 3. It is to be noted that
most of the correlations are low and negative.
The correlations involving tolerance are sub- 
stantially higher (in the negative direction) 
than the others; but, since each of the other 
scores is involved in the computation of 
tolerance scores, this finding is not sur- 
prising. None of the sex differences is signifi- 
cant except the one involving conformity and 
tolerance. The correlations shown in this 
table are not greatly different from those 
obtained by Steiner and Peters (1958); and 
they are at least consistent with the conten- 
tion that conformity, underrecall, devalua- 
tion, and rejection serve as alternative means 
of reducing dissonance. 

TABLE 3
INTERCORRELATIONS OF RESPONSE VARIABLES

Response variable   Underrecall   Devaluation a   Rejection a   Tolerance
Conformity
  Males             [illegible]    .00            -.22          -.44**
  Females           -.41**        [illegible]     -.21          [illegible]
  All subjects      -.32**        -.16            -.24*         -.11
Underrecall
  Males                           -.06            -.08          -.29*
  Females                         -.03            +.16          -.49**
  All subjects                    -.04            +.08          -.39**
Devaluation
  Males                                           [illegible]   -.44**
  Females                                         -.22          -.33*
  All subjects                                    -.13          -.40**
Rejection
  Males                                                         -.28*
  Females                                                       -.55**
  All subjects                                                  -.46**

a In order to make this table more readily understandable, the signs of
the correlations in this column have been reversed. Because high scores
indicate little use of the response reported in this column, unreversed
signs would tend to misrepresent the direction of the relationships
between this response and others with which it is compared.
* p < .05.
** p < .01.


TABLE 4
RELATIONSHIPS BETWEEN SUBJECTS’ RANKS ON ONE
RESPONSE AND THE SUM OF THEIR RANKS
ON THREE OTHER RESPONSES

Variables being compared                      χ²
Conformity and (Underrecall
+ Devaluation + Rejection)
  Males                                        .85
  Females                                     4.09*
  All subjects                                4.92*
Underrecall and (Conformity
+ Devaluation + Rejection)
  Males                                        .38
  Females                                     3.92*
  All subjects                                3.17
Devaluation and (Conformity
+ Underrecall + Rejection)
  Males                                       1.67
  Females                                     5.99**
  All subjects                                5.16*
Rejection and (Conformity
+ Underrecall + Devaluation)
  Males                                        .64
  Females                                      .06
  All subjects                                 .46

Note.—Subjects were dichotomized as closely as possible to
the median. Probabilities were obtained through the use of chi
square with one degree of freedom.
* p = .05.
** p = .02.


Table 4 presents the results of another
analysis of the interrelationships of response
variables. If subjects tend to use the
responses as alternative means of reducing
dissonance, subjects who rank high in the
use of one response should tend to rank
low in the use of the others. As a test of
this prediction, the subjects’ ranks on one
response were compared with the sum of
their ranks on the other three responses.
Table 4 reports the results of chi square
tests in which subjects were dichotomized
as closely as possible to the median both on
the single response and on the sum of their
ranks on the other three responses. (In a few
cases tied ranks prevented the use of exactly
balanced dichotomies.) Although none of the
relationships is significant for male subjects,
three of the four are significant for females,
and two of the four are significant for the
entire sample. All relationships, whether
significant or not, are in the anticipated
direction; subjects who are above the median in
their use of one dissonance reducing response
tend to be below the median in their use of
the other three.
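
The median-split comparison summarized in Table 4 can be sketched as a fourfold chi square in Python. This is a minimal illustration under assumptions: the function name and data are invented, subjects are split strictly above versus at-or-below each median, and no continuity correction is applied (the text does not say whether one was used).

```python
from statistics import median


def median_split_chi_square(single_response, sum_of_other_three):
    """Chi square (1 df) for a 2 x 2 table formed by two median splits.

    Returns the statistic and the cell counts (a, b, c, d), where
    a = high/high, b = high/low, c = low/high, d = low/low.
    """
    m1 = median(single_response)
    m2 = median(sum_of_other_three)
    a = b = c = d = 0
    for x, y in zip(single_response, sum_of_other_three):
        high_x, high_y = x > m1, y > m2
        if high_x and high_y:
            a += 1
        elif high_x:
            b += 1
        elif high_y:
            c += 1
        else:
            d += 1
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("a marginal total is zero; chi square is undefined")
    return n * (a * d - b * c) ** 2 / denominator, (a, b, c, d)


# Invented data: eight subjects whose use of one response is perfectly
# opposed to their combined use of the other three.
chi_square, cells = median_split_chi_square(
    [1, 2, 3, 4, 5, 6, 7, 8],
    [8, 7, 6, 5, 4, 3, 2, 1],
)
```

A statistic above 3.84 corresponds to the .05 level with one degree of freedom; the direction of the relationship must be read from the cell counts, since chi square itself is unsigned.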

TABLE 5
CORRELATIONS BETWEEN TAYLOR MA SCALE AND
RESPONSES TO INTERPERSONAL CONFLICT

                            Correlation of the MA scale with:
                 N     Conformity  Underrecall  Devaluation a  Rejection a  Tolerance
Males            50    -.15        -.31*        +.01           -.14         -.05
Females          50    +.28*       +.15         -.01           -.36***      +.33**
All subjects     100   +.03        -.25**       +.06           -.23*        +.05
Significance of
sex difference          .01         .01          -              -            .01

a The signs of the correlations in this column have been
adjusted so that they accurately report the relationship between
the MA scale and response to interpersonal conflict.
* Significantly different from zero at the .05 level.
** Significantly different from zero at the .02 level.
*** Significantly different from zero at the .01 level.


Additional evidence of the subjects' propensity
to use the four responses as alternative
ways of resolving dissonance is obtained
by counting the number of subjects who
ranked high in their use of only one of the
four responses. In conducting this analysis
it seemed desirable to regard any subject
who ranked above the seventy-fifth percentile
in his use of a response as having
ranked high on that response. However, because
there were many tied ranks in the
region of the seventy-fifth percentile, it was
sometimes necessary to draw the dichotomies
slightly above or below the seventy-fifth
percentile. In the case of all four responses
the dichotomies were drawn as close as possible
to the seventy-fifth percentile. Fifty-three
subjects were found to have scored
above the cutting point on only one response.
By chance, a total of 41.98 subjects should
have been above the cutting point on only one
response. This difference is significant at
the .01 level by chi square test. When males
and females are considered separately the
same trends appear, but the obtained frequencies
are not significantly greater than
might be expected to occur by chance.

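
The chance expectation for scoring high on exactly one of four responses can be worked out as follows. The sketch assumes the four dichotomies are statistically independent and, for illustration, that each cuts off exactly the top 25% of subjects; the function name is hypothetical. With exact 25% cuts the expectation is about 42.19 subjects out of 100, so the reported figure of 41.98 presumably reflects the actual cutting points, which fell slightly off the seventy-fifth percentile.

```python
def prob_exactly_one_high(proportions_high):
    """P(high on exactly one response) for independent dichotomies.

    proportions_high[i] is the proportion of subjects above the cutting
    point on response i.
    """
    total = 0.0
    for i, p in enumerate(proportions_high):
        term = p
        for j, q in enumerate(proportions_high):
            if j != i:
                term *= 1 - q      # low on every other response
        total += term
    return total


probability = prob_exactly_one_high([0.25, 0.25, 0.25, 0.25])
expected_subjects = 100 * probability
```

Passing the four observed proportions instead of 0.25 would give the expectation for cutting points that are not exactly at the seventy-fifth percentile.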
Table 5 reports relationships between the 
Taylor MA scale and responses to the ex- 
perimentally created conflict. It is to be 
noted that MA scale scores are negatively 
correlated with underrecall scores among 
males and positively correlated with con- 
formity and tolerance scores among females. 
The MA scale is also negatively related to
rejection among females, but the correlations
for male and female subjects are not significantly
different (.05 < p < .10). The
overall pattern of the results reported in 
Table 5 suggests that the effect of anxiety 
(as measured by the MA scale) on the sub- 
ject’s choice of dissonance reducing response 
depends greatly upon the sex of the subject. 
Anxious females tend to conform or to toler- 
ate the conflict; they avoid rejection. As a 
group, anxious males tend not to emphasize 
the use of any single response, but they avoid 
the use of underrecall.

Although MA scale scores constitute the
primary personality data of this report, the
subjects’ scores on nine diagnostic subscales
of the MMPI (Hs, D, Hy, Pd, Mf, Pa, Pt,
Sc, and Ma) were also computed and compared
with the earlier findings of Steiner
and Peters (1958). As in the case of the
earlier study, very few correlations between
MMPI subscale scores and responses to dissonance
were found to be significantly different
from zero. In the present investigation
only 3 of 90 correlations reached the .05
level of significance, and these 3 can obviously
be regarded as chance occurrences.
However, sex differences which were suggested
by the earlier study were evident also
in the new data. Correlations between scores
on MMPI subscales and responses to dissonance
tended to be more negative (or less
positive) for males than for females. This
was true of 29 of 45 pairs of correlation
coefficients. Examination of the correlations
involving MMPI scores and conformity
scores revealed that these correlations were
less positive for males than for females nine
times out of nine. In the case of correlations
relating MMPI scores to tolerance, the male
correlations were less positive seven times
out of nine. These data, like those yielded
by the earlier study, suggest that the dynamics
of dissonance reduction are different
for the two sexes.
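
The counts of male-versus-female comparisons given above (29 of 45, nine of nine, seven of nine) are reported without a named significance test. One conventional way to gauge such counts is an exact sign test against a chance proportion of one half. The sketch below uses that test purely as an assumption. It also ignores the fact that the pairs of correlations come from the same subjects and are therefore not independent, so the resulting probabilities are at best a rough guide. The function name is hypothetical.

```python
from math import comb


def sign_test_p(k, n):
    """Two-sided exact binomial p value for k 'hits' in n pairs, chance = 0.5."""
    tail = min(k, n - k)
    # Probability of a result at least as extreme in one direction.
    one_sided = sum(comb(n, i) for i in range(tail + 1)) / 2 ** n
    # Double for the two-sided test, capping at 1.
    return min(1.0, 2 * one_sided)


for label, k, n in [("all subscales and responses", 29, 45),
                    ("conformity", 9, 9),
                    ("tolerance", 7, 9)]:
    print(label, round(sign_test_p(k, n), 4))
```

On these assumptions only the nine-of-nine result for conformity is clearly beyond chance; the other two counts point in the same direction without reaching the .05 level on a two-sided test.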


Discussion 


This study has examined responses to dissonance
under somewhat unusual circumstances.
The experimental situation was
deliberately devised to permit free use of
four different dissonance reducing responses,
and pretests were run to determine that each
of these responses was, in fact, made by
some subjects. In many experimental situations,
and in many situations outside the
laboratory, circumstances undoubtedly favor
the use of some responses and virtually prohibit
the use of others. Furthermore, this
study has examined responses to only one
type of dissonance: that which occurs when
the individual finds his opinions contradicted
by a respected associate of the same sex.
Additional research is needed to determine
the range of circumstances within which the
findings of this study may safely be generalized.

The tolerance scores employed in this 
study reflect the tendency of subjects to fall 
at the bottom of four distributions of re- 
sponse scores. Although these tolerance scores 
indicate the position of subjects relative to 
one another, they do not measure tolerance 
in an absolute sense. Consequently, it is not
possible to conclude with certainty that the
overall level of tolerance was high or low. 
However, the fact that none of the 100 sub- 
jects fell in the bottom half of all four dis- 
tributions of response scores suggests that 
steadfast tolerance may have been a fairly 
rare reaction to the experimentally induced 
dissonance. Indeed, the tolerance scores em- 
ployed in this investigation probably convey 
an exaggerated picture of the degree of 
tolerance which actually prevailed. If a 
subject scored high in his use of one response 
and low on the other three, his relatively 
high tolerance score would suggest that he 
had successfully withstood the dissonance 
created by the experimental situation. Actu- 
ally, his strong use of one response (say con- 
formity or rejection) may have been suf- 
ficient to completely resolve the dissonance. 
Because subjects in this study more often 
ranked high on only one response than might
have been expected by chance, it is possible 
that many subjects who resolved the dis- 
sonance may have received fairly high 
tolerance scores. 

In spite of these limitations, this study
yields insights which may help to explain
some of the apparent inconsistencies in the
findings of research dealing with dissonance
reduction. The data of the present study
suggest that when people have freedom to
use any of several means of reducing dissonance,
different subjects emphasize different
means. Subjects tend not to use all
available responses to an equal degree; instead,
they emphasize the use of some and
minimize the use of others. This phenomenon
suggests that individuals’ responses to dissonance
are governed by “habit family hierarchies”
which have been learned through
earlier experience with dissonance. Undoubtedly
these learned hierarchies of responses
vary not only across different individuals
but also for a single individual as he encounters
radically different types of dissonant
situations. But it would be uneconomical
for the individual to attempt to
discriminate between every pair of situations 
he encounters or to employ dissonance re- 
ducing responses in a purely random manner. 
Consequently, it seems reasonable to postu- 
late that individuals learn response hier- 
archies which are applied in broad classes 
of situations. 

The individual's response to a dissonance 
producing situation need not always be the 
one which is highest in his response hier- 
archy. Sometimes the situation makes it 
very difficult to employ a favored re- 
sponse. Conformity is difficult if it involves 
behaviors which contradict strongly held 
beliefs. Rejection is unlikely to occur if the 
other parties to the conflict are regarded as 
experts. Devaluation will rarely be extreme 
when the topics are ones which the individual 
has long regarded as highly important. The 
literature contains adequate evidence of this 
type of situational constraint on the indi- 
vidual's freedom to use specific responses. 

If the above discussion is substantially 
correct, it should not be difficult to under- 
stand why the literature relating personal- 
ity variables to conformity contains many 
seeming contradictions. Different experiment- 
ers have tended to employ different experi- 
mental situations. Although subjects have 
almost invariably been free to conform, it 
is not at all clear that they have been free 
to employ their preferred nonconforming 
response. In some studies the alternatives 
open to the subjects have probably been 
conformity versus rejection. In other studies 
the alternatives have probably been con- 
formity versus devaluation, or conformity 
versus some other response. Individuals who 
conform when their only alternative is to 
reject may not conform when they are free 
to devaluate or underrecall. (Data from the 
present study suggest that this may be true 
of many female subjects, and especially true 
of females with high MA.) Correlations of 
conformity with personality variables may 
therefore vary depending upon which nonconforming
responses are available, and
experimental manipulations which affect the
availability of one dissonance reducing
response may influence the frequencies with 
which others will occur. Unfortunately, it is 
difficult to determine, or even to guess, which 
dissonance reducing responses have been per- 
mitted in various studies, and an ex post 
facto analysis of contradictory findings is 
therefore of limited value. 

The findings of this study are consistent 
with those of Goldberg et al. (1954) who 
found MA scale scores to be positively cor- 
related with conformity among female sub- 
jects, and negatively correlated with con- 
formity among male subjects. Tuddenham’s 
(1958) nonsignificant negative correlations 
for both sexes were obtained from subjects 
who were considerably older and of more 
varied socioeconomic status than were the 
college students used in this study and in the 
study by Goldberg et al. Whether the dif- 
ferences between Tuddenham’s findings and 
those of the present study are due to situa- 
tional factors which selectively restricted the 
availability of certain responses, or are due 
to differences between the “habit family 
hierarchies” of subjects with different back- 
grounds, is a problem for future research.


CREATIVITY AND THE NEED FOR NOVELTY *


JOHN P. HOUSTON AND SARNOFF A. MEDNICK
University of Michigan 


An experiment designed to demonstrate the reinforcing properties of as- 
sociative novelty for creative Ss. 60 undergraduates were separated into high 
(HC) and low (LC) creativity groups on the basis of their scores on the 
Remote Associates Test. Ss were then presented with a series of pairs of 
words (each pair containing a noun and a nonnoun) and asked to choose 
the word they preferred. The choice of a noun was followed by a novel 
association to that noun. Nonnoun choices were followed by common associa- 
tions. The HC group significantly increased and the LC group significantly 
decreased the frequency of their noun choices over the series of pairs. The 
results support the need for novelty hypothesis. 


A theory has been proposed which explains 
creative thinking in associative terms (Med- 
nick, 1962). The theory concerns itself with 
the details of the thought process of the crea- 
tive individual but does not deal explicitly 
with the motivational factors in creative be- 
havior. This paper is an attempt to explore 
this aspect of the problem. In particular we 
shall be testing the hypothesis that the highly 
creative person has a strong need for associa- 
tive novelty. 

Positing a need for associative novelty in
the highly creative person certainly seems a
banal hypothesis until examined a bit more
closely. There has been some considerable effort
to assert and demonstrate that the creative
individual has a preference for novelty
(Barron, 1958; Golovin, 1959; Sprecher,
1959). However, demonstration of preference
alone does not establish the existence of a
need. There are several ways that the existence
of a need as a drive state may be demonstrated.
In this experiment we have attempted
to satisfy the need and observe whether the
consequent reinforcement would increase the
probability of emittence of behavior which
immediately preceded the reinforcement. Thus
if there is a need for novel associative stimulation
then supplying such stimulation should
reduce the need (just as ingesting of food reduces
hunger). If the need is reduced then
this should produce reinforcement. To demonstrate
the reinforcing properties of novel associative
stimuli, a variation of the verbal
operant conditioning technique was employed.
Each time the subject chose the noun member
of a pair of words he was answered with
an improbable association to that noun; each
time he chose the nonnoun member of a pair,
a highly probable association was given. If the
improbable, novel association has reinforcing
properties the rate of the noun response would
be expected to increase. We predicted that
highly creative individuals would show the effect
of such reinforcement while less creative
individuals would not.


* This research was supported by the Office of Education,
Cooperative Research Branch, United States
E Health Service, through Contract Number
073 with Sarnoff A. Mednick and Martha T. Mednick.
The authors wish to express their gratitude to
Martha T. Mednick for her advice in all phases of
this study.


METHOD 


Remote Associates Test 


The theory of creative thinking which in part 
guided this research states that the creative process 
consists in the forming of associative elements into 
new combinations which meet specific requirements. 
The Remote Associates Test (RAT) is an opera- 
tional statement of this definition. In RAT test items 
the subject is provided with words from mutually 
remote associative clusters and asked to find an as- 
sociative link which combines these words in a speci- 
fied manner. Here are some examples of the type of 
item found in the test. Evidence bearing on the use- 
fulness of the test has been presented elsewhere 
(Mednick, 1962). 


Example 1: surprise line Democratic 

Example 2: rat blue cottage ? 
The test consists of 30 such items; the score is the 
number right. 


2 Answers to sample RAT items: 1, “party”; 2, “cheese.”


Subjects


Subjects were chosen from a pool of University of 
Michigan undergraduates to whom the RAT had 
been administered several months prior to the onset 
of the present study. Thirty high RAT scorers com- 
posed the high creative (HC) group, and 30 low 
RAT scorers composed the low creative (LC) group. 
Mean HC score was 24.36, with a range of 22-29, 
and the LC mean score was 12.96, with a range of 
4-19.


Materials 


The materials consisted of a series of 180 3" X 5" 
index cards, on which a pair of words was centered 
and typed one above the other. All subjects received 
the series in the same order. 

Free operant items. The first 40 items in the series 
of 180 were composed of pairs of nouns and non- 
nouns matched for frequency of occurrence by the 
Thorndike-Lorge (1944) lists. Half the nouns were 
randomly assigned to the top position on the cards.
These 40 free operant items were introduced to ob- 
tain a baseline preference for choosing nouns. To 
each of the subject’s 40 free operant choices the ex- 
perimenter responded with its most common asso- 
ciate. Common associates were defined as one of the 
three associations most frequently given by a group 
of 25 Michigan undergraduates. 

Filler items. Every ninth card in the series of 180
was printed with a pair of nonnouns. The experimenter
always responded with a common associate
to the two nonnouns. These 20 filler items, which are
disregarded in the results, were introduced in the 
hope of preventing the subject from gaining insight 
into the experimental method. These word pairs were 
chosen in the same manner as the free operant items. 

Critical items. The remaining 120 items were composed
of pairs of nouns and nonnouns taken from
the Minnesota Kent-Rosanoff norms (Russell & Jenkins,
1954) and from a set of word association norms
prepared by Deese (1960, unpublished). Each pair
was also matched for frequency of occurrence in the 
language by the use of the Thorndike-Lorge (1944) 
lists; half of the nouns were randomly assigned to 
the top position on the card. The common associa- 
tions given by the experimenter to these words were 
the associations given most frequently in the Deese 
and Kent-Rosanoff norms. Novel associations were 
defined either as the associations occurring uniquely
or not given at all in the norms. (In the latter case
the experimenter supplied the association.)


TABLE 1
MEAN AND RANGE OF RAT SCORES

                 Experimental group        Control group
RAT group        M        Range            M        Range
High             24.66    22-29            24.06    22-27
Low              13.26    4-18             12.66


Procedure 


The HC and LC groups were each randomly divided
into a control and experimental group of 15
subjects each. Table 1 presents the mean and range
of RAT scores obtained by the four groups.

All subjects were seen individually; the experi- 
menter read the following instructions: 


I am going to show you some cards, one at a
time. On each card there are two words. I want
you to look at both words and say aloud the one
you like best. In response to each word you say
aloud I will say a word, or give you an association.
For example, if I show you a card with the
words “pencil” and “telephone” on it and you
choose “pencil” I might say “paper.” If you said 
"telephone" I might say “call.” So, for each card 
you will choose one of the two words and say it 
out loud and I will give you a response word. Pay 
attention to the words and associations because I 
shall ask you to do something else later on. Any 
questions? 


The experimenter then presented the 180-card series
and recorded the number of nouns chosen by the
subject.

Each time a member of the two Experimental
groups (HC and LC) chose a noun on one of the
120 critical items, the experimenter responded with a
novel association; each time he chose a nonnoun, the
experimenter presented a common association. For
example, if a pair of words were FATHER and WHITE
and an experimental subject chose WHITE the experimenter
would say “black.” If the subject chose
FATHER, the experimenter might respond with “egg
beater.” The two Control groups (HC and LC) were
given common associations to both nouns and nonnouns.
For example, if a Control subject chose
FATHER from the above example, the experimenter
responded with “mother.” Pilot studies indicated
that the HC subjects tended to choose the nonnoun
before any experimental manipulations were introduced.
Just as in T maze research the subject is
trained away from his position preference, we attempted
to operantly condition the high creatives to
choose the nonpreferred noun.
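
The response contingency described in this section can be summarized as a small decision rule. The sketch below is a hypothetical restatement, not the experimenters' own procedure sheet: the class, field, and function names are invented for illustration, and the only data used are the FATHER/WHITE example given in the text.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class WordOption:
    word: str
    is_noun: bool
    common_associate: str
    novel_associate: Optional[str] = None  # only needed for nouns on critical items


def experimenter_response(choice, condition, item_kind):
    """Return the association the experimenter speaks after a choice.

    condition: "experimental" or "control"
    item_kind: "free_operant", "filler", or "critical"
    """
    # Novel associations follow only noun choices made by experimental
    # subjects on critical items; every other case gets a common associate.
    if condition == "experimental" and item_kind == "critical" and choice.is_noun:
        if choice.novel_associate is None:
            raise ValueError("critical noun needs a novel associate")
        return choice.novel_associate
    return choice.common_associate


father = WordOption("FATHER", True, "mother", "egg beater")
white = WordOption("WHITE", False, "black")

print(experimenter_response(father, "experimental", "critical"))
print(experimenter_response(white, "experimental", "critical"))
print(experimenter_response(father, "control", "critical"))
```

Writing the rule this way makes the design explicit: the two conditions differ in a single cell, the noun choice on a critical item, so any divergence in noun choosing between experimental and control groups can be attributed to that cell.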

After the completion of the series of 180 cards the
experimenter asked the subject to try to recall
the words on the cards. An interview with the subject
concerning the intent of the experiment followed
the testing session.


RESULTS 


The interview following the testing session
revealed that none of the subjects had perceived
or suspected the intent of the experimenter.
The majority of the subjects reported
an awareness of the difference between novel
and common associations but none were conscious
of their contingent relationship to the
"nounness" of the stimulus words.


For purposes of analysis, the 180 items were
divided into four trials of 40 items each. The
20 filler items were disregarded. Figure 1 presents
the mean number of nouns chosen
per trial by the High Creative-Experimental
(HCE) and the Low Creative-Experimental
(LCE) groups. Table 2 contains the repeated
measurements analysis of variance (Edwards,
1951) by which the HCE and LCE group performances
were compared.
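
The division of the series into trials can be sketched in a few lines. This is an illustration under an explicit assumption: the fillers are taken to occupy card positions 9, 18, ..., 180 (one reading of "every ninth card"), and the remaining 160 cards are cut into consecutive blocks of 40. The function name and the input format (one True/False noun-choice record per card, in presentation order) are hypothetical.

```python
def nouns_per_trial(noun_chosen, filler_every=9, trial_size=40):
    """Count noun choices in consecutive trials after dropping filler cards.

    noun_chosen: list of booleans, one per card in presentation order.
    """
    # Drop every ninth card (assumed filler positions), keep the rest in order.
    scored = [chose_noun
              for position, chose_noun in enumerate(noun_chosen, start=1)
              if position % filler_every != 0]
    # Sum noun choices within each consecutive block of trial_size cards.
    return [sum(scored[start:start + trial_size])
            for start in range(0, len(scored), trial_size)]


# A subject who always picks the noun (filler entries are ignored anyway).
always_noun = [True] * 180
print(nouns_per_trial(always_noun))
```

With 180 cards and 20 fillers removed, the function returns four counts, one per trial, which are the per-subject scores that the group means and the repeated measurements analysis are based on.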


[Figure 1: mean number of nouns chosen (vertical axis) plotted against trials (horizontal axis) for the High Creative Experimental and Low Creative Experimental groups.]

FIG. 1. Mean number of nouns chosen per trial by
the HCE and LCE groups.


Neither of the main effects was significant.
The Trials X Creativity interaction was significant
at the .001 level. This interaction suggests
a differential effect on the performance of the HCE
and LCE groups over trials.

The prediction that the HCE group would
show an increase in the number of nouns they
chose was supported by a significant correlated
t value obtained in comparing the HCE
means on Trials 1 and 4 (t = 2.37, df = 14,
p < .05). However, while the number of
nouns chosen by the HCE group significantly
increased, the number chosen by the LCE
group significantly decreased.


TABLE 2

REPEATED MEASUREMENTS ANALYSIS OF VARIANCE
OF HCE AND LCE GROUPS

Source                                      df      MS       F
Creativity                                   1      .00      .00
Between subjects in same group              28    46.51
Total between subjects                      29
Trials                                       3     7.36      .48
Creativity X Trials                          3    96.61     6.44*
Interaction: Pooled subjects X Trials       84    15.27
Total within subjects                       90

* p < .001.


The difference between the HCE and LCE 
means on Trial 1 was large enough to suggest 
that the number of nouns chosen on subse- 
quent trials might have been affected in some 
way by this original difference. An analysis 
of variance comparing the performance of 10 
subjects from each of the HCE and LCE 
groups matched in terms of the number of 
nouns chosen on Trial 1 yielded a Creativity 
X Trials interaction term significant at the 
.01 level. This significant interaction suggests 
that the effects observed in the HCE and LCE 
groups cannot be accounted for in terms of 
the large initial difference in group means on 
Trial 1. 

Figure 2 presents the mean number of 
nouns chosen per trial by the High Creative- 
Control (HCC) and Low Creative-Control 
(LCC) groups. Table 3 contains the repeated 
measurements analysis of variance by which 
the HCC and LCC groups were compared. 


[Figure 2: mean number of nouns chosen (vertical axis) plotted against trials (horizontal axis) for the High Creative Control and Low Creative Control groups.]

FIG. 2. Mean number of nouns chosen per trial by
the High Creative-Control and Low Creative-Control
groups.


TABLE 3

REPEATED MEASUREMENTS ANALYSIS OF VARIANCE
OF HCC AND LCC GROUPS

Source                                      df      MS       F
Creativity                                   1     1.01      .02
Between subjects in same group              28    51.66
Total between subjects                      29
Trials                                       3    10.64      .86
Creativity X Trials                          3    11.70      .95
Interaction: Pooled subjects X Trials       84    12.36
Total within subjects                       90


As can be seen none of these differences is 
significant. The lack of significance supports 
the interpretation that the differences ob- 
served in the experimental conditions were 
due to the manipulation of the associative 
novelty of the stimuli. 
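
The F ratios in Table 3 can be checked directly from its mean squares. The sketch below assumes the usual arrangement for a repeated measurements design of this kind: the between-subjects effect (Creativity) is tested against the mean square for subjects within groups, and the within-subjects effects (Trials, Creativity X Trials) are tested against the pooled subjects X Trials mean square. The variable and function names are hypothetical; the numbers are the mean squares printed in Table 3.

```python
MS_BETWEEN_ERROR = 51.66  # Between subjects in same group
MS_WITHIN_ERROR = 12.36   # Interaction: Pooled subjects X Trials

# Each effect is paired with the error term assumed to test it.
effects = {
    "Creativity": (1.01, MS_BETWEEN_ERROR),
    "Trials": (10.64, MS_WITHIN_ERROR),
    "Creativity X Trials": (11.70, MS_WITHIN_ERROR),
}


def f_ratio(ms_effect, ms_error):
    """F is the effect mean square divided by its error mean square."""
    return ms_effect / ms_error


f_values = {name: round(f_ratio(ms, err), 2)
            for name, (ms, err) in effects.items()}
print(f_values)
```

All three ratios fall below 1, which is why none of the control-group effects approaches significance.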


Recall Data 


Table 4 contains a comparison of the per- 
centage of nouns recalled by the four groups 
after the testing session. The t ratio for differences
in percentage of nouns recalled by
the HCE and LCE groups was 3.41 (df = 28,
p < .01). The t ratio obtained in comparing
the LCC and HCC recall values was not sig- 
nificant (t = .37, df = 28). In view of the 
fact that the HCE group pronounced more 
nouns than the LCE group in the period of 
time immediately preceding the recall session 
this finding is not completely unexpected. 


TABLE 4
PERCENTAGE OF NOUNS RECALLED BY THE FOUR GROUPS

HCE     LCE     HCC     LCC
56.1    42.5    55.5    56.4


DISCUSSION


We have apparently obtained results which
are supportive to the hypothesis that the
highly creative individual has a need for associative
novelty. Let us examine this conclusion
with some care. To begin with, while
the results are most easily explained in terms
of reinforcement brought about by reduction
of the need for associative novelty, some other
possibilities also suggest themselves. It could
be argued that the highly creative subjects
were not so much responding positively to 
novelty as they were responding negatively to 
banality. That is, with all due respect for our 
intentions, the behavior of these subjects may 
have been more controlled by the aversiveness 
of the highly probable association than by the 
novelty of the improbable association. This is 
almost analogous to saying that the rat does 
not approach the alley that has the food, but 
is actually avoiding the foodless alley. The 
analogy breaks down when we realize that in 
the case of the rat we can define the need 
relatively precisely in terms of antecedent 
conditions. In the case of the highly crea- 
tive subject we cannot do so. This argument 
suggests that the next step in examining 
this question should consist of the institution 
of some antecedent conditions which would 
arouse the need for associative novelty. We 
could then test to see what are the effects of 
the interaction of varying levels of depriva- 
tion and levels of creativity. It would also be 
of interest to increase the number of trials so 
as to carry the performance to an asymptote 
and then note differential rates of extinction.
The shortage of associational norms made this 
impossible in this study. 

The noun choosing experimental subject
was always presented with a novel associative
stimulus. However, except for a statement
of probability of S-R association we
would be hard put to define precisely what
we mean by novelty. Within a given level of
probability the associative stimuli vary in
complexity, humor, meaning, rhyme, incongruity,
etc. Subsequent research might well
be aimed at studying the dimension of associative
novelty independently of the dimension
of meaning, and vice versa.

Of considerable interest is the behavior of
the low creative experimental group which
shows a marked and significant tendency to
avoid the improbable associations and/or approach
the highly probable associations. Here
again some further analytical research is
needed to separate out the effect of these two
variables. However, if we can for the moment
argue in a mildly tautological fashion it is
clear why low creative individuals become
that way. The individual who eschews unusual
ideas and/or prefers the highly probable
is not likely to develop highly creative
solutions to problems.


HYPNOTIZABILITY AS A FUNCTION OF
NONHYPNOTIC EXPERIENCES 1


ARVID ÅS

The relationship between various subjective experiences and susceptibility to
hypnosis was studied in 2 samples of female college students, totaling 102. The 
subjective experiences were registered by the Experience Inventory, a question- 
naire developed earlier for this purpose. Hypnotizability was determined by 
administering objective hypnotic scales individually. Correlations between hyp- 
notizability and the total Experience Inventory score as well as a composite 
score of selected items were significant in both samples, but the latter failed in 
predicting hypnotizability better than the total score. Items of subjective experiences
significantly related to hypnotizability in the total sample were analyzed
in terms of the personality dimensions implied.


The search for personality correlates of 
hypnotic susceptibility is as old as the his- 
tory of scientific hypnosis itself. Yet the re- 
sults of these efforts have been astonishingly 
meager. In his review, Hull (1933) charac- 
terized the experimental attempts to discover 
relationships between suggestibility and char- 
acter traits as indecisive. Twenty to 25 years 
later the reviewers have little more to add. 
Weitzenhoffer (1953) denotes the area of per- 
sonality traits and suggestibility as one of the 
most ambiguous. He does cite some evidence 
suggesting positive attitudes as favoring sug- 
gestibility, and states that needs and motiva- 
tion may be related to suggestibility in vari- 
ous ways. Pattie (1956) declares that today 
we hardly know more about what kind of per- 
son is susceptible to hypnosis than did Abbé 
Faria in 1819 when he thought that the “li- 
quidity of the blood” had something to do 
with it. He concludes: 


Susceptibility is not a sign of low intelligence; there 
is probably a sex difference in favor of the females 
but so small as to be of merely academic interest;
and older persons are not as susceptible as younger 
ones. Beyond these facts there is no certainty, one 
man’s findings not being confirmed by the next in- 
vestigator. 


1 This study was made possible by a Fellowship
Grant from the Foundations’ Fund for Research in 
Psychiatry. Appreciation is expressed to E. R. Hil- 
gard for aid and advice in the planning and per- 
formance of the study and for letting the gathering 
of data be part of the continuous research in psychodynamics
and hypnosis at the Laboratory of Human
Development, Stanford University. 


What strikes the observer of subjects in
hypnosis is, of course, the amazing and often
dramatic behavior they display, apparently in
response to words only. However, persons who
themselves have been hypnotized or who have
carefully interviewed other persons after they
have been hypnotized, know that from the
subject's point of view hypnosis is more than
anything a remarkable experience. Although
traits, attitudes, and needs, as well as intelligence,
sex, and age, are important aspects of
personality, it has puzzled the writer for some
time that students of hypnotic susceptibility
have not been more interested in the subjective
experiences of the hypnotized subject,
especially his experiences outside hypnosis
that may be related to hypnosis and hypnotizability.
A more phenomenological approach
has proven valuable in other areas of psychology,
and ought to be attempted in the field
of hypnosis as well.

A first step in this general research was the
construction of an instrument for the registration
of nonhypnotic experiences that presumably
would be related to hypnotizability. The
Experience Inventory, developed for this purpose,
has been described in a previous article
(Ås, O'Hara, & Munger, 1962). An article
by Shor (1960) on the frequencies of naturally
occurring “hypnotic-like” experiences
appeared during the preparation of the Inventory.
Since many of his items were very
much in line with some of the thinking behind
the present study, 18 of Shor's items as
well as his instructions were incorporated in
the Experience Inventory.

Quite recently Shor, Orne, and O’Connell
(1962) reported the relationships between
their Personal Experiences Questionnaire and
hypnotizability. In a mixed-sex sample of 29
subjects, frequency and intensity scores of responses
to the 129-item questionnaire were
related to scores on the Stanford Hypnotic
Susceptibility Scale (Weitzenhoffer & Hilgard,
1959). While the frequency scores did
not give positive results, 45 of the items
scored for intensity of experience gave significant
phi coefficients with hypnotizability.
A composite score of these items in a new
sample of 25 subjects correlated significantly
with the Stanford scale and an overall rating
of hypnotic depth (r = .46 and r = .44).


METHOD


Problem

The purpose of the present study is twofold: to
determine the value of the Experience Inventory or
components of it as an instrument predictive of hypnotizability,
and to analyze the relationships between
hypnotizability and the experiences measured by the
Experience Inventory with a hope of reaching some
understanding of the aspects of personality involved.


Subjects

The subjects were 102 female college students at
Stanford University. They were members of classes
in introductory psychology and had a wide variety
of undergraduate majors. They volunteered to serve
as subjects in experiments involving hypnosis and
received some course credit in return. The subjects
were recruited from two classes in two consecutive
quarters. Fifty were studied in the Fall of 1960 and
52 in the Winter of 1961.


Procedure

The first part of the present investigation was
planned as a replication study. First the relationships
between the Experience Inventory responses and hypnotizability
were determined in the sample of subjects
studied in the Fall of 1960. On the basis of
these results a prediction was made for the sample
of subjects to be studied in the Winter of 1961, on
which identical procedures would be applied. In the
second part of the study the two samples were combined
and a tentative analysis made of the possible
personality aspects implied in the subjective experiences
related to hypnotizability.

Experience Inventory. The Experience Inventory
(Ås et al., 1962) is a questionnaire consisting of 60
items. The items are assumed to give information on
nine tentatively selected category variables, but are
presented randomly to the subjects. The categories
are:

A. Altered state: fading of generalized reality orientation
B. Tolerance for logical inconsistencies
C. Role taking
D. Dissociation: exclusion of distracting stimuli
E. Willingness to relinquish ego control
F. Tolerance for regressive experiences
G. Constructive use of regression
H. Peak experiences
I. Basic trust: interpersonal relations

As examples, some of the items are listed below.
The preceding letter indicates the category:

(A) Have you ever had the experience of doing some task in the
    middle of the night (e.g., jotting [...] phone call) with no memory
    the next morning of having done so?                          Yes  No  ?
(B) Do you think there are events and things which cannot be
    ultimately explained logically?                              Yes  No  ?
(C) Do you feel that regardless of what happens around you, you
    stay and feel pretty much the same?                          Yes  No  ?
(D) Have you ever had the feeling that a part of your body was
    not really a part of you?                                    Yes  No  ?
(E) Would you like to find a great purpose, goal, or leader in life,
    to which you could feel dedicated?                           Yes  No  ?
(F) Do you enjoy “wild” parties?                                 Yes  No  ?
(G) Have you ever found a sort of fulfillment of yourself in creating
    something, as in crafts, science, writing, art, or music?    Yes  No  ?
(H) Have you ever had the experience of being caught up by
    music or dancing so that you became enraptured by it and
    had it live and express itself [...] during it?              Yes  No  ?
(I) Are your feelings toward one or both of your parents usually
    negative?                                                    Yes  No  ?

The subjects were asked to answer by circling the
appropriate answer. The single items were scored by
assigning the value one to an item if it was answered
in the affirmative of the category variable
assumed to be tapped by the item, and zero if answered
in the negative. Question marks were given
the value of one-half.


TABLE 1

CORRELATIONS BETWEEN EXPERIENCE INVENTORY
SCORES AND HYPNOTIZABILITY

                            Fall 1960     Winter 1961
                            (N = 50)      (N = 52)
Total score                 96**          [...]
16-item score (Fall)        .68**         .29*
17-item score (Winter)      [...]         [...]

Note. Pearson r.
* Significant beyond 5% level.
** Significant beyond 1% level.


The Experience Inventory was administered to the
total classes of students of which the subjects were
members, in one of the first class hours in the quarter,
prior to the hypnotic experiments. The scoring
also took place prior to and independent of the
hypnosis experiments. 

Odd-even reliability coefficients (Pearson r) were 
computed for the 60-item scale for the total class 
samples of 176 females and 295 males, giving the 
values of r = .74 and r = .76, respectively. 
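
The scoring rule and the odd-even reliability check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`score_item`, `odd_even_reliability`), the answer strings, and the per-item keyed direction argument are hypothetical conveniences, not the author's materials. No Spearman-Brown correction is applied, because the text reports only the Pearson r between the two halves.

```python
from math import sqrt


def score_item(answer: str, keyed: str = "Yes") -> float:
    """Score one answer: 1 if it matches the keyed direction of the
    category variable, 0 if it is the opposite answer, 0.5 for '?'.
    The `keyed` argument is an assumption made for this sketch."""
    if answer == "?":
        return 0.5
    return 1.0 if answer == keyed else 0.0


def pearson_r(x, y):
    """Plain Pearson product-moment correlation (0.0 if a variance is zero)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def odd_even_reliability(score_matrix):
    """score_matrix holds one list of item scores per subject.
    Items 1, 3, 5, ... form one half; items 2, 4, 6, ... the other."""
    odd_half = [sum(row[0::2]) for row in score_matrix]
    even_half = [sum(row[1::2]) for row in score_matrix]
    return pearson_r(odd_half, even_half)
```

An item such as (C) above would presumably be keyed "No", so `score_item("No", keyed="No")` gives 1.0, while a circled question mark always gives 0.5.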

Hypnosis. The subjects participated in the hyp- 
notic experiments individually in two sessions on two 
consecutive days. Different experimenters served as 
hypnotists² on the two days, without previous
knowledge about the subjects or their responses on 
the Experience Inventory. 

On the first day Form A of the Stanford Hypnotic 
Susceptibility Scale (Weitzenhoffer & Hilgard, 1959) 
was used. On the second day the Consolidated Scale 
of Hypnotic Responsiveness³ was administered. This
scale has some motor items (such as hand lowering, 
arm rigidity, and arm immobilization) in common 
with Form B of the Stanford scale, but also contains 
more "cognitive" items, such as dreams, age regres- 
sion, and hallucinations. The score varies from 0 to 
12, the value 1 being assigned to each item passed. 
The Consolidated scale correlates r = .83 with Form
A of the Stanford scale. 

A preliminary study (As, 1960) showed that the 
experience categories correlated somewhat better with 
the Consolidated scale than with a combination score 
based on the Stanford scale and the Consolidated 
scale. A better balance of motor and cognitive items 
in the latter also makes it more meaningful as a hyp- 
nosis criterion to be related to the rather cognitive 
domain of subjective experiences. In the present 
study, therefore, the score on the Consolidated scale 
is the measure of hypnotizability. 


RESULTS 


Experience Inventory as a Predictor of Hypnotizability


Fall sample. The first measure selected for correlation with hypnotizability on
the first sample of 50 subjects was the sum of scores of the 60 Experience
Inventory items. The correlation between the total score and the Consolidated
scale was r = .36, which is significant at the 1% level (Table 1). Thus, from
the beginning, a relationship was indicated.

To get a more detailed account of this relationship, biserial correlations were
computed between the single items⁴ of the Experience Inventory and the
Consolidated scale. The correlation coefficients are represented by the white
bars in Figure 1.⁵ There are 20 coefficients around zero (between -.10 and
.10). Of the remaining 36 coefficients, 27 or 75% are positive and 9 or 25% are
negative.

A more accurate estimation of the relationship between hypnotizability and
subjective experiences may be derived by inspecting the significance levels of
the biserial correlations. Of the 27 positive correlations (r_bis > .10), 12
were significant at or beyond the 5% level, and an additional 4 between the 5%
and 10% levels. Of the 9 negative correlations, only 1 was significant below
the 5% level, and ? others just below the 10% level.

The white bars in Figure 1 show that a majority of the larger positive
correlations in the Fall sample are found in Category A ("Altered state") and
Category C ("Role taking"), with a couple of such correlations in Category E
("Relinquishing ego control") and one here and there in other categories.

Predictions for the Winter sample. It was predicted that the total score on the
Experience Inventory would be significantly correlated with hypnotizability in
the sample studied during the Winter of 1961, because this was a direct
replication of the Fall procedures. As seen in Figure 1, quite a few of the
single item correlations in the Fall sample were zero or negative. In an effort
to develop a measure that would capture most of the items contributing to the
overall positive relation to hypnotizability without being hampered by the
negative or noncontributing items, the scores on the 16 items with positive
correlations beyond the 10% level of significance were added together in a
composite score. This 16-item composite score correlated r = .68 (Pearson) with
hypnotizability in the Fall sample. Because such a composite score would
capitalize on the chance errors involved in the distribution of correlations in
the Fall sample, it was expected that its correlation with the criterion would
be somewhat lower in a new sample. However, the prediction was that the
correlation between the 16-item composite score and hypnotizability would
remain significant in the Winter sample as well, and would exceed the
correlation from the total scale.


² Hypnotists were A. M. Weitzenhoffer, Mary R. Roberts, D. M. Sjoberg, Jr., and
the author.

³ This is an experimental scale with objectively defined scoring criteria. It
was designed by A. M. Weitzenhoffer, and is currently in use at the Stanford
Laboratory of Human Development.

⁴ For correlational purposes items on the Experience Inventory answered with
the question mark were counted as +.

⁵ An earlier analysis (As, O'Hara, & Munger, 1962) showed that the answer
percentages (percentage of subjects answering "Yes" or "No" to a question) in a
large sample of subjects were over 90 or below 10 for 4 out of the 60
Experience Inventory items. With practically no discriminating ability these
items were considered unfit for biserial correlation, and hence excluded.


FIG. 1. Biserial correlations between single items on Experience Inventory and
hypnotizability (Females, 1960-61). Panels: Altered state; Tolerance for
logical inconsistencies; Tolerance for regressive experiences; Relinquishing
ego control; Role-taking; Constructive use of regression; Dissociation. White
bars: Fall sample, N = 50; dotted bars: Winter sample, N = 52; black bars:
total sample, N = 102.

Winter sample. This sample consisted of 52
subjects, and was therefore also comparable 
in size with the Fall sample. 

As shown in Table 1 the correlation be- 
tween the total Experience Inventory score 
and the Consolidated scale was .31, which is 
significant at the 1% level. Thus, the first 
prediction was verified. 

Table 1 further shows that the correlation between the 16-item composite
experience score and hypnotizability in the Winter sample was r = .29 with a
corresponding p = .02. Thus, the second prediction was not verified, because
although significant, the correlation was no higher than that of the total
score. The drop in correlation from .68 to .29 suggested that the chance errors
in the sampling of the single experience items, on which the composite score
capitalized, were fairly large.
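
The probability values quoted for the cross-validated correlations can be roughly checked with a short sketch. The source does not say which test was used; the code below assumes a one-tailed test and the Fisher z approximation, and the function name `one_tailed_p` is hypothetical.

```python
from math import atanh, sqrt
from statistics import NormalDist


def one_tailed_p(r: float, n: int) -> float:
    """Approximate one-tailed p for a Pearson r from n subjects,
    using the Fisher z transformation (an assumption of this sketch,
    not necessarily the procedure used in the study)."""
    z = atanh(r) * sqrt(n - 3)
    return 1.0 - NormalDist().cdf(z)


# 16-item score in the Winter sample: r = .29, N = 52
p_winter = one_tailed_p(0.29, 52)   # roughly .02
# 17-item score in the Fall sample: r = .27, N = 50
p_fall = one_tailed_p(0.27, 50)     # roughly .03
```

Under these assumptions the approximation comes out close to the reported p = .02 and p = .03, which suggests, but does not establish, that one-tailed probabilities were reported.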

The biserial correlations with hypnotizabil- 
ity for the single Experience Inventory items 
were computed for the Winter sample as well. 
As a check on the composite score prediction 
reported above, the 17 items giving biserial 
correlations with p < .10 in the Winter sam- 
ple were added into a composite score, which 
correlated r = .59 with the Consolidated scale 
in this sample (Table 1). This 17-item com- 
posite score was then correlated with hypno- 
tizability in the Fall sample, and the result 
was r = .27 (p = .03). Thus, the correlation 
of this new composite score also remained sig- 
nificant when computed on a new sample, but 
the drop in correlation from the one sample 
to the other was again clear and very similar 
to the one found for the 16-item score used 
in the original prediction. In neither case did 
the selected shorter scale predict better than 
the total scale. 
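
The cross-validation logic used above (select items in one sample, sum them, and correlate the sum with the criterion in the other sample) can be illustrated with a minimal sketch. The names `select_items`, `composite`, and `cross_validate` are hypothetical, and a plain correlation cutoff `min_r` stands in for the significance-level criterion used in the study; the data in the usage note are invented purely to exercise the code.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation; returns 0.0 when either variable is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def select_items(items, criterion, min_r):
    """Indices of items whose correlation with the criterion exceeds min_r.
    `items` holds one list of item scores per subject."""
    n_items = len(items[0])
    return [j for j in range(n_items)
            if pearson_r([row[j] for row in items], criterion) > min_r]


def composite(items, selected):
    """Sum of the selected item scores for each subject."""
    return [sum(row[j] for j in selected) for row in items]


def cross_validate(deriv_items, deriv_crit, new_items, new_crit, min_r):
    """Select items in the derivation sample, then correlate the resulting
    composite with the criterion in both samples."""
    selected = select_items(deriv_items, deriv_crit, min_r)
    r_deriv = pearson_r(composite(deriv_items, selected), deriv_crit)
    r_new = pearson_r(composite(new_items, selected), new_crit)
    return selected, r_deriv, r_new
```

Because the items are chosen for their high correlations in the derivation sample, the composite correlation there is inflated by chance, and some drop in the new sample is to be expected, as the text notes.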

The biserial correlations with hypnotizabil- 
ity for the single experience items in the 
Winter sample are represented by the dotted 
bars in Figure 1. The overall picture of posi- 
tive correlations is about the same as for the 
Fall sample represented by the white bars. A 
closer inspection also reveals some differences. 
For instance, in the “Altered state" category 
there are some items showing high correla- 
tions in the Fall sample and very low ones in 
the Winter sample. In the category “Basic 
trust" the positive correlations in the Winter 
sample clearly exceed those in the Fall sample. 
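
For readers unfamiliar with the statistic, the biserial correlation between a dichotomized item and the hypnotizability score can be sketched as follows. This uses the standard textbook formula, r_bis = ((M1 - M0) / s) * (pq / y), where y is the ordinate of the unit normal curve at the point dividing the proportions p and q; it is offered as an illustration, not as the author's computation. The helper `dichotomize`, which treats a question-mark score of one-half as a positive answer, is an assumed reading of the footnote on question marks.

```python
from math import sqrt
from statistics import NormalDist


def dichotomize(score: float) -> int:
    """Map item scores 1, 0.5, 0 to 1, 1, 0 (question marks counted as
    positive; this reading of the footnote is an assumption)."""
    return 1 if score >= 0.5 else 0


def biserial_r(item, criterion):
    """Biserial correlation between a 0/1 item and a continuous criterion."""
    n = len(item)
    ones = [c for i, c in zip(item, criterion) if i == 1]
    zeros = [c for i, c in zip(item, criterion) if i == 0]
    if not ones or not zeros:
        raise ValueError("item must contain both 0 and 1 answers")
    p = len(ones) / n
    q = 1.0 - p
    mean = sum(criterion) / n
    sd = sqrt(sum((c - mean) ** 2 for c in criterion) / n)  # population SD
    m1 = sum(ones) / len(ones)
    m0 = sum(zeros) / len(zeros)
    # Ordinate of the standard normal density at the p/q dividing point.
    y = NormalDist().pdf(NormalDist().inv_cdf(p))
    return (m1 - m0) / sd * (p * q / y)
```

Items answered almost uniformly one way give p near 0 or 1, where the estimate becomes unstable, which is in line with the exclusion of items with answer percentages over 90 or below 10.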


DISCUSSION 


The fact that the total score of the Experience Inventory is significantly
related to the hypnotizability measures in both samples, and essentially with
the same magnitude, suggests that some relationship between the two sets of
data has been found. As to the nature of this relationship, however, the rather
large drops in the cross-correlations of the composite scores, based on the
highest correlating items in one of the samples, suggest that only to some
degree is the common correlation with hypnotizability in the two samples
carried by a restricted number of items which separately show significant
correlation with the criterion in both samples. The number of such items in the
present case is only five. To quite some degree the common correlation is
caused by a series of items, the separate correlations of which vary quite a
bit from sample to sample, so that they may show up as significant in one case
but not in another. In spite of this sample variation, they are to a smaller or
larger degree essentially positively correlated with hypnotizability, and added
together, their contribution to the overall correlation common for the two
samples counts. As evidence for this one may consider the correlations of the
residual items not included in the 16-item composite score which was
tentatively chosen as one of the prediction statistics. While the composite
score of these residual items correlates only r = .08 in the Fall sample, the
correlation in the Winter sample is r = .26, that is, almost the same
correlation as for the 16-item score. For the 17-item Winter sample score the
corresponding correlations are r = .08 and r = .34. This also shows that as an
attempt to develop a measure that would capture the bulk of the positive
relationship between the total experience score and hypnotizability, the choice
of the 16-item score was not particularly successful. In future research,
attempts will be made to develop measures that are more successful in this
respect.⁶

On the other hand, it is also very possible that the nature of the relationship
between hypnotizability and subjective experiences is such that a score of a
restricted number of items always predictive of hypnosis cannot be found. Thus,
the hypothesis that suggests itself at the present level of research is that
for different subjects there are somewhat different areas of subjective
experiences that are related to hypnotizability. In other words, there would
always be a relationship between hypnotizability and the broader range of
subjective experiences as represented by the Experience Inventory, but from
group to group there would be differences in the subareas of experiences most
saliently related to hypnotizability.


⁶ An analysis of a new sample of 50 male students has already been made. The
correlation between the total Experience Inventory score and the Consolidated
scale was r = .47, giving further confirmation of the earlier results. A
composite score of 23 items with biserial correlations significant beyond the
10% level in the sample of 102 females used in the present study correlated .51
in this sample and .35 in the new male sample. A composite score of the 37
items that separately gave positive biserial correlations in the present sample
(N = 102, composite r = .49) correlated .48 in the new sample. The correlation
of this latter composite score does not drop from one sample to the other, but
it still does not predict hypnotizability better than the total score.


Hypnotizability and Personality 


To arrive at a basis for a tentative analysis 
of the personality aspects implied in the sub- 
jective experiences related to hypnotizability, 
the two samples described above were com-
bined (N = 102) and the biserial correlations
between the single experience items and the 
Consolidated scale were computed. These cor- 
relations are represented by the black bars in 
Figure 1. Of the 41 positive correlations, 14 
are significant at or beyond the 5% level, 
with 5 of them also significant beyond the 
1% level. In addition, 9 correlations have p 
values between .05 and .10. In the present 
exploratory phase of the research these 23 
items were selected as the basis for the tenta- 
tive analysis in terms of personality aspects 
to be given below. 

As seen in Table 2, 5 of the 23 items sig- 
nificantly related to hypnotizability in the to- 
tal sample were also significant in the two 
subsamples of 50 and 52 subjects, respec-
tively. Twelve items were significant in one
sample, and positive (from .05 to .25) with- 
out reaching significance in the other sample; 
seven items had a relatively large correlation
in one sample and around zero correlation 
(from —.01 to .05) in the other sample. 

In Table 2 the 23 significant items have 
been grouped according to the psychological 
characteristics which at face value seemed to 
be most saliently represented. These charac- 
teristics are similar to some of the categories 
in the Experience Inventory, but also to some 
extent cut across and combine these category
variables.

In the first group of nine items listed in Table 2 the first three items may be
said to reflect social influencibility.⁷ The next two items have to do with
experiencing unreal things as real, that is, having the imagination take on
reality character. The last three items cover experiences of mental absorption
to a degree where one becomes oblivious to the surroundings and where the state
of consciousness becomes temporarily altered or takes on a new reality in
itself. These have been called peak experiences. All the experiences
represented by the items in the first group have a role taking aspect to them.
Role absorption is proposed as a better term for what these experiences have in
common.


⁷ It is of interest to note that in a recent factor analysis, a test of social
influencibility was the only test among a series of pencil-paper personality
tests that showed some loading on the factor defined by the hypnotic items
(Moore, 1961).


The second large group of significant items 
represents different kinds of unusual experi- 
ences that the subjects have had. Some of 
these experiences imply amnesia for activi-
ties completed in sleep or during absentmind-
edness. Several experiences in this group seem
to imply a certain dissociated mental state 
and in some cases a tolerance of phenomena 
known to be logically inconsistent. A certain 
antirationality is also represented. It is diffi- 
cult to find a short term covering what these 
items have in common other than represent- 
ing experiences of incidents that are unusual 
or different and where the consciousness is 
somehow more or less altered. 

Three items from the category Basic trust 
are listed as a smaller group, representing 
positive feelings toward one or both parents, 
the wish to experience closeness with people, 
and the opinion of oneself as an optimist 
rather than a pessimist. 

Two remaining items represent the wish to 
find a great purpose, goal, or leader in one’s 
life to which one could feel dedicated, and 
admission that there have been persons in 
one’s life outside the family who have influ- 
enced one a great deal or who have had a 
certain “grip” on oneself. These items could 
perhaps be said to reflect a streak of disciple- 
ship. 

Three items mentioned above in connection with other groups could possibly
constitute a group by themselves. The willingness to be influenced by others,
the wish for the purpose, goal, or leader to feel dedicated to, and the wish to
get beyond the world of logic and reason and experience something new and
different have in common a willingness to relinquish a certain ego control for
the sake of some new experience.

A previous article (As et al., 1962) reported a factor analysis of a
correlation matrix of 24 items representative of the Experience Inventory,
selected without regard or knowledge about relationships to hypnotizability.
Two main factors were found, role absorption and the experience and tolerance
of unusual experiences, the latter factor also representing some regressive
elements. With less secure evidence the additional factors of discipleship and
social influencibility were mentioned. Only 10 of the 24 items, representing
seven categories, ended up in the present series of 23 items showing
significant correlations with hypnotizability. It is therefore all the more
remarkable that we find essentially the same psychological dimensions
represented. This also shows that the theoretical notions underlying the items
making up the earlier factor matrix were useful guideposts in the search for
empirical correlates of hypnotizability.

The main feature in the first group of sig- 
nificant items was role taking. However, a 


TABLE 2 
EXPERIENCE INVENTORY ITEMS CORRELATING SIGNIFICANTLY WITH HYPNOTIZABILITY 
Item content (a)          Total sample (N = 102)    Fall (N = 50)    Winter (N = 52)
Role absorption 
Experiencing change in self-feeling with change in social situation ;29** .41** .07 
Not feeling of being the same regardless what happens around Tif bed .36** PL 
Not against being influenced by others 24** org 48 
Having imagined something so it appeared real? .20** a 20% 
Having told a story with elaborations to make it sound better, 
then having the elaborations seem as real as the actual incidents E .60*** .38"* 
Having recollected a past experience with reality-like clarity and 
vitality? pois —.01 E 
Having been completely immersed in nature or in art with 
sweeping feelings of awe, inspiration, and grandeur> And ipd 29% 
Having been so absorbed in reading that one “lived” the story 
almost like real 19% .08 .26* 
Having experienced "becoming" the character when acting in a 
play Mitre 34" ATP 
Experience and tolerance of altered states and unusual incidents 
Having carried on real conversation with another person while 
asleep? 519% 2s —.01 
Having completed some task during the night with no memory in 
the morning of having done so? 23% .25* .20 
Having been so absentminded as to complete tasks without 
awareness? AT* .00 32 
Having gone into a benumbed or serene state of consciousness by 
focusing hard at something? Apre PY haid 33 
Having experienced the walls or ceiling as changing, although 
knowing this impossible 24** hie 46 
Having felt the “mind” going apart from the body? 109 1v .03 om 
Having experienced the feeling of body parts moving involuntarily .19* Aes .03 
Wanting new and different experiences beyond logic and reason 23T 16 230% 
Basic trust 
Wanting closeness with people .19* .00 38" 
Having positive feelings toward parents .20* .05 32" 
Regarding oneself as optimist .19* .07 Ej 
A streak of discipleship 
Wanting to find a great purpose, goal, or leader to which one 
could feel dedicated 129 *tf TEES .20 
Having had persons in one's life (outside family) influence one 
a great deal .19* .26* 18 
Thinking oneself to be regarded by others as humorous .18* .00 32 


Note. Biserial correlations.
a Not verbatim.
b Incorporated from Shor (1960).
* .10 > p > .05.
** .05 ≥ p > .01.
*** p < .01.
closer study of these items showed that there is a greater degree of
involvement and absorption in these experiences than is implied in the concept
of role taking as discussed by Sarbin (1950). There is a fading of the
differentiation between self and role, as it were, so that the experience takes
on new reality or results in a perceived alteration of the mental state,
suggestive of the cognition of being in the peak experience, as described by
Maslow (1959). The term role absorption seems to be more indicative of the
psychological processes here involved.

The second large group of items described 
above represented earlier experiences of un- 
usual incidents and states as well as the toler- 
ance for such experiences. Grouped together 
here are items of sleep activity and amnesia
based on Shor's (1959) notion of the altered
state as a result of the fading of the general-
ized reality orientation; items based on Orne's
(1959) concept of trance logic and tolerance 
for logical inconsistencies; and items based 
on the old notion of dissociation. As pointed 
out earlier (As et al., 1962), a closer analy- 
sis of these different concepts shows that also 
on a theoretical level they have very much in 
common. 

That trust and confidence in the environ- 
ment is a characteristic of the hypnotizable 
person has been indicated by several students 
in the field (Hilgard, 1961). 

The streak of discipleship implies a certain 
need for deference (White, 1941) as well as 
parental figure transference and a willingness 
to relinquish a certain ego control (Gill & 
Brenman, 1959). 

The notion of the willingness to relinquish a certain ego control for the sake
of some new experience was derived from Gill and Brenman's (1959) theory of
hypnosis as a regression in the service of the ego. In this connection it is of
interest to note that among the 23 significant items, the items of tolerance
for regressive-aggressive experiences were not found. Thus, it appears that in
the female subjects studied, it is not so much the tolerance for impulsive
regressive acts that is related to hypnotizability. It is more the tolerance
for primary-process colored experiences as exemplified by the absorption
experiences as well as by the experiences of unusual incidents and states. This
kind of regression may be more in line with the kind of primitivization concept
proposed by Meares (1957).

The preceding tentative analysis may be 
summarized as follows: The findings indicate 
that the hypnotizable young college girl has 
had earlier experiences of a changing self with 
changing social situations; of imaginative 
situations taking on reality character; and of 
mental absorption to the degree of feeling the 
state of consciousness change. She is not 
against being influenced by others. Further, 
she has had earlier unusual experiences, in- 
volving amnesia, logical inconsistencies and 
dissociation. As a matter of fact, she wishes 
to get beyond the world of reason and logic 
and experience something new and different. 
She has a confident and trustful attitude to- 
ward other people, she is willing to give up 
some ego control for new experiences, and she 
may carry a longing for dedication to some 
cause greater than herself. 

This description of the hypnotizable person 
makes good sense when seen in relation to the 
hypnotic situation itself. The very agreement 
to let oneself be hypnotized would seem to 
imply a certain trust in the hypnotist and a 
willingness to relinquish or relax control, and 
let oneself be influenced by the hypnotist's 
suggestions. Further, during the induction and 
following it, it is very much a question of 
focusing and becoming absorbed to the ex- 
tent that the mental state alters. The experi- 
ences of a changing self with changing situa- 
tions and of imaginative situations taking on 
reality character are very much implied in 
the hypersuggestibility usually associated with 
hypnosis. 

The fact that the suggested psychological 
dimensions of the hypnotizable person also 
make sense when applied to the hypnotic 
situation points to the possibility that the 
separation between hypnotizability on the one 
hand, and characteristics of the hypnotizable 
person on the other, is artificial. A breakdown
of this traditional barrier would facilitate a conception of hypnosis, not as a
special peculiar thing as often seems to be implied, but as a normal
personality process, special only in the sense that it represents an unusual
combination of situational, personal, and interpersonal factors which
separately or in other configurations are also present in other experiences and
behaviors.


SOME HEDONIC CONSEQUENCES OF THE CONFIRMATION
AND DISCONFIRMATION OF EXPECTANCIES *


ELLIOT ARONSON

University of Minnesota


In a test of the hypothesis that events which disconfirm expectancies will be
perceived as unpleasant, Ss tasted a random sequence of sweet and bitter
solutions. On the basis of certain signals given by the E, they developed
expectancies or hypotheses about whether the next solution would be bitter or
sweet. On trials when the Ss' expectancies were disconfirmed due to incorrect
signals, the solutions were judged to taste more unpleasant. Thus, a bitter
solution was rated more bitter; a sweet solution was rated less sweet. The
results were interpreted in terms of Festinger's theory of cognitive
dissonance.


Recent investigations have suggested that theories of cognitive consistency can
be adapted to account for individual differences in self-concept (Aronson &
Carlsmith, 1962; Deutsch & Solomon, 1959). For example, Aronson and Carlsmith,
using the theory of cognitive dissonance (Festinger, 1957) as a point of
departure, suggested that dissonance occurs most often when an individual's
behavior is at variance with his self-concept, i.e., at variance with his
expectancy concerning his own behavior. The authors reasoned that,
theoretically, behavior which is inconsistent with a self-relevant expectancy
should arouse dissonance; dissonance would be aroused whether the self-concept
involved is positive or negative. For example, the self-concept "I am not
generous" would be dissonant with the cognition "I gave $1,000 to charity." To
test this hypothesis, Aronson and Carlsmith conducted a laboratory experiment
which demonstrated that subjects who expected to perform poorly on a particular
task, but performed well, expressed greater dissonance (discomfort) than
subjects who expected to perform poorly and did perform poorly.

In the above experiment, Aronson and Carlsmith dealt with a specific and
limited kind of expectancy: an expectancy which involved a self-conception
regarding one's ability on a particular task. The present experiment is an
attempt to generalize this prediction to make it applicable to any expectancy.
That is, rather than limiting the prediction to situations involving a person's
disconfirmed expectancy about his own behavior, it is now suggested that
dissonance is aroused whenever any event occurs which disconfirms any strong
expectancy. Thus, the following prediction can be made: If a person firmly
expects a certain event and it does not occur, he will experience dissonance.
His cognition that he expects the event to occur is dissonant with his
cognition that the event did not occur.

Festinger has described dissonance as a 
state of “psychological discomfort." Previous 
experiments testing derivations from his the- 
ory have inferred the existence of dissonance 
from observable manifestations of attempts 
to reduce it. The present experiment was de- 
signed to measure the existence of dissonance 
on a more directly hedonic level. If dissonance 
arises from the disconfirmation of an expect- 
ancy, then a disconfirmed expectancy should 
result in a hedonically negative state. If one 
assumes that this negative hedonic tone gen- 
eralizes to objects in the environment, one 
can make the following prediction: If a per- 
son expects a particular event (X) and in- 
stead, a different event (Y) occurs, he will 
experience dissonance. Consequently, he would 
judge Y to be less pleasant than if he had
had no previous expectancy.
PROCEDURE


In general, the procedure involved: leading the subject to expect that a
solution the subject was about to taste would be bitter (or sweet),
experimentally varying the strength of his expectancy, experimentally
confirming or disconfirming this expectancy, and measuring the subject's
perception of the taste of the solution.

The subjects were 52 college students who were 
paid volunteers. When the subject arrived, the ex- 
perimenter said that he was a graduate student who 
was working for two psychologists simultaneously, 
and that these psychologists were interested in to- 
tally different problems. In order to conserve time 
and money, the experimenter continued, they had 
decided to attack these two problems in the same 
experimental session. Thus, the experiment would 
actually consist of two unrelated experiments which 
could be done at the same time. 

The experimenter explained that the problem one 
of the psychologists was interested in was people’s 
sensitivity to interpersonal cues—cues emanating 
from another person which might indicate some- 
thing about how that person was feeling or what he 
was thinking. 


For example, suppose that every time I was 
about to fly into a rage, I performed some idio- 
syncratic movement, say scratching my head. If 
you were interested in knowing whether I was 
about to fly into a rage (so you could escape), 
you might learn this correlation and thus be able 
to predict my behavior. 


The experimenter explained that the ability to notice 
such cues was an extremely important part of one’s 
sensitivity to the feeling of others—that all people 
had such mannerisms, and that people who could 
learn to identify them quickly were able to predict 
feelings and thoughts of others and were thus con- 
sidered perceptive, sensitive, insightful. The experi- 
menter explained that he was interested in studying 
this process in a highly controlled laboratory situa- 
tion. He then proceeded to describe the task. 

On the table in front of the subject were 24 small 
paper cups, each containing about 2 cubic centime- 
ters of solution. One half of the cups contained a 
sweet solution, the other half a bitter solution. The 
experimenter explained that he would present the 
subject with a solution which the subject was to 
taste. Before the subject tasted the solution, the ex- 
perimenter would present a signal which would indi- 
cate whether the solution would taste bitter or sweet. 
On the basis of this signal the subject was to make a 
prediction before tasting the solution. The experimenter 
said that the signals would be mannerisms of 
his—scratching his head, stroking his nose, etc. 


Some of the signals will mean that the next solu- 
tion will be sweet; some will mean that the next 
solution will be bitter; some will be completely 
irrelevant. Your job is to learn what the signals 
mean and to predict the flavor of the next solution 
you taste. 
One of the variables of interest in the experiment, 
the experimenter explained, was the effect of motivation 
on this ability, i.e., whether obtaining a reward 
for being correct would increase a person’s efficiency. 
In order to test this, two groups were to be studied. 
One group, which the subject would be a member 
of, was an experimental group. For this group, there 
would be monetary payoffs for correct predictions. 
The subject would win $.50 for every correct guess, 
and would lose $1.00 for each incorrect guess. The 
experimenter lent the subject $5.00 with which to 
play the game, and explained that if the subject 
ended the experiment with less than $5.00, no money 


Most subjects won between $1.00 and $2.00. 
The experimenter continued, 


The other psychologist, Dr. —__________, is interested 
in the perception of taste. Specifically, he is 


gave you two sweet solutions, and one contained 
twice as much saccharin as the other. Physically 
such a solution would be twice as sweet, but psychologically 
it probably wouldn't be; i.e., it probably 
wouldn't taste twice as sweet to you. It might 
taste just a little sweeter, it might taste ten times 
as sweet. This is what we are trying to find out. 
We would like a scale, a set of numbers to apply 
to these solutions, so that if we call one solution 
“10” and one “20,” we can be sure that the latter 


of magnitude estimation. The experimenter told the 
subject that he would first give him a standard solution 
which would be called “10.” There would be 
one standard for the sweet, one for the bitter. The 
test solutions were to be compared with the standards 
and were to be assigned numbers which were 
proportional to how sweet or bitter they tasted. 


Thus, if it tastes twice as sweet as the standard, 
you would call it “20,” if it tastes half as sweet, 
call it “5.” 


The experimenter emphasized that any number could 
be used. Two scales were to be determined—one for 
sweet, one for bitter. The experimenter explained that 
the two scales were completely independent; on each 
scale a higher number would represent more concentration, 
i.e., more sweet or more bitter. 

The experimenter reviewed briefly the whole sequence, 
which was: the experimenter gave the signal, 
the subject guessed sweet or bitter, the experimenter 
gave him the cup, the experimenter told him whether 
he was correct or not just as the subject tasted the 
solution, the subject estimated the sweetness or bitterness 
of the solution. 


CONFIRMATION OF EXPECTANCIES 


The experimenter first presented two standards of 
each solution. One other standard of each solution 
was given after the ninth trial. The experimenter 
waited for one minute after each trial in order to 
allow any aftereffects of the taste to dissipate. Fresh 
water was available at all times if the subject wished 
to rinse his mouth. In actuality, all of the sweet solu- 
tions and all of the bitter solutions contained pre- 
cisely the same physical concentration. The sweet 
solution was a saccharin solution of 2 grams per liter. 
The bitter was a quinine sulfate solution of .080 
grams per liter. 

The signals actually used were extremely simple. 
They consisted of the experimenter’s moving his hand 
to his ear, forehead, and chin as a signal for bitter; 
nose, forehead, and chin as a signal for sweet. Most 
subjects learned these signals in three to five trials. 

After it was clear that the subject had learned the 
signals, the experimenter gave the signal for one solu- 
tion while presenting the subject with the other 
solution, which caused the subject to be wrong on 
that trial. The experimenter then resumed the cor- 
rect signals, and waited until the subject had been 
correct for four or five trials, at which point the ex- 
perimenter again gave a misleading signal. 

The assumption in this procedure is that the sac- 
charin solution will taste pleasant, and the quinine 
solution will taste unpleasant. Further, it is assumed 
that a sweeter solution will taste more pleasant, and 
a more bitter solution more unpleasant. Evidence in 
support of these assumptions is reviewed by Pfaff- 
man (1958). In this experiment, this assumption was 
clearly satisfied for most subjects. For example, many 
of the subjects spontaneously remarked how unpleasant 
the quinine sulfate solution tasted, and one subject 
could not complete the experiment because she 
feared it would make her sick. However, there were 
a few subjects who found the bitter solution to be 
rather pleasant. Also, there were some subjects who 
found the saccharin solution unpleasant. In order to 
be certain that we used only subjects for whom the 
sweet and bitter solutions were, respectively, pleasant 
and unpleasant, all subjects were asked the following 
questions at the conclusion of the experiment: 


Would you say the standard sweet solution was 
pleasant or unpleasant? 

Was a sweeter solution more pleasant? 

Would you say the standard bitter solution was 
pleasant or unpleasant? 

Was it more unpleasant as it got more bitter? 


It will be remembered that all of the solutions 
were of the same strength; the subjects, however, 
showed considerable variability in their ratings. Thus, 
the phrase “a sweeter solution” did have meaning 
for the subject. 

For the analysis of the data on sweet solutions, 
only those subjects were used who found the solution 
to be more pleasant as it became sweeter. Similarly, 
only the data of subjects who found the bitter 
solution to be more unpleasant as it became more 
bitter were used in the analysis of bitter solutions. 
Because of this precaution, the data of 10 of the 52 
subjects were eliminated in the analysis of sweet 
solutions;4 the data of 3 of the 52 subjects were 
eliminated in the analysis of the bitter solution.5 


3 Evidence is presented by Neilson (1958) to show 
that aftereffects of quinine dissipate in about 50 
seconds. 

At the close of the experiment, the experimenter 
told the subject the true purpose of the experiment 
and discussed the need for deception. 


RESULTS 


The hypothesis under consideration is the 
following: If a person expects a particular 
event (X) and instead, a different event (Y) 
occurs, he will experience dissonance. Consequently, 
he will judge Y to be less pleasant 
than if he had had no previous expectancy. 
In terms of the present operations, if a per- 
son expects a solution to be sweet, but finds 
that it is bitter, he will judge the solution to 
be less pleasant (more bitter). Similarly, if he 
expects a solution to be bitter, but finds that 
it is sweet, he will again judge the solution 
which disconfirms his expectancy to be less 
pleasant (less sweet). 

Before discussing the results, several things 
should be noted. When the subject is incor- 
rect, there are clearly several factors operat- 
ing: He loses money ($1.00), he is wrong, 
and an expectancy is disconfirmed. These can 
be separated in the analysis. There are some 
trials on which the subject is incorrect, but 
has no clear expectancy about what the solu- 
tion will be. That is, on early trials, before 
he has learned the signal system, or on trials 
following a disconfirmation when he feels he 
does not understand the signal system, the 
subject does not have a firm expectancy. But 
when he has predicted correctly for several 
successive trials, he expects very strongly that 
the next solution will be what the signal has 
led him to predict. For this reason, in the 


4 The surprisingly large number of subjects who 
found the sweet solutions to be unpleasant may be 
partially explained by the fact that six of them were 
among the last seven subjects who participated in 
the experiment. When the experimenter tasted to 
check this phenomenon, he found that the solution 
had become stale and indeed was slightly unpleasant. 

5 Five additional subjects were discarded from the 
analysis: one because he guessed the purpose of the 
experiment; one because the quinine made her feel 
sick and she could not complete the experiment; 
three because they failed to learn the signals. 
analysis we look for trials on which the subject 
was correct (Correct), trials on which he 
was incorrect but had no strong expectancy 
(Incorrect), and trials on which he was in- 
correct and strongly expected to be correct 
(Disconfirmation). It was decided to define 
Disconfirmation by including those trials on 
which all of the following occurred: the sub- 
ject was incorrect, the experimenter gave an 
incorrect signal, the subject had been correct 
on the previous two trials of that flavor. These 
criteria are probably conservative, in the 
sense that an independent measure of strength 
of expectancy might identify some trials as 
Disconfirmation which we have classified as 
Incorrect. 
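
The three trial categories defined above can be made concrete with a short sketch. The Python below is illustrative only and is not the authors' scoring procedure: the Trial record, its field names, and the function classify_trials are hypothetical, and the sketch assumes that "the previous two trials of that flavor" refers to the flavor of the solution actually tasted.

```python
from dataclasses import dataclass


@dataclass
class Trial:
    flavor: str              # solution actually tasted: "sweet" or "bitter"
    guess: str               # subject's prediction: "sweet" or "bitter"
    misleading_signal: bool  # True if the experimenter gave an incorrect signal


def classify_trials(trials):
    """Label each trial Correct, Incorrect, or Disconfirmation.

    A trial counts as Disconfirmation only when all three stated criteria
    hold: the subject was incorrect, the signal was incorrect, and the
    subject had been correct on the previous two trials of that flavor.
    """
    labels = []
    # Assumption: history is kept separately for each tasted flavor.
    history = {"sweet": [], "bitter": []}
    for trial in trials:
        correct = trial.guess == trial.flavor
        if correct:
            label = "Correct"
        else:
            previous_two = history[trial.flavor][-2:]
            firm_expectancy = len(previous_two) == 2 and all(previous_two)
            if trial.misleading_signal and firm_expectancy:
                label = "Disconfirmation"
            else:
                label = "Incorrect"
        labels.append(label)
        history[trial.flavor].append(correct)
    return labels
```

Under this reading, an error on an early trial, or on a trial shortly after an earlier error, falls into Incorrect even when the signal was misleading, which is the conservative property the text describes.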

Figure 1 shows the mean ratings of sweet- 
ness and bitterness for each trial for the In- 
correct and Disconfirmation conditions. Be- 
cause of large differences between subjects in 
both means and variances, each subject's ratings 
were transformed to standard scores with 
a mean of 10 and a variance of 1. Since no 
subjects had expectancies disconfirmed on the 
first five trials, the results for these trials are 
not included. 

FIG. 1. Mean ratings of sweetness and bitterness. [Panels: Bitter, Sweet.] 
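
The within-subject transformation to standard scores can be sketched as follows. This is a minimal illustration, not the authors' computation: the function name standardize is hypothetical, and the use of the population standard deviation (rather than the sample estimate) is an assumption, since the text does not say which was used.

```python
import statistics


def standardize(ratings, target_mean=10.0, target_sd=1.0):
    """Rescale one subject's ratings to a chosen mean and standard deviation.

    Assumption: the population standard deviation is used, so the
    transformed scores have a variance of exactly target_sd ** 2.
    """
    mean = statistics.fmean(ratings)
    sd = statistics.pstdev(ratings)
    if sd == 0:
        raise ValueError("ratings are constant; standard scores are undefined")
    return [target_mean + target_sd * (r - mean) / sd for r in ratings]
```

Applying the function separately to each subject's sweet and bitter ratings removes between-subject differences in the numbers chosen, so that only each subject's relative judgments enter the analysis.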

In examining the theoretical ideas outlined 
earlier, the following comparisons are of interest. 
The difference between being correct 
and being incorrect with an expectancy should 
be largest, since it contains the effect due to 
being wrong, the effect due to losing money, 
and the effect due to having an expectancy 
disconfirmed. The difference between being 
correct and being incorrect with no expectancy 
gives an estimate of the effects of being 
wrong and losing money. But the crucial comparison 
is between being incorrect with an expectancy 
(Disconfirmation) and being incorrect 
without an expectancy (Incorrect). This 
provides a control for effects of being wrong 
and losing money, and should reflect only the 
effect of having one's expectancies disconfirmed. 


[Figure 1 legend: solid line, Disconfirmation; broken line, Incorrect. Horizontal axis: Trials.] 




Because it was impossible to control completely 
the trial on which a subject was correct 
or incorrect, the effects due to treatments 
are partially confounded with the effects due 
to trials. To eliminate this confounding, and 
get unbiased estimates of treatment effects, 
an analysis of variance was performed. Although 
the unequal number of cases in the 
cells precluded use of the standard formulas 
for the analysis of variance, it was possible 
to use a standard analysis of variance model. 
A least squares solution was obtained, giving 
unbiased estimates of all treatment effects, 
and yielding all F and t tests. The analysis of 
variance is shown in Table 1. It is apparent 
that the treatment effects are highly significant 
for both sweet and bitter (p < .001 in 
both cases). 
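
As an arithmetic check on Table 1, each F can be recomputed as the ratio of an effect mean square to the residual mean square for the same taste. The sketch below uses only the mean squares printed in the table; the function name f_ratio is hypothetical, and small disagreements in the second decimal place are to be expected because the printed mean squares are themselves rounded.

```python
def f_ratio(ms_effect, ms_error):
    """Return the F ratio for an effect tested against the residual mean square."""
    if ms_error <= 0:
        raise ValueError("residual mean square must be positive")
    return ms_effect / ms_error


# Mean squares as printed in Table 1 (treatments and residual rows).
sweet_treatments_f = f_ratio(7.54, 0.75)
bitter_treatments_f = f_ratio(5.98, 0.82)

print(f"Sweet treatments F  = {sweet_treatments_f:.2f}")
print(f"Bitter treatments F = {bitter_treatments_f:.2f}")
```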

As may be seen in Figure 1, or from the 
overall means in Table 2, the results for the 
sweet solutions are completely in line with the 
hypothesis. The subject perceived the solution 
as most sweet on trials when he correctly pre- 
dicted that it would be sweet. On trials when 
he was incorrect, but had no expectancy, the 
solution was perceived as less sweet; the least 
pleasant of all were the cases in which a 
sweet solution disconfirmed an expectancy. 

The bitter solutions were perceived as most 
bitter on trials when the subject was incorrect 
with a firm expectancy. The means for 
the other two groups are in the opposite direction 
from what one might have predicted 
from a simple-minded interpretation of association 
theory; thus, the solution was perceived 
as more bitter on trials when the subject 
was correct than on trials when he was 



TABLE 1 
ANALYSIS OF VARIANCE 

Source            df       SS      MS       F 
Sweet 
  Trials           6    32.99    5.50    7.33* 
  Treatments       2    15.09    7.54   10.06* 
  Error          250   186.90     .75 
  Total          258   234.98 
Bitter 
  Trials           5     5.28    1.06    1.29 
  Treatments       2    11.95    5.98    7.29* 
  Error          256   210.62     .82 
  Total          263   227.85 

* p = .001. 
TABLE 2 

UNBIASED ESTIMATES OF THE EFFECTS OF TREATMENTS 
ON THE PERCEPTION OF TASTE 

Taste       Correct   Incorrect   Disconfirmation   t 
Sweet a      10.09      9.89          9.47          C × I: 1.32 
                                                    C × D: 4.58**** 
                                                    I × D: 2.54** 
Bitter b     10.15      9.86         10.57          C × I: 1.86* 
                                                    C × D: 2.89*** 
                                                    I × D: 3.77 

a Higher scores mean more sweet. 
b Higher scores mean more bitter. 


incorrect but had no expectancy. This difference 
is not significant, however (t = 1.86, 
p < .10); nor is the comparable difference 
for the sweet solutions (t = 1.32). 

The conclusion suggested by these results is 
that there is no clear effect on the perception 
of taste due to the subject's being incorrect 
if he had no expectancy. However, when the 
subject is incorrect, and a firm expectancy is 
disconfirmed, there is a clear and strong ef- 
fect on the taste of the solutions which discon- 
firms the expectancy. Bitter solutions which 
disconfirm expectancies are perceived as more 
unpleasant (more bitter); sweet solutions 
which disconfirm expectancies are perceived 
as less pleasant (less sweet). 

It should be noted that our results for bit- 
ter and sweet move, as predicted, in opposite 
directions. For the bitter solution, a discon- 
firmed expectancy resulted in a rating of 
more bitter (higher rating); for the sweet 
solution a disconfirmed expectancy resulted 
in a rating of less sweet (lower rating). Con- 
sequently, an alternative explanation involv- 
ing the possible assimilation of the rating to- 
ward the expected taste could account for the 
results involving the sweet solution but would 
predict the opposite for the bitter solution. 
Likewise, an alternative explanation involving 
the possible exaggeration of the strength due 
to surprise (contrast effect) could account for 
the results involving the bitter solution, but 
would predict the opposite for the sweet solution. 
There would seem to be no reason to 
predict contrast in one flavor and assimila- 
tion in the other. It appears that the only 
unitary explanation for the results is one that 
takes into account the affectual direction of 
the two continua, thus predicting opposite 
results for bitter and sweet. 

Our theory leads to the prediction that the 
stronger the expectancy, the greater the nega- 
tive affect following its disconfirmation. Such 
a prediction is apparently contrary to affec- 
tual theories which suggest that positive af- 
fect follows a departure from an expected re- 
sult (e.g., McClelland, 1953; White, 1959). 
For example, McClelland’s discrepancy hy- 
pothesis holds that stimuli widely discrepant 
from a person’s adaptation level produce 
negative affect; stimuli mildly discrepant from 
a person’s adaptation level produce positive 
affect; stimuli which coincide with a person’s 
adaptation level produce boredom. Here, level 
of adaptation can be considered to be identi- 
cal with an expectancy which has been formed 
by repeated invariant events. This hypothesis 
was confirmed in an experiment by Haber 
(1958) who found that subjects who had 
adapted to a given temperature preferred a 
slight change in temperature to either no 
change or to a large change. It would be 
of interest to determine the condition under 
which a disconfirmed expectancy is pleasant 
or unpleasant. For example, in the present 
experiment the expectancy was ego involving 
—i.e., the subject felt that he was right when 
he made his prediction. This was not the case 
in Haber’s experiment. Moreover, in the present 
experiment, disconfirmation and confirmation 
had important consequences for the future; 
i.e., a disconfirmed expectancy meant 
that the subject would have to reformulate 
his ideas if he wanted to be correct on the 
next trials. In addition, it is clear that cer- 
tain kinds of situations are inherently more 
boring than others. In the case of a very 
boring situation, a disconfirmed expectancy 
might be a welcome relief from boredom. It 
is clear that there are many variables which 
must be taken into account before a precise 
statement can be made concerning the relationship 
between affect and the disconfirmation 
of an expectancy. What this study has 
demonstrated is that under certain conditions 
(not boring, future relevant, and ego involving) 
the disconfirmation of a strong expectancy 
leads to negative affect. 
COGNITIVE DISSONANCE, SENSITIVITY, AND EVALUATION 


N. T. FEATHER 
University of New England, N. S. W., Australia 


In a test of dissonance theory the sensitivity of regular smokers to information 
about smoking and lung cancer was compared with that of nonsmokers. Smokers were 
more interested in such information than nonsmokers; but they did not seek 
out negative evidence about lung cancer and smoking, nor did they reject 
information about the harmful effects more than nonsmokers. No clear-cut 
results emerge from the analysis of dissonance reduction in relation to 
extraversion and neuroticism, though regular smokers were higher in 
extraversion than nonsmokers. 


The present study attempts to distinguish 
between the effects of a state of cognitive 
dissonance on a person’s sensitivity to in- 
formation and his evaluation of information. 
According to Festinger (1957), dissonance 
may be assumed to exist between two cogni- 
tions when one implies the obverse of the 
other, i.e., a and b are dissonant if a implies 
not-b. This dissonant state is assumed to be 
motivating in the sense that a person will 
attempt to reduce dissonance so as to achieve 
consonance in his cognitions. Festinger de- 
scribes a number of situations in which cog- 
nitive dissonance may develop and ways in 
which it may be reduced. Recent research 
(e.g., Adams, 1961; McGuire, 1960; Rosen, 
1961) has focused on one or more of these 
situations and methods of dissonance reduction. 

Festinger’s (1957, p. 163) discussion of 
the effects of cognitive dissonance on exposure 
to information is of particular interest 
to the present investigation. This discussion 
implies that, in general, a person 
should be much more sensitive to information 
which decreases dissonance than to information 
which increases it. Furthermore, 
he should tend to evaluate or interpret information 
in such a way as to reduce dissonance. 
Evaluation of information should 
be in the direction of achieving consonance 
with cognition most resistant to change. 

These implications from dissonance theory 
were recently investigated in a study of cigarette 
smoking and lung cancer which 
used young university students as subjects 
(Feather, 1962). As Festinger (1957) indicates, 
“One may state with certainty that 
the knowledge that smoking is conducive to 
lung cancer is dissonant with continuing to 
smoke” (p. 153). The following four predictions 
were investigated: 

1. Regular smokers should be less likely 
than nonsmokers to seek out information sup- 
porting a relationship between lung cancer 
and cigarette smoking. 

2. Regular smokers should be more likely 
than nonsmokers to seek out information 
not supporting a relationship between lung 
cancer and cigarette smoking. 

3. Regular smokers should be more likely 
than nonsmokers to consider information sup- 
porting a relationship between lung cancer 
and cigarette smoking as unconvincing. 

4. Persons who hold a firm opinion about 
the relationship between lung cancer and 
cigarette smoking should be more likely to 
seek out information supporting that opinion 
than information which is contrary to it. 

The investigation showed that regular 
smokers were more interested in information 
about the relationship between cigarette 
smoking and lung cancer than were non- 
smokers. This greater interest occurred ir- 
respective of whether the information sup- 
ported or did not support a positive rela- 
tionship between the smoking habit and 
lung cancer. The tendency for regular smok- 
ers to be more interested than nonsmokers in 
information supporting a smoking-lung cancer 
relationship is difficult to reconcile with dis- 
sonance theory since this type of information 
should increase cognitive dissonance and 
hence should be avoided. On the other hand, 
the study indicated that regular smokers 
considered the evidence for a relationship 
between cigarette smoking and lung cancer 
as less convincing than did nonsmokers. This 
result is consistent with the implication from 
dissonance theory. These results suggest that 
a state of cognitive dissonance may have 
more influence on a person’s evaluation of 
information than on his sensitivity to in- 
formation. 

It can be argued, however, that the failure 
to confirm the predictions about sensitivity 
to information may be a consequence of the 
youth of the subjects employed in the in- 
vestigation. Young university students who 
have only recently taken up cigarette smok- 
ing may not, in fact, have committed them- 
selves to the habit. They may be interested 
in any information about cigarette smoking, 
whether it be anti-lung cancer or pro-lung 
cancer, which would assist them in deciding 
whether to continue smoking or not. As 
Festinger (1957) argues, “. . . the preaction 
or predecision situation will be characterized 
by extended and nonselective seeking of 
relevant information” (p. 126). The pre- 
dicted influence of cognitive dissonance on 
sensitivity to information may then occur 
only after commitment to a belief, attitude, 
or course of action. Furthermore, there is 
some ambiguity in interpreting the results 
of the above study since one is unable to 
determine whether smoking behavior has in- 
fluenced cognition (the dissonance inter- 
pretation) or whether cognition has in- 
fluenced smoking behavior. In relation to this 
latter possibility, one might expect that 
young and, it is hoped, critical university 
students would examine the evidence about 
smoking and lung cancer. Those who decide 
that the evidence is unconvincing should be 
more likely to take up smoking. This inter- 
pretation is consistent with the finding in 
the above study that regular smokers rate 
evidence for a positive relationship between 
cigarette smoking and lung cancer as less 
convincing than do nonsmokers. 

The above considerations suggest the need 
for a repetition of the earlier study using 
much older subjects who are thoroughly com- 
mitted to the smoking habit and who began 
smoking before the evidence on cigarette 
smoking and lung cancer was publicized. 
The present study is designed to fulfill these 
requirements and to provide evidence relevant 
to the first three predictions of the 
earlier investigation. As a secondary aim it 
also explores the possibility that individual 
differences in dissonance reduction may be 
related to Eysenck’s (1957) dimensions of 
neuroticism and extraversion. 


METHOD 


Subjects were 152 male students attending a 
vacation school at the University of New England. 
Of these subjects, 90 were cigarette smokers and 
62 did not smoke. Materials were circulated to 
these subjects in the following order: Interest 
Survey, Survey on Smoking Habits, Maudsley 
Personality Inventory. 

Instructions for the Interest Survey were as 
follows: 


The present survey is concerned with types of 
magazine articles which people find interesting. 
The following list includes articles which appear 
fairly frequently in popular magazines and newspapers. 
I would like you to rank these articles 
in their order of interest to you, i.e., give a rank 
of 1 to the article you would be most interested 
in reading, a rank of 2 to the article which would 
be next in interest, etc. 


Each subject was required to rank 13 articles exemplified 
by the following: Cure of Illness by 
Hypnotism, Religion and Life, Sex and the Beatniks, 
How to Lose Weight. Two forms of the 
Interest Survey were randomly distributed. In 
Form A the fifth article was “Smoking Leads to 
Lung Cancer.” Form B was identical to Form A 
in every respect except that the fifth article was 
“Smoking Does Not Lead to Lung Cancer.” 
The Survey on Smoking Habits consisted of a 
set of questions of which the first two are most 
relevant to the present study. Question 1 asked: 
To what extent do you think that the evidence 
that smoking leads to lung cancer is convincing? 
Each subject represented his opinion by checking 
along a 5-inch graphic rating scale with the two 
extremes marked as Highly Convincing and Not 
Convincing at all. Question 2 asked: To what extent 
do you agree with the statement that smoking leads 
to lung cancer only when the person is a heavy 
smoker? Each subject represented his opinion by 
checking along a 5-inch graphic rating scale with the 
two extremes marked as Strongly Disagree and 
Strongly Agree. 
Each scale was divided into five equal parts and 
scores from one to five were assigned by the experimenter 
in the direction of increasing conviction 
that the evidence was convincing (Question 1) or 
increasing agreement with the statement (Question 
2). Other questions in the Survey on Smoking 
Habits asked each smoker whether he classed himself 
as a regular or as an occasional smoker, how 
many cigarettes he smoked each day on the average, 
TABLE 1 

MEAN AGES, MEAN TIME OF SMOKING, AND MEAN 
NUMBER OF CIGARETTES PER DAY FOR REGULAR 
SMOKERS AND NONSMOKERS COMPLETING 
FORM A AND FORM B OF INTEREST SURVEY 

                             Regular smokers      Nonsmokers 
                             Form A    Form B    Form A    Form B 
Mean age                     30.0 a    29.18 a   27.77 a   29.35 a 
Mean time of smoking         15.23 a   13.06 a     —         — 
Mean cigarettes smoked 
  per day                    16.09     18.07       —         — 
N                            22        17        22        17 

a In years. 


what type of cigarette he smoked, how long he had 
been smoking, whether he had ever seriously tried 
to give up smoking, and why he started to smoke 
in the first place. Each former smoker was asked 
why he had quit smoking. Each nonsmoker was 
asked why he had never started to smoke. 
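
The scoring of the 5-inch graphic rating scales described above can be sketched as a simple binning rule. The sketch is illustrative only: the function name score_rating is hypothetical, the check mark is assumed to be measured in inches from the low end of the scale (Not Convincing at all for Question 1, Strongly Disagree for Question 2), and the treatment of marks falling exactly on a division is an assumption, since the text does not state it.

```python
def score_rating(position_inches, scale_length=5.0, n_parts=5):
    """Convert a check-mark position on a graphic rating scale to a 1-5 score.

    Assumptions: the position is measured from the low end of the scale,
    and a mark exactly on a division is scored in the higher part, except
    at the top end of the scale, which is scored n_parts.
    """
    if not 0 <= position_inches <= scale_length:
        raise ValueError("position lies outside the scale")
    part_width = scale_length / n_parts
    return min(n_parts, int(position_inches // part_width) + 1)
```

For example, a mark 2.4 inches from the low end of the Question 1 scale falls in the third of the five parts and would receive a score of 3 under these assumptions.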

The Maudsley Personality Inventory, consisting 
of 80 items, provided measures of neuroticism and 
extraversion for each subject. 

Records of three subjects who misunderstood in- 
structions and/or made arithmetical errors in rank- 
ing articles on the Interest Survey were excluded 
from the analysis. The experimenter also excluded 
6 subjects who described themselves as occasional 
smokers, 20 subjects who had been smoking for 
less than 9 years, and 22 subjects who were former 
smokers but who had given up the habit. Records 
were also examined to find if there were smokers 
who had seriously considered giving up smoking 
within the preceding 2 months. These smokers 
might be near the limit of cognitive dissonance and 
hence might actively seek dissonance increasing in- 
formation (Festinger, 1957, pp. 129-131). How- 
ever, there were no records of this type. 

There remained 39 smokers who had begun 
smoking before extensive publicity about cigarette 
smoking and lung cancer appeared and who were 
presumably thoroughly committed to the habit. Of 
these “hardened” smokers, 22 subjects completed 
Form A of the Interest Survey and 17 subjects 
completed Form B. An attempt was made to match 
each of these subjects with a nonsmoker of approximately 
equivalent age. Table 1 shows the mean 
ages, mean time of smoking, and mean number of 
cigarettes per day for the four different groups. 


RESULTS 
Analysis of Rankings (Form A and Form B) 
It is predicted that regular smokers should 
tend to rank the article “Smoking Leads to 
Lung Cancer” lower in interest than nonsmokers 
on Form A of the Interest Survey. 
In contrast, regular smokers should tend to 
rank the article “Smoking Does Not Lead to 
Lung Cancer” higher in interest than nonsmokers 
on Form B of the Interest Survey. 
These predicted differences in ranking be- 
tween regular smokers and nonsmokers are 
consistent with the assumption that a person 
in a state of cognitive dissonance should tend 
to avoid information likely to increase the dis- 
sonance and to seek out information likely to 
reduce the dissonance. 

Table 2 presents the mean rankings of the 
critical article from Form A and Form B of 
the Interest Survey for regular smokers and 
nonsmokers, together with an analysis of 
variance. 

Table 2 indicates that, for both Form A 
and Form B of the Interest Survey, regular 
smokers show more interest in the critical 
article than do nonsmokers. The analysis of 
variance applied to the data shows that this 
main effect is highly significant (p < .001). 
The interaction term is not statistically sig- 
nificant as would be demanded if the predic- 
tions from dissonance theory were to be con- 
firmed. Thus, while regular smokers display 
more interest in the article “Smoking Does 
Not Lead to Lung Cancer” than do nonsmokers, 
they also rank the article “Smoking 
Leads to Lung Cancer” higher in interest 
than do the nonsmokers. This latter result 
is clearly contrary to prediction. 


TABLE 2 

MEAN RANKINGS OF CRITICAL ARTICLE FROM FORM A 
AND FORM B OF INTEREST SURVEY FOR REGULAR 
SMOKERS AND NONSMOKERS 

                                     Mean rankings of critical article a 
Form                                 Regular smokers    Nonsmokers 
Form A (Pro-lung cancer article)          6.64             9.41 
Form B (Anti-lung cancer article)         6.47             9.06 

Analysis of Variance 
Source                                df      MS         F 
Regular smokers versus Nonsmokers      1    141.346    15.12** 
Form A versus Form B                   1      1.277     <1 
Interaction                            1       .164     <1 
Within                                74      9.346 

a Smaller numbers denote higher interest. 
** p < .001. 


The present analysis of rankings thus 
replicates the results of the previous in- 
vestigation. Apparently, in the situation in- 
vestigated, regular smokers are not differ- 
entially sensitive to the pro-lung cancer and 
anti-lung cancer articles, i.e., cognitive dis- 
sonance does not have the predicted influence 
on sensitivity to information. 


Analysis of Ratings (Question 1 and Ques- 
tion 2) 


It is predicted that regular smokers should 
be more likely than nonsmokers to answer 
Question 1 of the Survey on Smoking Habits 
by saying that the evidence is unconvincing. 
A tendency to discount the evidence that 
smoking leads to lung cancer would be one 
way in which regular smokers could reduce 
cognitive dissonance. 

Table 3 presents mean ratings for Question
1 from the Survey on Smoking Habits for
regular smokers and nonsmokers, who had
previously completed Form A or Form B of
the Interest Survey, together with an analysis
of variance.


TABLE 3

MEAN RATINGS FOR QUESTION 1 FROM SURVEY ON
SMOKING HABITS FOR REGULAR SMOKERS AND
NONSMOKERS FOLLOWING COMPLETION OF
FORM A AND FORM B OF INTEREST
SURVEY

                                     Mean ratings for Question 1 (a)
Form                                 Regular smokers    Nonsmokers
Form A (Pro-lung cancer article)          2.23              3.18
Form B (Anti-lung cancer article)         2.82              3.24

Analysis of Variance
Source                                 df       MS         F
Regular smokers versus nonsmokers       1     10.051     5.78*
Form A versus Form B                    1      2.024     1.16
Interaction                             1      1.414     <1
Within                                 74      1.739

a A low rating denotes that the subject believes evidence for
the smoking-lung cancer relationship is relatively unconvincing.
* p < .025.


TABLE 4

MEAN RATINGS FOR QUESTION 2 FROM SURVEY ON
SMOKING HABITS FOR REGULAR SMOKERS AND
NONSMOKERS FOLLOWING COMPLETION OF
FORM A AND FORM B OF INTEREST
SURVEY

                                     Mean ratings for Question 2 (a)
Form                                 Regular smokers    Nonsmokers
Form A (Pro-lung cancer article)          3.14              3.09
Form B (Anti-lung cancer article)         2.82              3.29

a A high rating denotes that the subject tends to agree that
smoking leads to lung cancer only when the smoker is a heavy
smoker.

Table 3 shows that the prediction is confirmed.
Regular smokers tend to regard the
evidence that smoking leads to lung cancer as
less convincing than do nonsmokers. The
analysis of variance applied to the data shows
that this main effect is statistically significant
(p < .025). This result is consistent with
previous evidence presented by Festinger
(1957, pp. 153-156).

It is also predicted that regular smokers
should be more likely than nonsmokers to
answer Question 2 of the Survey on Smoking
Habits by disagreeing with the statement.
This prediction is not as clear-cut as the
former since regular smokers could presumably
reduce cognitive dissonance by arguing
that they are not heavy smokers. If this were
to occur they might be quite ready to agree
with the statement in Question 2 that smoking
leads to lung cancer only when a person
is a heavy smoker. Such a tendency was
apparent in the earlier study (Feather, 1961)
where regular smokers showed statistically
significant increases in their ratings from
Question 1 to Question 2 in comparison to
nonsmokers.

Table 4 presents mean ratings for Question
2 from the Survey on Smoking Habits for
regular smokers and nonsmokers who had
previously completed Form A or Form B of
the Interest Survey.

There is no statistically significant tendency
in Table 4 for regular smokers to give
a lower rating to Question 2 than nonsmokers.
It is, however, interesting to note that, when
Table 4 is compared with Table 3, regular
smokers do tend to increase their ratings
from Question 1 to Question 2 when compared
to nonsmokers. The mean increase in
rating from Question 1 to Question 2 for
regular smokers (+.51) is not significantly
different (t = 1.46, df = 76, ns) from the
mean change in rating for nonsmokers (−.03)
but is in the same direction as the same
comparison made in the earlier study. These data
at least suggest the interpretation that some
regular smokers may reduce cognitive dissonance
by identifying themselves as light
smokers only. As light smokers they are
more likely to agree with the statement in
Question 2.

The above results thus parallel those of
the previous investigation. It appears that
cognitive dissonance does have the predicted
influence on evaluation of information.


Dissonance Reduction and Personality Di- 
mensions 


A secondary aim of the present study was 
to explore the possibility that individual dif- 
ferences in dissonance reduction might relate 
to differences in neuroticism and extraversion 
(Eysenck, 1957). No clear-cut results were 
obtained from this analysis. The only tendency
which approaches significance is for subjects
high in extraversion to show more interest
in the article “Smoking Leads to Lung
Cancer” than subjects low in extraversion
(r = −.48 for regular smokers, p < .05;
r = −.33 for nonsmokers, p > .05). The
mean extraversion score is significantly higher
for regular smokers than for nonsmokers
(mean extraversion score is 28.49 for regular
smokers and 23.89 for nonsmokers, F = 5.68
for df = 1/74, p < .025). This difference in
extraversion between smokers and nonsmokers
replicates the earlier result obtained by
Eysenck, Tarrant, and England (1960).


DISCUSSION 


The preceding analysis of rankings and
ratings for regular smokers versus nonsmokers
replicates the results of the earlier study and,
in so doing, adds to the generality of the
findings by extending them to a wider range
of conditions. Young smokers who have only
recently adopted the habit and smokers who
have been committed to the habit for a con- 
siderable time behave in the same way. Both 
express greater interest in an article con- 
cerning cigarette smoking and lung cancer 
than do nonsmokers, irrespective of the kind 
of relationship which the article supports. 
There is no differential sensitivity among 
regular smokers to the pro-lung cancer and 
anti-lung cancer articles as implied by the 
theory of cognitive dissonance. Apparently, 
just the fact that the article deals with cig- 
arette smoking and lung cancer is sufficient 
to make regular smokers more sensitive to it. 
On the other hand, regular smokers from 
both age groups are more critical of the evi- 
dence that cigarette smoking leads to lung 
cancer than are nonsmokers. They are more 
likely to rate this evidence as unconvincing. 
It appears, then, that their evaluation of 
information is affected in the manner implied 
by the theory of cognitive dissonance. 

One might attempt to reconcile the results 
concerning sensitivity to information with dis- 
sonance theory by arguing that regular 
smokers show more interest in the smoking- 
lung cancer articles because, in some way, 
information suggested by these articles is 
consonant with cognitions which the smokers 
have in common. These cognitions may not 
be so salient or so general among the non- 
smokers. Specifically, regular smokers may 
have a stronger and more general fear of 
lung cancer than nonsmokers (cf. Quinn, 
1961) and information about lung cancer may 
be consonant with the knowledge that one is 
afraid (cf. Festinger, 1957, p. 149). But 
whether or not this cognitive consonance 
occurs would depend on the kind of information
provided. Thus, information in the article
“Smoking Leads to Lung Cancer” would
be dissonant with the knowledge that one
is a committed smoker but may be consonant
with the knowledge that one is afraid of
lung cancer as a consequence of smoking.
Similarly, information in the article “Smoking
Does Not Lead to Lung Cancer” would be
consonant with the knowledge that one is a
committed smoker but may be dissonant with
the knowledge that one is afraid of lung
cancer as a consequence of smoking. It is not
apparent how such a combination of cognitive
dissonance and cognitive consonance could
account for the clear difference in sensitivity 
found between the regular smokers and the 
nonsmokers in the present study. An analysis 
of this kind requires detailed identification of 
all cognitive elements and their relationships 
in advance, and a more explicit treatment of 
how consonances and dissonances interact to 
influence sensitivity and evaluation than is 
provided by Festinger. Otherwise post facto 
interpretations such as the above will be 
possible and the theory of cognitive dis- 
sonance will have limited vulnerability. 

It is possible to account for the differ- 
ences obtained between regular smokers and 
nonsmokers in their evaluation of informa- 
tion without any appeal to dissonance theory 
at all. It can be argued that, as evidence for 
a positive relationship between cigarette 
smoking and lung cancer accumulates, smok- 
ers who believe the evidence is convincing 
tend to stop smoking while those who find 
the evidence is unconvincing are likely to 
continue to smoke. Thus, we would expect a 
group of smokers who have indulged in the 
habit for a considerable time to be fairly 
homogeneous in regarding the evidence as 
unconvincing, since many of those who con- 
sider that the evidence is convincing have 
excluded themselves by giving up the habit. 
This self-exclusion process should lead to a 
residual group with similar opinions. Some 
results from the present study do indicate 
that persons who have quit smoking are 
more convinced by the evidence than are per- 
sons who continue to smoke. The mean rating 
for Question 1 of the 22 former smokers who 
were excluded from the analysis is 3.18. 
The corresponding mean rating for the 39 
regular smokers included in the analysis is 
2.49. The difference between these means is 
statistically significant (F = 4.40, df = 1/59, 
p< .05). 

However, an analysis of the reasons given 
by the former smokers for discontinuing 
smoking renders the above argument less 
plausible. Only 1 former smoker explicitly 
mentions fear of lung cancer as a reason for 
quitting; 8 former smokers refer to physical 
discomforts (e.g., irritation of the throat,
coughing) and/or concern about health; and
the remaining 13 former smokers list a variety
of reasons such as failure to derive satisfaction
from the habit, expense, and family
pressure. The writer is therefore inclined to 
reject the interpretation that the differences 
obtained between regular smokers and non- 
smokers in their evaluation of the evidence 
are primarily due to a self-exclusion process. 
While this process may operate to a limited 
extent, it seems more likely that the obtained 
differences reflect the influence of pressure to 
reduce cognitive dissonance. In other words, 
these results suggest that smoking behavior
has influenced opinion (the dissonance
interpretation) rather than vice versa. The preceding
discussion underlines the difficulty in
specifying the direction of causality in studies
based on dissonance theory. A more adequate
investigation of the effect of cognitive dis- 
sonance on evaluation would preclude the 
possibility that the opinions of a particular 
group may be related to a selection process, 
but this type of control is obviously most 
difficult for contentious and widely publicized 
issues such as cigarette smoking and lung 
cancer. 

The results of the present study strongly
suggest that states of cognitive dissonance
are more likely to influence a person’s evaluation
or interpretation of information than his
sensitivity to information. A person who is
engaged in a course of action or who holds
a certain attitude or opinion tends to be more
sensitive to information relevant to this
course of action, attitude, or opinion than a
person not behaving in that way or not
holding that attitude or opinion. This greater
sensitivity occurs irrespective of whether the
information supports or does not support the
course of action, attitude, or opinion. In
many cases sensitivity to information seems
to be governed by this factor of personal
relevance, and possible dangers (e.g., lung
cancer) are just as relevant to the cigarette
smoker as are possible reinforcements (e.g.,
no cause for worry). Thus, commitment to
a course of action does not necessarily mean
that a person will in future avoid adverse
information about his choice. Recent studies
by Mills, Aronson, and Robinson (1959) and
by Rosen (1961) indicate that people may
still be interested in adverse or incompatible
information after they have made a decision.
It appears then that the “seeking of relevant
information” which Festinger (1957) considers
typical of a predecision situation also
occurs after commitment and that this relevant
information may be compatible or incompatible
with action or cognition. This is
not to say that pressure to reduce cognitive
dissonance has no influence on sensitivity.
There is evidence which is consistent with
such dissonance reducing effects (e.g., Ehrlich,
Guttman, Schönbach, & Mills, 1957; Mills
et al., 1959; Rosen, 1961). But it is likely
that this pressure is often only a minor part 
of the complex of determinants which in- 
fluence sensitivity and that it is frequently 
overshadowed by other factors. 

It appears, however, that evaluation of 
information may move in the direction of 
consonance with cognition most resistant to 
change, and that distortion of information 
may occur in the process. Krech and Crutchfield
(1948) have stated a related principle
as follows, “Other things being equal, a
change introduced into the psychological
field will be absorbed in such a way as to
produce the smallest effect on a strong structure”
(p. 108). More recently, Katz and
Stotland (1959, p. 443) have adopted a
similar position in their discussion of a principle
of consistency. Thus, evaluation of information
does appear to be influenced by
the degree to which that information is consonant
with existing cognitions. A member of
the Ku Klux Klan should be very sensitive
to any information about Negroes, whether
it be complimentary or hostile, but his evaluation
of this information will tend to be consonant
with his prejudice. In short, dissonant
states seem to have more obvious effects on
evaluation than on sensitivity.

This study investigated the factorial structure of changes concomitant with
client-centered therapy. Most of the variables were differences obtained by sub- 
tracting the pretherapy scores from the corresponding posttherapy scores of 
each of 93 clients. The 1st 4 factors seem to reflect: change in favorableness of 
client self-evaluation, change in adequacy ratings based on the TAT, therapist’s 
perception of change, and change on Hs and Hy from the MMPI. The 5th factor
was length of therapy. Of these orthogonal factors, the 1st 4 are seen as
method factors, each being associated primarily with an observer-instrument 
combination. As measured, the changes associated with short psychotherapy 
cannot be adequately represented by a single global rating or by scores for 
personality traits defined either broadly or narrowly. 


The practice of client-centered therapy has 
been accompanied by various persistent re- 
search efforts to document and to understand 
therapeutic process and outcomes. Within 
these various efforts, one stream of work has 
concentrated on detailed examination of the 
varieties of personality change concomitant 
with psychotherapy (Cartwright & Roth, 
1957; Gibson, Snyder, & Ray, 1955; Rogers 
& Dymond, 1954). Throughout this stream 
of work, the evidence requires the conclusion 
that certain important questions about per- 
sonality change itself should be directly 
raised and investigated: Is such change a 
global modification of an entire personality, 
or is it a series of modifications of essentially 
separate parts or aspects of a personality? 


1 This study was supported largely by a grant from 
the Ford Foundation to the Counseling Center, Uni- 
versity of Chicago. The Social Sciences Research 
Committee of the University of Chicago also made 
several small grants to the principal investigators to 
assist in collection and analysis of data. 

The authors would like to express their appreciation
to the staff of the Counseling Center, University
of Chicago, for their extensive support and
collaboration in making this research possible. The
authors are particularly indebted to Jane Blare,
Rosalind Dymond Cartwright, and Sophie Fox Kirtner
who served as diagnosticians, and to Raleigh Lyles,
Donald Peterson, and Richard Robertson who provided
valuable assistance in the collection and
processing of the data.


Using the language of traits, one may formulate
the question as follows: Do all relevant
personality traits change together, or are
there groups of traits such that changes in
one group are uncorrelated with changes in
other groups? Answers to these questions are
essential both to the understanding of
psychotherapeutic change and to research seeking
the correlates of such change.

Answers to the questions as posed above 
may be sought through the application of 
factor analytic methods. Somewhat analogous 
to the Spearman-Thurstone controversy about 
the nature of intelligence, the present dilemma
may be stated as a question: Are the personality
changes concomitant with client-centered
therapy best described by a single general
factor or by a multiple-factor representation?
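
To make the single-factor versus multiple-factor question concrete: if change were general, change scores (posttherapy minus pretherapy) on different measures should intercorrelate substantially, whereas a multiple-factor structure allows some pairs of measures to change independently. The Python sketch below is an illustration only. The scores are entirely hypothetical (invented for this example, not data from the study), and the helper names are assumptions.

```python
from math import sqrt

def change_scores(pre, post):
    """Change score = posttherapy score minus pretherapy score."""
    return [b - a for a, b in zip(pre, post)]

def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical pre/post scores for six clients on three measures.
self_report_a = change_scores([3, 5, 4, 6, 2, 5], [6, 6, 7, 7, 4, 9])
self_report_b = change_scores([2, 4, 4, 5, 3, 4], [5, 5, 8, 6, 5, 8])
therapist_rating = change_scores([4, 4, 5, 3, 5, 4], [5, 6, 6, 4, 7, 6])

# Two measures from the same source versus measures from different sources.
r_same_source = pearson_r(self_report_a, self_report_b)
r_cross_source = pearson_r(self_report_a, therapist_rating)

print(f"self-report A vs. self-report B: r = {r_same_source:.2f}")
print(f"self-report A vs. therapist:     r = {r_cross_source:.2f}")
```

In these invented data the two self-report changes correlate highly while the therapist-rated change is unrelated to them, the kind of pattern a multiple-factor representation would be needed to describe. A factor analysis generalizes this comparison to the full matrix of intercorrelations among change scores.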

This question is further complicated, however,
by the finding that different variables
measured by the same instrument (e.g.,
MMPI) show concomitant changes (Gibson
et al., 1955). Moreover, available evidence
indicates that different variables arising from
self-report instruments tend to change together;
that the same is true for different
variables arising from therapist ratings; and
that little relation is indicated between
self-report and rating variables (Cartwright
& Roth, 1957).

FACTOR STRUCTURE IN THERAPY CHANGE


The foregoing discursive considerations and 
research results lead to several specific ques- 
tions about personality changes associated 
with psychotherapy: 

1. Is such personality change general over
all variables (i.e., is such change
single-dimensional)?

2. If not single-dimensional, then is each 
factor specific to the variables from a single 
measuring instrument? 

3. Or, is each factor specific to the vari- 
ables from a single viewpoint or class of ob- 
server, such as therapist or client? 

4. Or, is each factor specific to a defined 
personality variable as measured by each of 
several observer-instrument combinations? 

5. Or, is each factor specific to a group of 
such personality variables, each measured by 
several observer-instrument combinations? 

Pointedly and directly, these questions concern
the nature of the domain of personality
changes associated with psychotherapy. And
further questions immediately follow: Is per- 
sonality change during psychotherapy similar 
to or different from personality change after 
the termination of psychotherapy? Before the 
start of a course of psychotherapy, is predic- 
tion of personality change possible? What re- 
lation, if any, has length of therapy with per- 
sonality change? Can certain variables that 
describe the personality and behavior of the 
therapist be shown to influence critically per- 
sonality changes in the clients? 

Questions 1-5 and related further questions 
imply a certain research design: Question 1 
requires that a large sample of clients be 
tested before and after therapy on a wide se- 
lection of personality variables. Question 2 
requires several measuring instruments, each 
of which would yield measurements of a set 
of conceptually distinct personality variables. 
Question 3 requires that several classes of
observers each generate measurements on a
variety of personality variables. Question 4
requires that several observer-instrument
combinations be used to measure the same defined
personality variable. Requirements for Question
5 may be met through fulfillment of those
for the first four questions.


TABLE 1
SCHEMATIC OUTLINE OF RESEARCH DESIGN

Data collection points: Pretherapy, Posttherapy, Follow-up (entries illegible)

Observer source    Instrument

Clients            Preliminary interview form
                   Posttherapy questionnaire
                   Follow-up questionnaire
                   MMPI
                   Q sort (Butler & Haigh, 1954)
                   Marginality questionnaire
                   Social attitude scales (O’Neil & Levinson, 1954)
                   14 Behavioral Adequacy self-rating scales*

Therapists         Rating of integration
                   Rating of liking for client
                   Posttherapy questionnaire
                   14 Behavioral Adequacy rating scales*

Diagnosticians     14 Behavioral Adequacy rating scales* on TAT protocols
                   14 Behavioral Adequacy rating scales on therapy
                     interview recordings
                   14 Behavioral Adequacy rating scales on structured
                     interview recordings
                   Ratings of liking for client by TAT and interview
                     diagnostician
                   Typology ratings on therapy interview recordings
                   Sentence completion test

Other              Rorschach
                   WAIS
                   Number of interviews
                   Number of weeks pre- to posttherapy

* The Behavioral Adequacy scales are described in the text.
Due to testing load it was decided to administer the Rorschach and the WAIS each to only half of the total sample.

The question concerning personality change 
after the termination of therapy requires that 
the same battery of tests be administered at 
some follow-up date. A point 18 months after 
termination was selected. 

Except for the question of therapist influ- 
ence, the design of the present research was 
dictated by the requirements of all foregoing 
questions. Table 1 gives a schematic outline 
of the design. 

In the present paper, only material pertain- 
ing to the nature of personality changes will 
be reported. Other aspects of the total re- 
search, some questions about which have been 
raised in the preceding paragraphs, will be 
reported in subsequent papers. 


METHOD 


Fourteen Behavioral Adequacy Scales 


The rating scales used in this study were designed 
to give maximum coverage of variables investigated 
in previous research. The literature on personality
changes associated with psychotherapy was reviewed 
(Cartwright, 1957a, 1957b), and variables were clas- 
sified according to conceptual content. For example, 
decreasing Depression in the MMPI (Gallagher, 
1953), increasing positive self-reference (Raimy, 
1948), and increasing self-ideal correlation (Butler 
& Haigh, 1954) were, among other variables, deemed 
categorizable under the notion of “increasing
contentedness with self.”

Fourteen such categorizing conceptions were elabo- 
rated to encompass as completely as possible the 
range of previous research variables: ratings, test
scores, clinical judgments, etc. A 10-point rating 
scale was then developed for each conceptual vari- 
able. Each scale had five anchor positions, each 
anchor position ranging over two scale points. The 
positions ranged from an extremely low level of ade- 
quacy (anchor at Scale Points 1 and 2) to optimal 
functioning (anchor at Scale Points 9 and 10). Sev- 
eral differential modes of inadequate behavior were 
described for each of the numerically lower anchor 
positions on the scales, on the assumption that the 
same level of inadequate behavior may be expressed 
in a variety of ways. 
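
The layout of the scale described above can be summarized as a simple mapping from the 10 scale points to the five anchor positions. The following Python sketch is a hedged illustration: the function name is hypothetical, and it assumes that the three intermediate anchors cover Scale Points 3-4, 5-6, and 7-8, since the text states only the two end positions explicitly.

```python
def anchor_position(scale_point: int) -> int:
    """Map a scale point (1-10) to its anchor position (1-5).

    Assumes anchors cover consecutive pairs of points:
    1-2, 3-4, 5-6, 7-8, 9-10.
    """
    if not 1 <= scale_point <= 10:
        raise ValueError("scale point must be between 1 and 10")
    return (scale_point + 1) // 2

# List the anchor position for every scale point.
for point in range(1, 11):
    print(point, anchor_position(point))
```

Under this assumption Scale Points 1 and 2 share the lowest anchor and Scale Points 9 and 10 share the highest, as stated in the text.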

The following are two sample anchors from the
low end and one anchor from the high end of one
scale (Scale II, Adequacy of Contentedness re
Relationships with Others, Roles, etc.).2


1 and 2 (a): The person feels extremely uncomfortable
with others, feels that relationships with
others are strained, never flow easily and relaxedly,
are devastating to a sense of contentment. Relating
to others is a source of great discomfort and
dissatisfaction to him. Being around others, functioning
in a role toward or with them, dealing and
communicating with them—social and interpersonal
exchange—are a continuing and very great
strain upon him. “I don’t like the way relationships
go.” “I feel awful around people.” “I just
feel completely nervous in social gatherings.” “I
simply don’t know what to do around people.”
“People just bother me so much.” “People are so
difficult to get along with, just impossible.”

“I feel extremely uncomfortable in relationships
with others.”

“I feel a kind of awful strain when I’m with
other people.”
1 and 2 (c): The person is extremely unquestioning
of and unconcerned with his relationships
with others, role functions, etc. Relationships are
occurrences or experiences which happen as they
do and go on or cease as they do. With respect to
relationships there is an unmoving unconcern, an
utter ignoring or disregard of them. Elaboration
upon them is minimally sparse.

“I rarely think about relationships with other
people.”

“I almost never pay any attention to relationships
with other people.”

9 and 10: The person expresses a basic contentment
with respect to relationships with others. In
general, his experience of interaction is of a smooth
interpersonal flow—an experience of comfort and
satisfaction in his personal and social exchange.
The person is calm around others, enjoys being
around others. Interactional discomforts, role
discomforts, interpersonal dissatisfactions, are
transient experiences; for, in dealing with others,
whatever the momentary situation, the person indicates
his continuing interpersonal functioning as
experienced with much that is satisfying and much that
is comfortable, with a very obvious sense of
contentment in that functioning.

“I feel relaxed and comfortable with people in
general.”

“I feel pretty much at ease with people and get
a good deal of satisfaction out of my relationships
with others.”


2 The full scale, “Adequacy of Contentedness re
Relationships with Others, Roles, etc.” has been
deposited with the American Documentation Institute.
Order Document No. 7336 from ADI Auxiliary
Publications Project, Photoduplication Service, Library
of Congress; Washington 25, D. C., remitting in [...].
Make checks payable to: Chief, Photoduplication
Service, Library of Congress.


Below each main paragraph of an anchor may be
[...] two single statements (e.g., “I feel a kind of
awful strain when I’m with other people”). While
[...] containing complete anchors as shown were
[...] by diagnosticians, clients and therapists used
scales having only such single statement anchors. All
raters were instructed to consider all anchors for a
[...] the particular subject. Nomograph scoring
was achieved through giving double weight to the
first choice, single weight to the second choice and
[...]. The resulting scores could range between
[...] and 13 points for each scale as used by any rater.


[...]therapy testing sessions. Therapists rated their clients
immediately after the first and after the last interview.
Diagnostician ratings were obtained for two
kinds of material: TAT transcripts and therapy
interview recordings. The TAT protocols were based on
cards: 1, 2, 3BM, 4, 5, 6BM, 10, 11, 13MF, 16,
[...]M, 19. The interviews were the first and last
interviews of a given client's series of therapy
sessions.


Other Instruments Referred to in the Present


1. Posttherapy Questionnaire. This form sought
biographical data and also included a series of items
about the client's experience in therapy and reasons
for terminating. A typical item was:


1. In general how satisfied are you with what
your therapy accomplished? (not satisfied, somewhat
satisfied, pretty well satisfied, very well
satisfied)

Since the several questions about the experience in
therapy yielded highly correlated answers, a single
composite score was computed from them.
2. MMPI. The full-scale Minnesota Multiphasic
Personality Inventory was taken by the client at
home. After a testing session, he was given the
MMPI booklet and furnished with instructions to
complete it in privacy without discussion with
others. He was provided with a stamped addressed
envelope for return of the material. The tests were
scored on the standard scales and four others.
3. Q Sort. The sort was the same as that used in
previous studies at the Counseling Center (Butler &
Haigh, 1954). The sort consists of 100 statements
drawn from counseling protocols. It includes such
statements as: “I put on a false front”; “I am a
failure”; “I feel adequate”; “I can usually make up
my mind and stick to it.” The measure obtained was
the Q adjustment score (Dymond, 1953).
4. Social Attitude Scales. These include the
Ethnocentrism (E), Traditional Family Ideology (TFI),
and Religion Conservatism (RC) scales developed
by Levinson and others (O'Neil & Levinson, 1954).
A single form contained items for all three scales,
with items from each randomly dispersed throughout.
The form was entitled “Attitudes to Social
Issues.” Subjects were instructed:


Mark each statement in the left margin according
to how much you agree or disagree with it.
Please mark every one. Write in: +1, +2, +3; or
−1, −2, −3, depending on how you feel in each
case.

+1: I agree a little        −1: I disagree a little
+2: I agree pretty much     −2: I disagree pretty much
+3: I agree very much       −3: I disagree very much

There were 39 items altogether. Typical items were: 


(E) One trouble with Jewish businessmen is that 
they stick together and prevent other people from 
having a fair chance in competition. 

(TFI) A child should never be allowed to talk 
back to his parents, or else he will lose respect for 
them. 

(RC) Every explanation of man and the world 
is incomplete unless it takes account of God's will. 


The relation of social attitude measures to changes 
in psychotherapy has been shown in several studies 
(e.g., Gordon & Cartwright, 1954).

Therapists:

1. Rating of Integration. Attached to the therapist 
form of the 14 Behavioral Adequacy scales was a 10- 
point rating scale. Raters were asked to rate “the 
present psychological adjustment” of the client. The 
scale had anchors throughout. At Point 1 the anchor 
was: “Most extreme maladjustment”; at Point 10 the 
anchor was: “Optimal adjustment (fully functioning, 
optimal maturity).” 

2. Rating of Liking. Also attached to the 14 Be- 
havioral Adequacy scales was a 10-point scale for 
rating of the extent to which the therapist liked the 
client. The scale ranged from 1, “I dislike him very 
much” to 10, “I like him very much.” Verbal anchors 
were given for each point. 

3. Posttherapy Questionnaire. With slight changes 
in wording, this form matched the client posttherapy 
questionnaire. A typical item was: 


In general how satisfied are you with what ther- 
apy has accomplished for this client? (not satis- 
fied, somewhat satisfied, pretty well satisfied, very 
well satisfied) 


Once again, a composite score was derived from these 
questions since the responses were highly correlated. 
In addition, the therapist was asked to rate the out- 
come of therapy using the nine-point success rating 
scale from several previous studies (Rogers & Dy- 
mond, 1954). 

Diagnosticians: 

1. Sentence Completion Test. This test was adapted 
from one devised and used by Thelen and his
co-workers (Thelen, Stock, Ben-Zeev, Gradolph,
Gradolph, & Hill, 1954). Originally it had been planned
to have the 14 Behavioral Adequacy scales applied
also to the SCT protocols. In pretesting, however,
the obtained interjudge reliabilities were extremely
low; consequently it was decided to utilize the origi- 
nal Thelen-Stock variables. Ultimately we included 
only those variables which had substantial inter- 
judge reliability: these variables refer mainly to in- 
terpersonal attitudes of Fight, Flight, Dependency, 
Pairing, and Work. The subject may “accept” an 
item such as “When Hal felt angry with the group, 
he..." by completing with a response such as 
"came out and told them." The score would be 1 
point for the variable “Accept fight.” Thirteen such 
variables were included in our analysis. 

2. Ratings of Liking for Client. These scales were 
identical in form with that used by the therapists. 
The diagnosticians (TAT and interview) rated on 
the basis of their reaction to the protocol. These 
ratings were included so that we might judge the in- 
fluence of the rater’s overall affective response upon 
his 14 Behavioral Adequacy ratings. 

Psychometric considerations: 

1. Ordering of Scale Points. Diagnosticians were 
given the full scales. Since the ordering was explicit, 
we may assume there were no special problems about 
the ratings or about change scores obtained by sub- 
tracting pretherapy scores from posttherapy scores. 

The clients and the therapists were given each scale 
in the form of a set of 11 to 19 statements. The 
several statements for each modality were consecu- 
tive but the blocks were randomized so that there 
was no systematic progression (e.g., from low to 
high) in the total listing for each of the scales. 
Hence the accuracy of the scale positioning of each 
statement is of importance. For any one modality, 
there would be little difficulty: the several successive 
statements rather obviously progress from low to 
moderate adequacy. However, the matter of corre- 
spondence between modalities presents possible diffi- 
culties: e.g., does the statement for Point 3 for Mo- 
dality a represent the same degree of inadequacy as 
the statement for 3 for Modality b? This corre- 
spondence between modalities was reviewed by the 
group of three investigators who developed the scales. 
It was also checked by having independent judges 
scale the statements in terms of adequacy. While the 
scale positions determined from these latter judg- 
ments differed in some instances from those we had 
established originally, the fundamental ordering was 
supported. The occasional discrepancy did not ap- 
pear to justify departing from the basic a priori 
structure. It will be noted that our focus was on 
intraindividual comparisons between pre- and post- 
scores, not on interindividual comparisons which 
would more frequently be across modalities. 

2. Reliability of the Scales. For a group of 30 stu- 
dents tested twice with an interval over 2 weeks, the 
median of the stability coefficients for the 14 scales 
was .68. Evidence for internal consistency was ob- 
tained by testing our assumption that a subject's 
first and second choices would have equal or ad- 
jacent values. For several groups (students, clients, 
and therapists), the analyses of such correspondence 
between first and second choices yielded medians be- 
tween 75 and 85%. 

3. Ratings by Diagnosticians. It was financially 
impossible to have each TAT protocol and each in- 
terview rated by two diagnosticians. Hence we had 
one person rate all the TATs, pre and post, and a 
separate judge rate all the interviews. Thus while a 
particular change score might be influenced by idio- 
syncrasies of the single judge, it would not be at- 
tenuated by interjudge differences. The TAT proto- 
cols were censored to eliminate cues indicating 
whether they were pre or post and were given to the 
judge in random groups which included both pre- 
and postprotocols. It was not possible to conceal 
from the interview rater whether an interview was 
early or late. However, the early and late inter- 
views were randomized in the order in which re- 
cordings were rated, with the restriction that the 
ones for any single case were well separated in the 
sequence. 

Before each diagnostician began to rate, she studied 
the scales until she was fully familiar with them. 
Practice materials were rated by each diagnostician 
and by another judge, the ratings being compared 
and discussed to maximize consensus of the inter- 
pretation and application of the scales. 

4. Change Scores. Change scores, the basic data of 
this study, have an inherent source of unreliability, 
above and beyond those sources associated with the 
level or status scores at pre- and posttherapy. The 
reliability of a change or difference score is equal to 
the mean of the reliabilities of the two compared 
scores when these scores are not correlated with each 
other. The higher the correlation between these scores, 
the lower the reliability of the change score (cf. 
Gulliksen, 1950, pp. 351-355). In a study where the 
two scores are obtained from the same person at 
different times, some degree of correlation can be 
expected. While no attempt was made to estimate 
the reliability of our change variables directly, it 
must be emphasized that these reliabilities are un- 
doubtedly no better than fair. 
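
The dependence of change-score reliability on the pre-post correlation can be made concrete with a small sketch. It uses the familiar textbook expression for the reliability of a difference score and assumes equal pretherapy and posttherapy variances; the function name and the illustrative values are hypothetical and are not taken from the present data.

```python
def change_score_reliability(r_pre: float, r_post: float, r_prepost: float) -> float:
    """Reliability of (post - pre), ASSUMING equal pre and post variances.

    r_pre, r_post : reliabilities of the two level scores
    r_prepost     : correlation between the two level scores
    """
    if not -1.0 < r_prepost < 1.0:
        raise ValueError("r_prepost must lie strictly between -1 and 1")
    mean_reliability = (r_pre + r_post) / 2.0
    return (mean_reliability - r_prepost) / (1.0 - r_prepost)


if __name__ == "__main__":
    # Hypothetical level-score reliabilities of .80; vary the pre-post correlation.
    for r_xy in (0.0, 0.3, 0.5):
        print(r_xy, round(change_score_reliability(0.80, 0.80, r_xy), 3))
```

With uncorrelated scores the result is simply the mean of the two reliabilities; as the pre-post correlation rises, the value falls, which is the point made above.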

The analysis of change scores depends more heavily 
upon the validity of the assumption of equal inter- 
vals than do some other operations. Consider a vari- 
able which may appear to be an equal-interval scale 
but which actually is stretched out at one end (e.g., 
raw reaction time). While such a condition may have 
little effect on a product-moment correlation between 
two such variables, it will bias change scores: an in- 
terval of two steps at one end of the scale may not 
be equal to a two-step interval at the other. 
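
A toy illustration of this bias, under assumptions that are ours alone: suppose the raw score is an exponential stretch of an underlying equal-interval scale (the `observed` function below is hypothetical). Two clients who move the same distance on the underlying scale then show very different raw change scores.

```python
import math


def observed(latent: float) -> float:
    """Hypothetical raw score: an exponential stretch of the latent scale."""
    return math.exp(latent)


def raw_change(pre_latent: float, post_latent: float) -> float:
    """Raw change score: observed posttherapy minus observed pretherapy."""
    return observed(post_latent) - observed(pre_latent)


# Both clients gain exactly one latent unit, at opposite ends of the scale.
low_end = raw_change(1.0, 2.0)
high_end = raw_change(4.0, 5.0)

if __name__ == "__main__":
    print("raw change at low end: ", round(low_end, 2))
    print("raw change at high end:", round(high_end, 2))
```

The equal latent gains are indistinguishable on the underlying scale but differ by a large factor in raw units, so any analysis of raw change scores inherits the shape of the scale.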

Another problem arises from the fact that a change 
score is correlated with the two level scores from 
which it is derived. In our case, we would expect 
a positive correlation with posttherapy score but a 
negative correlation with initial score (cf. Lord, 1956, 
1958; McNemar, 1958). We have quite intentionally 
made no adjustment of change scores for differences 
in initial level. We have formulated our goal as the 
determination of the factors in raw change scores, 
regardless of the role of such components as initial 
scores. The determination of the correlates or pre- 
dictors of change is being deferred for later investi- 
gation. 

TABLE 2 
CHARACTERISTICS OF THE SAMPLE 

Characteristic           Category      Value
Student or community     Student       54
                         Community     39
Age                      Range         17-52
                         Mean          28.8
                         Median        27
Sex                      Male          52
                         Female        41
Marital status           Married       36
                         Single        50
                         Divorced      2
                         Separated     5
Length of therapy*       Range         3-181
                         Median        26

* Number of interviews. 


There are substantive reasons for not correcting 
change scores for initial level. Insofar as the objec- 
tives of both therapist and client are to make the 
client more adequate in the areas of his greatest 
relative deficiencies, both are concerned, in effect, 
with raising his low (unfavorable) scores and have 
little or no concern with raising his moderate or high 
(favorable) scores. Therefore partialing out initial 
scores may also partial out the very effects that are 
sought in the study. 
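
The correlation of a raw change score with its two component level scores, noted earlier, follows directly from the definition of the change score. The sketch below computes both correlations from the two standard deviations and the pre-post correlation; the function name and the example values are illustrative assumptions, not estimates from this sample.

```python
import math


def change_level_correlations(sd_pre: float, sd_post: float, r_prepost: float):
    """Return (corr(change, pre), corr(change, post)) for change = post - pre."""
    var_change = sd_pre ** 2 + sd_post ** 2 - 2.0 * r_prepost * sd_pre * sd_post
    if var_change <= 0.0:
        raise ValueError("change score has no variance")
    sd_change = math.sqrt(var_change)
    # Cov(post - pre, pre) = r * sd_pre * sd_post - sd_pre**2, divided by sd_pre * sd_change
    with_pre = (r_prepost * sd_post - sd_pre) / sd_change
    # Cov(post - pre, post) = sd_post**2 - r * sd_pre * sd_post, divided by sd_post * sd_change
    with_post = (sd_post - r_prepost * sd_pre) / sd_change
    return with_pre, with_post


if __name__ == "__main__":
    # Hypothetical case: equal variances and a pre-post correlation of .50.
    print(change_level_correlations(1.0, 1.0, 0.5))
```

With equal variances the two correlations are equal in size and opposite in sign, negative with the initial score and positive with the posttherapy score, which is the pattern described above.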


Subjects 


The sample for this study is composed of 93 sub- 
jects. A summary of certain characteristics of the 
sample is given in Table 2. The subjects represent a 
major portion of the applicants seeking psychother- 
apy at the University of Chicago Counseling Center 
between November 1956, and March 1958. The Coun- 
seling Center provides client-centered therapeutic 
service to students of the university and to residents 
of the Chicago area, on a mainly self-referral basis. 

Only two restrictions were placed upon subject 
participation in the present study: he must have had 
not more than six interviews of formal psychother- 
apy, of any kind, within the 3 years prior to his 
present application for therapy; and he must be 
willing to participate in research. The restrictions on 
participation introduced no significant selection bias, 
since approximately 95% of those applying to the 
Counseling Center for therapy during the project in- 
take period met these criteria, and were thus eligible 
as subjects in this study. 

A total of 106 subjects was accepted into the proj- 
ect and given the battery of pretherapy tests. Of this 
total, 12 were attrition cases: 1 subject did not ap- 
pear for therapy after completing the pretherapy 
testing; 3 subjects terminated with only one inter- 
view; 8 subjects had more than one interview but 
did not complete the battery of posttherapy tests. 
The range in number of interviews for the attrition 
group was 0-13, inclusive. The sample was reduced to 
a final number of 93 complete cases by the death 
of one subject during the course of psychotherapy. 
(To complete the collection of data, 6 of these 93 
cases were posttested after at least 71 interviews 
before they terminated therapy.) 

With respect to social status, the sample may be 
characterized as predominantly middle class, a rather 
large proportion of the subjects occupying white- 
collar and professional positions. A few subjects oc- 
cupied very high status positions, senior executive 
or managerial roles. Another few subjects occupied 
skilled and unskilled labor positions. 

In terms of presenting problems, or clinical charac- 
teristics, the present sample appears to be typical of 
the population of clinical subjects described for the 
Counseling Center by Rogers and Dymond (1954). 


Therapists 


Of the 30 therapists, all trained in client-centered 
therapy, 22 were males, 8 were females. Therapists 
ranged in experience from 1 year through 24 years. 
The 5 most experienced therapists (more than 5 
years of experience) saw 18 cases; the 14 least ex- 
perienced therapists (first year interns with at least 
6 months of training beyond the practicum level) 
saw 36 cases. The 11 therapists with intermediate 
experience saw 39 cases. The range of cases seen per 
therapist was 1-8, with a median of 3. 


Procedure 


For each of the 79 measures taken at pretherapy 
and again at posttherapy, the pretherapy score was 
subtracted from the posttherapy score, yielding a 
difference or change score. In addition, there were 
the three scores from the clients’ and the therapists’ 
posttherapy evaluation of change, obtained through 
questionnaires, and two measures of length of ther- 
apy (the log of the number of interviews and the 
log of the number of weeks between first and last 
interviews). 

Each of these 84 change measures was transformed 
to standard scores which were analyzed on UNIVAC 
by a program yielding eigenvectors of principal axes 
directly from the scores. Extraction was stopped 
after the sixteenth factor, the last several factors ex- 
tracting about the same amount of variance (2%) 
from the data. These factors were then rotated on 
ILLIAC by a varimax program, yielding an or- 
thogonal simple structure.3 
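
For readers who wish to see what an orthogonal varimax rotation does to a loading matrix, a minimal sketch follows. The ILLIAC program itself is not described here, so this is only an assumption-laden stand-in: it applies raw varimax (no row normalization) by successive planar rotations of factor pairs, and the function name `varimax` and the small demonstration matrix are hypothetical.

```python
import math


def varimax(loadings, max_sweeps=50, tol=1e-10):
    """Raw varimax by successive planar rotations of pairs of factors.

    loadings : list of rows, one row of factor loadings per variable.
    Returns a new rotated loading matrix; the input is not modified.
    """
    a = [list(row) for row in loadings]
    n = len(a)          # number of variables
    k = len(a[0])       # number of factors
    for _ in range(max_sweeps):
        largest_angle = 0.0
        for p in range(k - 1):
            for q in range(p + 1, k):
                u = [row[p] ** 2 - row[q] ** 2 for row in a]
                v = [2.0 * row[p] * row[q] for row in a]
                sum_u, sum_v = sum(u), sum(v)
                c_term = sum(ui * ui - vi * vi for ui, vi in zip(u, v))
                d_term = 2.0 * sum(ui * vi for ui, vi in zip(u, v))
                num = d_term - 2.0 * sum_u * sum_v / n
                den = c_term - (sum_u ** 2 - sum_v ** 2) / n
                # atan2 selects the angle that maximizes the criterion.
                phi = math.atan2(num, den) / 4.0
                cos_phi, sin_phi = math.cos(phi), math.sin(phi)
                for row in a:
                    x, y = row[p], row[q]
                    row[p] = x * cos_phi + y * sin_phi
                    row[q] = -x * sin_phi + y * cos_phi
                largest_angle = max(largest_angle, abs(phi))
        if largest_angle < tol:   # no pair needed further rotation
            break
    return a


if __name__ == "__main__":
    # Hypothetical two-factor simple structure, deliberately rotated by 30 degrees.
    theta = math.radians(30)
    c, s = math.cos(theta), math.sin(theta)
    simple = [[0.9, 0.0], [0.8, 0.0], [0.0, 0.7], [0.0, 0.6]]
    mixed = [[x * c - y * s, x * s + y * c] for x, y in simple]
    for row in varimax(mixed):
        print([round(value, 3) for value in row])
```

Because the rotation is orthogonal, each variable's communality is unchanged; only the distribution of its variance across factors is altered.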

The rotations yielded a relatively good simple 
structure. Examination of the rotated factor matrix 
with the aid of representative plots of pairs of fac- 
tors supported the judgment that the orthogonal 
structure provided a satisfactory solution. While 
small moves on some factors might have been car- 
ried out to produce a somewhat tighter but slightly 
oblique structure, the interpretations of the factors 
would not have been changed. (Such rotations would 
have produced small correlations between Factors 1 
and 3, and between Factors 1 and 4.) At a later time, 
the same procedures were carried out on UNIVAC 
for the 79 pretherapy scores. Once again, 16 factors 
were extracted and rotated. Subsequent to these 
analyses, the ratings of the first and last interviews 
by a diagnostician were completed. These ratings on 
the 14 scales and on Liking were then added to the 
other scores and two correlation matrices were com- 
puted, one for the 94 pretherapy measures and one 
for the 99 change measures. 

3 We are indebted to Henry Kaiser and J. H. 
Snyder of the University of Illinois for their assist- 
ance in carrying out this analysis. 


RESULTS 


Table 3 shows the rotated factor matrix for 
change variables. Only the first five factors 
are reported here as being clearly interpret- 
able. 


TABLE 3 
ROTATED FACTOR MATRIX 

                                                   Factors and loadings
Source and variable                                   1    2    3    4    5

Client self-ratings
  Contentedness with Self (Impulses, Emotions)       51  -01   34   03
  Contentedness with Relationships to Others         45   03   24  -20
  Evaluation of Own Capacity                         25   03   30  -16
  Evaluation of Own Behavior                        -12  -02   03   18
  Evaluations of Others                              28   05  -01  -08
  Quality of Anticipations                           21  -09   12  -01
  Choice Process for Courses of Action               68  -07   00   00
  Goals-Means Consideration in Courses of Action     40  -03  -05  -09
  Energy Deployment over Courses of Action           21  -06   10  -09
  Personal and Interpersonal Considerations          45  -01   06   03
  Interpretation as to Source of Felt Disturbance    06   00  -06  -04
  Differentiation of Own Feelings, Tensions, etc.    36  -06   02  -18
  Perception of Other People                         20  -12  -01   14
  Quantity and Range of Emotional Response           34  -06   12  -03
  Q sort adjustment score                            66  -07   22   01
  Social attitudes: Ethnocentrism                   -05  -16  -06   06
  Social attitudes: Traditional Family Ideology     -10  -11  -11  -17
  Social attitudes: Religious Conservatism           20  -13  -07  -19
  Client posttherapy questionnaire: Composite        26   17   32   02
  MMPI: ?                                           -15  -06  -03   02
  MMPI: L                                           -25   03   12   40
  MMPI: F                                            36  -10   00   44
  MMPI: K                                            65   07   07  -37
  MMPI: Hs                                           09  -15   02   84
  MMPI: D                                            69  -09   20   35
  MMPI: Hy                                           23  -12   01   79
  MMPI: Pd                                           10   14   14   37
  MMPI: Pa                                           16   05  -03   11
  MMPI: Pt                                           52   04   24   40
  MMPI: Sc                                            4   07   03    4
  MMPI: Ma                                          -18   12  -17  -01
  MMPI: Si                                           74  -04   21   04
  MMPI: Es                                           61   02   10   19
  MMPI: MA                                           82   08   22   13
  MMPI: SD                                            8   03   18   11
  Log weeks in therapy                              -06  -03  -03  -03
  Log number of interviews                          -13  -04   16   03

Therapist ratings
  Contentedness with Self (Impulses, Emotions)       19   04   66  -08
  Contentedness with Relationships with Others       21   02   76   08
  Evaluation of Own Capacity                         33   16   64   04
  Evaluation of Own Behavior                         11  -13   47  -06
  Evaluations of Others                              06  -07   65  -05
  Quality of Anticipations                           03  -03   76   12
  Choice Process for Courses of Action              -14  -10   71   04  -06
  Goals-Means Consideration in Courses of Action     05   05   75   02   03
  Energy Deployment over Courses of Action           11   02   59  -12   02
  Personal and Interpersonal Considerations          13   33   47   02  -11
  Interpretation as to Source of Felt Disturbance    02   01   49  -14   23
  Differentiation of Own Feelings, Tensions, etc.   -02   01   56   16  -02
  Perception of Other People                         16   00   44   02   00
  Quantity and Range of Emotional Response           03   00   32  -09  -02
  Liking                                            -10  -09   32  -01   23
  Integration                                        15   06   71   07   22
  Posttherapy questionnaire: Composite               12   16   64   11   46
  Success                                            12   20   65   00   42

TAT ratings
  Contentedness with Self (Impulses, Emotions)       15   82   05   02   03
  Contentedness with Relationships to Others         04   80  -02  -05   07
  Evaluation of Own Capacity                         06   71   06   02   08
  Evaluation of Own Behavior                        -08   86   00  -01  -07
  Evaluations of Others                             -02   76  -04  -07  -19
  Quality of Anticipations                           00   83  -01   06   20
  Choice Process for Courses of Action               06   78   12  -06  -09
  Goals-Means Consideration in Courses of Action     12   82   03  -03  -03
  Energy Deployment over Courses of Action           03   27   10   06  -14
  Personal and Interpersonal Considerations          09   71   02  -01   02
  Interpretation as to Source of Felt Disturbance    02   65   02  -10  -02
  Differentiation of Own Feelings, Tensions, etc.   -07   77   01  -06   04
  Perception of Other People                        -18   69   00  -01  -10
  Quantity and Range of Emotional Response          -07   74   05   13   09
  Liking                                            -02   77  -03  -03   00

Sentence completion variables
  Accept: Fight                                     -04  -02  -02  -08   23
  Accept: Dependency                                -10  -08  -02  -14  -15
  Accept: Pairing                                    10   15  -02  -09   21
  Accept: Work                                       16  -09   05  -05   06
  Overt fight                                       -17   03   04  -21  -06
  Overt flight                                       02  -21   16  -05   01
  Overt dependency                                   09   10  -01   02  -01
  Overt pairing                                     -01   03   04  -14  -12
  Overt work                                         19   04   15   00  -03
  Overt emotionality                                -07   00  -15  -15   18
  Total covert                                      -21   17   04   27  -09
  Total fight                                       -12  -08  -03   00   03
  Ideation                                          -18  -09  -04  -10   25

Note. Decimal points and plus signs omitted. 
The change scores were obtained by subtracting the pretherapy 
scores from the posttherapy scores. However, the signs for 
all MMPI variables except K, Es, and SD were reflected because 
the majority of clients had lower numerical scores after therapy 
on these variables. 


Major Change Factors 


The first factor reflects the change in the 
client's perception of his adjustment, the de- 
gree to which he moves, in his reports, from 
feeling upset and in difficulties to feeling com- 
fortable and adjusted. Most of the high load- 
ings are for MMPI scales: Social Desirability, 
Ego Strength, and K scores go up; Manifest 
Anxiety, Social Introversion, and D go down. 
Several of the 14 scales as reported by the 
client also go up, as does his Q sort adjust- 
ment score. This factor may be tentatively 
labeled Change in Favorableness of Client 
Self-Evaluation. 

The second factor is clearly Change in TAT 
Adequacy. The change scores derived from 
the TAT diagnostician's ratings on the 14 
scales and on Liking have, with one exception, 
loadings between .65 and .86 on this factor. 
No loading for other measures exceeds .33; 
the great majority are below .10. 

On the third factor, all of the high loadings 
are derived from the several judgments made 
by the therapist, each of these variables hav- 
ing a loading of at least .32 and the majority 
being above .50. The only other appreciable 
loadings are for the composite index from the 
client's posttherapy questionnaire and the 
client's self-ratings on Scales 1 and 3, these 
three being in the low 30s. This factor obvi- 
ously represents the Therapist’s Perception of 
Change. 

Factor 4 is largely a doublet involving the 
Hs and Hy scales from the MMPI but in- 
cluding loadings in the 30s and 40s for other 
MMPI scales: F, Sc, L, Pt, K, Pd, and D. 
Change in reported symptomatology, espe- 
cially somatic, seems to be the core of this 
Hs-Hy Factor. 

The fifth factor is primarily a doublet char- 
acterizing Length of Therapy. No other vari- 
able has a loading greater than .24 with the 
exception of the two scores based on the 
therapist's posttherapy evaluation of change 
or success (cf. Cartwright, Robertson, Fiske, 
& Kirtner, 1961). 


Remaining Factors 


The first five change factors are strong in 
terms of amount of variance accounted for 
and appear to be clear-cut as to meaning; 
four of them also resemble in form or con- 
tent the factors found in previous studies. The 
remaining factors are more difficult to evalu- 
ate. On the one hand, each accounts for an 
appreciable part of the total variance (in the 
order of 2% each). On the other hand, there 
are a number of reasons for questioning their 
significance. 

1. Since the basic data, change scores, pre- 
sumably have relatively low reliability, one 
must guard against overinterpretation. 

2. While the first four factors are also fac- 
tors in the initial scores, no other shows any 
close correspondence to any initial factors. 

3. For each of the first four factors, the 
mean change for each variable loaded highly 
on the factor is generally considerable and the 
changes are toward the optimal ends of their 
scales. For the last 11 factors, the mean 
changes are typically smaller, often being 
close to zero for variables loaded only on the 
given factor. In such cases, the changes are 
more bidirectional than unidirectional. While 
the possibility exists that meaningful changes 
on one variable may occur in both directions 
in different clients, the pattern approaches 
that expected on retest without intervening 
effects. 

4. For the first five factors, the variables 
with major loadings intercorrelate substan- 
tially (all r's but one are above .21, the value 
required for significance at the .05 level). For 
the last 11, the corresponding intercorrela- 
tions never exceed .45 and frequently are less 
than .21. The median intercorrelation among 
the variables for each of the latter factors is 
typically below .21. (These correlations may 
of course be depressed by the dissimilarity of 
their loadings on other factors.) 

5. Most of the latter 11 factors have a pat- 
tern characterized by high loadings for one or 
at most two variables, all other loadings be- 
ing moderate or low. It appears likely that 
this peaking results in part from the extrac- 
tion of factors by a method equivalent to 
factoring a correlation matrix with unity in 
the diagonal cells. 

6. Separate intensive analyses of Factors § 
and 12 failed to support the hypothesis that 
each of these factors emerged because a few 
clients made substantial gains on the relevant 
variables while the great majority did not. 


Interview-Diagnostician Ratings 


As mentioned above, the ratings by the 
diagnostician who listened to the first and last 
interviews are included in the intercorrela- 
tions but not in the factor analyses completed 
at an earlier date. Examination of the inter- 
correlations makes it quite evident that the 
interview-diagnostician's ratings would have 
formed a separate factor if they had been in- 
cluded in the analysis of change scores. The 
15 change measures derived from these rat- 
ings have consistent positive correlations with 
each other, with a mode in the 30s. The in- 
tercorrelations between these change measures 
and those from other sources are systemati- 
cally near zero, with but occasional excep- 
tions: a couple of other variables might have 
small loadings on this conjectured factor. It 
is also quite clear that this cluster would be 
entirely independent of Factors 4 and 5 (Hs- 
Hy and Length). 


Pretherapy Factors 


The factor analysis of the pretherapy scores 
yielded a series of factors, the first four cor- 
responding almost exactly with those found 
for the change scores, but in different order: 
TAT Diagnostician's Perception of Adequacy, 
Favorableness of Self-Evaluation, Therapist's 
Perception of Adequacy, and Hs-Hy. The 
remaining factors tended to be associated 
largely with technique or instrument: there 
were two from the Sentence Completion and 
one from the three Social Attitude scores. 
The others typically had one loading of .6 
or .7 and several between .3 and .5. None of 
these latter 12 factors closely corresponded 
to any of the factors in the change analysis. 


DISCUSSION 


Results of this study are highly consistent 
with previous findings. Change factors tend to 
be associated with point of view or role (e.g., 
Cartwright & Roth, 1957), with type of data 
(e.g., Gibson et al., 1955), or combinations 
of these (e.g., Nichols & Beck, 1960). With 
the possible exception of the Gibson et al. re- 
sults, the ratings from any one person other 
than the client load on only one factor, and 
typically on a factor specific to that source. 
On the other hand, in at least two studies 
(Nichols & Beck and the present one), the 
client scores yield more than one factor. 
Table 4 shows suggested similarities, where 
possible, of present factors with those of 
previous studies. 

Our study yielded no clear discriminant 
validation of our 14 scales (Campbell & 
Fiske, 1959). Even when the same variable 
was being assessed by different persons, there 
were only minimal tendencies for the correla- 
tions for the same variable to be high com- 
pared to those between different variables. 
While this finding may indicate deficiencies 
in these scales, the overall finding of these 
four studies is the predominance of method 
factors over substantive factors drawn from 
more than one method. 

It must be noted that all four studies in- 
volved clients in university counseling cen- 
ters. The mean number of interviews was 
roughly 10-40 in the several groups. Al- 
though method factors predominated in these 
four analyses, it seemed possible that a some- 
what different picture would emerge for a 
group of subjects who had undergone much 
longer therapeutic experiences. 

We took a first step toward investigating 
the possibility that the findings might be dif- 
ferent for a group with substantial therapy. 
We isolated a subgroup of 28 cases who com- 
pleted a course of therapy with 37 or more in- 
terviews. We selected two variables providing 
relatively pure representations of each of the 
first four factors and of the Interview-Diag- 
nostician cluster. A majority of the possible 
intercorrelations between these variables were 
computed. The results were quite comparable 
with those for the entire sample: there was 
no evidence suggesting that change variables 
might be more highly intercorrelated for a 
population undergoing a considerably longer 
course of therapy. 


TABLE 4 

SUGGESTED MATCHING OF PRESENT FACTORS WITH 
THOSE OF PREVIOUS STUDIES 

Factor 

Present 
Gibson, Snyder, and Ray (1955) 
Cartwright and Roth (1957) 
Nichols and Beck (1960) 

The phrase “method factor” has been used 
above to indicate that several of the factors 
tend to be associated with specific methods 
of measurement. In these four studies, each 
method utilizes a particular body of data, 
such as a kind of protocol, and an observer- 
respondent playing a special role. Thus the 
TAT-diagnostician worked from the tran- 
scribed TAT protocols and was a disinter- 
ested party. The interview-diagnostician lis- 
tened to selected interviews and was also dis- 
interested. The therapist heard all interviews 
and saw the client during them, and was in 
the role of helper to the client. The client had 
all these data, plus all of his other experi- 
ences, outside the therapy and testing rooms, 
but was a deeply involved participant. We 
believe that these differences in role and in 
available data are responsible for the essen- 
tial independence of the change factors. We 
also believe that each factor has some degree 
of validity: each represents some aspect of 
the changes that occurred during the course 
of therapy. The preponderant influence of 
method, however, makes impossible at this 
time any confident delineation of substantive 
dimensions of change, with the possible excep- 
tion of that suggested by the Hs-Hy factor. 

It has been suggested to us that all of the 
changes we analyzed depend on client state- 
ments which in turn stem from his perception 
of himself. Such an interpretation would re- 
quire some communality among the diverse 
measures of change and therefore among the 
several factors. The evidence is to the con- 
trary. The intercorrelations among the change 
scores are typically near zero: they do not 
yield a positive manifold. The factors are 
orthogonal and their plots indicate that an 
oblique solution would produce only minimal 
correlations between factors. Specifically, it 
should be recalled that the Hs-Hy factor is 
clearly differentiated from the factor indi- 
cating Change in Favorableness of Client 
Self-Evaluation even though the latter has 
high loadings for several other MMPI scales. 

The interpretation suggested to us does call 
attention to the fact that all of our measures 
were derived from the psychotherapeutic in- 
terviews or from the testing situation. How 
promising are other sources of data and other 
observer-respondents? Such factual data as 
changes in the client’s job or job responsibili- 
ties and changes in avocational and recrea- 
tional activities could be analyzed although 
their significance would be problematical. Pro- 
fessional evaluations could be made on the 
basis of special interviews or other diagnostic 
procedures. Lay opinions could be obtained 
from significant others or from associates less 
involved with the client. It is quite likely that 
the change measures from each of these three 
sources would yield an additional factor: 
each source would involve a separate type of 
data and a distinct viewpoint subject to po- 
tential distorting influences. Since the judg- 
ments of others before and after therapy 
would usually be made with knowledge that 
psychotherapy had taken place, they could 
provide only relative, not absolute, measures 
of change. 

The problem of obtaining adequate assess- 
ments of the effects of psychotherapy may 
have to wait further developments in the 
measurement of personality. The recent work 
of Cattell (1957) and of Cattell and Scheier 
(1961) promises substantial improvement 
through the application of prefactored objec- 
tive test measurements. In the meantime, the 
presence of method factors seriously impedes 
any attempt to determine the specific sub- 
stantive changes occurring in any given client. 

Moreover, even the application of prefac- 
tored objective test batteries would still re- 
quire concomitant use of precisely such clini- 
cal and real-life sources of evidence as the 
client himself, the therapist, clinical diag- 
nosticians, friends, and so on. The business 
of neurosis and recovery exists and is ex- 
hibited only in the contexts of the self, family, 
friends, employers, and clinical practitioners. 
We feel that our data were obtained in such 
manner as to maximize the possibility for per- 
sons in some of these various roles to agree 
about what they saw in a client's changes. 
Yet there was generally little agreement. Nor 
was there systematic disagreement, however; 
e.g., clients' ratings did not correlate nega- 
tively with diagnosticians' ratings. 


The absence of systematic agreement or 
disagreement might suggest that none of the 
conceptual variables employed in our research 
attains a degree of inherent meaningfulness 
such that persons in different roles can agree 
or disagree systematically. If this suggestion 
is correct, then it has an important implica- 
tion precisely for the lines of further develop- 
ment in the measurement of personality. Spe- 
cifically, it means that the conceptual vari- 
ables themselves must be radically revised. 
A desirable conceptual variable in the domain 
of neurosis and recovery through psycho- 
therapy would likely have at least one prin- 
cipal quality: observers in different roles can 
systematically agree or disagree about it. A 
second prerequisite would be that of inherent 
meaningfulness. All observers might easily 
agree that, following a sudden loud noise, a 
patient's galvanic skin response has changed 
in a certain direction and amount. But what 
such a change might signify would be un- 
clear. In particular, its relevance to daily 
functioning in ordinary interaction with others 
is in doubt. Yet it is precisely for such daily 
functioning that the appropriate conceptual 
variables must have inherent meaningfulness. 

What might be the nature of at least one 
such conceptual variable? It must permit sys- 
tematic agreement or disagreement between 
different classes of observers, and also have 
the same degree and nature of inherent mean- 
ingfulness to all of them. We find it impos- 
sible, on the basis of existing research, to 
think of even one such variable. The search 
for such variables would seem to be the most 
important task for research in the field of 
theory evaluation. 

On the more positive side of present inter- 
pretive possibilities, however, it appears that 
changes in psychotherapy may be viewed in 
the manner suggested by Leary and others 
(e.g., Leary & Coffey, 1955) with regard to 
diagnosis. Our Factor 1 is similar to their 
Level II—patients’ self-descriptive ratings on 
check list and MMPI; our Factor 2 is similar 
to their Level III—interpersonal themes at- 
tributed to heroes in TAT stories; our Factor 
3 is similar to their Level I—ratings of pa- 
tient’s behavior by therapist. (Another inter- 
pretation in terms of levels is made by Gibson 
et al., 1955.) 

The total picture from all the studies of 
change in psychotherapy has clear implica- 
tions for the experimenter planning a study 
to compare two types of treatment or some 
other investigation requiring a criterion for 
change associated with treatment. Such a per- 
son must choose between: a composite cri- 
terion based in some fashion on changes as- 
sessed from several points of view by differ- 
ent methods; and a set of separate criteria, 
each derived from a single instrument or 
source. No single test score, no one rater's
rating can be considered adequately repre- 
sentative of the diversity of measured changes 
accompanying psychotherapy. 

One implication for goal setting and evalua- 
tion of outcome by therapists seems war- 
ranted. It appears that there are important 
dimensions of change which are not ordi-
narily attended to by the therapist. In the
present study these dimensions are reflected 
in the viewpoints of the client and of the 
two diagnosticians. For the therapist to make 
a more complete evaluation of his case, it 
would seem necessary that he somehow gain
access to these other dimensions. 

CRITIQUE AND NOTES


THE RELATIONSHIP BETWEEN AGREEING RESPONSE 
SET AND SOCIAL DESIRABILITY 


LEONARD SOLOMON 1
Boston University 


AND EDWARD KLEIN 2 
Veterans Administration Hospital, Montrose, New York 


3 measures of response set and 5 content scales were administered to 125 
schizophrenic patients and 135 college Ss. The data indicate that the Overall 
Agreement Score which ostensibly measures an agreeing response set also con-
tains negative social desirability (SD). SD18 also appears to have a naysaying
element in it. The results also show that truly balanced measures of acquiescence 
and Social Desirability might very well be orthogonal to each other and 
support both sets of authors’ contentions that their scales are not necessarily 
associated with elements of the other response set. 


In a recent paper, Couch and Keniston (1960) 
explored the influences of an agreeing response 
set as measured by an Overall Agreement Score 
(OAS) upon a number of objective personality 
scales. Using among other methods a clinical 
analysis of their high and low sample extremes 
(Yeasayers versus Naysayers), they concluded 
that the agreeing response tendency represents a 
more basic personality syndrome. In a recent 
issue of this journal, both Taylor (1961) and 
Edwards and Walker (1961a) question the 
manner in which Couch and Keniston have inter- 
preted their results. They indicate that the OAS 
measure may be confounded with another meas- 
ure; the tendency to endorse socially undesirable 
items. Edwards and Walker (1961b) summarize 
their position by stating that, “the OAS cor- 
relations with the MMPI scales were consistent 
with both the hypothesis that scores on the OAS 
(and MMPI scales) are influenced by social 
desirability tendencies and the hypothesis that 
they are influenced by acquiescent tendencies . . . 
there was no compelling evidence in favor of 


1 This work was carried out at the Franklin D.
Roosevelt Veterans Administration Hospital, Mont-
rose, New York, as part of a program of Social
Psychological Research with chronic schizophrenics.
Herbert Spohn, Chief, Social Psychology Research
Unit, provided the atmosphere that makes this kind
of study possible. We also wish to express our grati-
tude to the IBM Watson Scientific Computing
Laboratory, New York, for training us in computer
technology and giving us free access to their facili-
ties. We would also like to thank Alba Johnston
for patiently typing the manuscript.

2 Now at Veterans Administration Hospital, West 
Haven, Connecticut, and Yale University. 
either hypothesis” (p. 180). Couch and Keniston
(1961) in a reply to the two previous
articles report a correlation of —.34, significant
at the .01 level, between OAS and SD for
Harvard sophomores. They point out that this
correlation although significant may reflect the
fact that 77% of the SD items were keyed false;
thus a negative correlation with the OAS would
be expected on the basis of the agreeing response
set operating in the SD measure. Edwards and
Walker (1961b) argue that the OAS contains
more items with socially undesirable scale values
than items with socially desirable scale values.
Thus a subject who has a tendency to give
socially desirable responses would be more likely
to disagree with items that have socially un-
desirable scale values, and therefore obtain a
low score on the OAS.

The way out of this circle of confounded rela-
tionships is further research which differentiates
between the two response sets. Clearly an
acquiescence scale which is relatively free of SD
as well as a measure of SD which is
counterbalanced for acquiescence would help
clarify the relationship of acquiescence to SD
as well as their relative influence upon content
measures. In the course of a larger study
(Solomon & Klein, 1961) which sought to com-
pare response set tendencies in samples of schizo-
phrenic and normal subjects, we developed
relatively “pure” measures of SD and acqui-
escence. The purpose of this paper is to present
some correlational and factor analytic data which
serves to provide a further test of Edwards and
Walker's (1961a) hypothesis "that scores on the
OAS (and MMPI scales) are influenced by
social desirability tendencies" (p. 174).

TABLE 1
RELIABILITIES AND INTERCORRELATIONS OF RESPONSE SETS AND BALANCED CONTENT MEASURES IN
SAMPLES OF COLLEGE STUDENTS ABOVE THE DIAGONAL AND HOSPITALIZED SCHIZOPHRENICS
BELOW THE DIAGONAL

Scale SC1 | SC2 | Pa che
Self-concept—Actual (SC1) —.34) 354 64
Self-concept—Comparative (SC2) | -—.43** —.31** 75
Paranoia (Pa) [aee | —.22* 3
Psychopathic Deviance (Pd) 22* | —26**| .30** 29
Christie F Scale —.06 231% —:03 164
Yeasaying Inconsistency (YI) 104 4 203 169
Naysaying Inconsistency (NI) 208 | —.10 -06 336
Social Desirability (SD18) —.52**| 24«| —.32** 753
Overall Agreement Score (OAS) 28**| 03 127 183
Split-half reliabilities (rxx) a 45 En 48


a All correlations corrected for attenuation by the Spearman-Brown formula.
* Significant at the .05 level.
** Significant at the .01 level.
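
The table note names the Spearman-Brown formula. As a hedged illustration only, the sketch below shows the usual step-up from a half-test correlation to a full-length reliability. The function name and the input value are assumptions made for the example; the value is not taken from Table 1.

```python
def spearman_brown(r_half: float, length_factor: float = 2.0) -> float:
    """Estimate the reliability of a test `length_factor` times as long
    as the part whose reliability (or half-test correlation) is r_half."""
    return length_factor * r_half / (1.0 + (length_factor - 1.0) * r_half)


# Hypothetical correlation between two half-scales (not from the paper).
r_half = 0.60
full_length = spearman_brown(r_half)
print(f"{full_length:.2f}")  # prints 0.75
```

With the default factor of 2 this is the split-half correction; other factors give the general prophecy formula.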


METHOD 
Subjects 


The schizophrenic sample was composed of 125 
relatively acute and testable male patients drawn 
from the privileged as well as nonprivileged wards 
of the Montrose Veterans Administration Hospital. 
A sample of 135 normal subjects was obtained from 
the City College and Westchester Community College 
evening session divisions. 


Measuring Instruments


The subjects were administered in groups an 
objective battery of personality scales totaling 235 
true-false items. Among these were five content scales 
and three measures of response set. Each content 
measure was “balanced” so as to contain an equal 
number of items keyed true and false. The measures 
of response set are the following: 

Yea and Naysaying Inconsistency. A 20-item
straight F Scale and 20 (psychological) reversals
(Christie, Havel, & Seidenberg, 1958) matched for
item content were placed throughout the test in a
fixed sequence (every fifth item) and in a counter-
balanced order. An instance in which the same
response is given to an item and its reversal (true-true or
false-false) is defined as an inconsistency, whereas
some combination of true and false to a matched
item pair represents a consistent response. A subject
can exhibit inconsistency by agreeing or disagreeing
to a pair of opposite F Scale items. Each subject’s
inconsistent responses were classified as Yeasaying
(saying true to an item and true to its opposite)
(TT), or Naysaying (FF) and a score assigned
ranging from 0 to 20. Our data indicate that the
authoritarian content is independent of the SD
response set. Table 1 shows the correlations between
the Christie F Scale and SD to be —.09 for students
and .06 for patients.3 Thus the measure of agreeing


3 This finding is consistent with the work of
Christie who found no relationship between Edwards’
39-item Social Desirability scale and Christie's
balanced F Scale in samples of Western Reserve
University medical students and Hofstra College
undergraduates.

Richard Christie, personal communication, Colum-
bia University.


response set derived from the F Scale, Yeasaying 
Inconsistency, is free of contamination from SD. 
This measure of acquiescence is also highly cor- 
related with the type of measure Couch and Keniston 
employ, the OAS. The correlations are .56 and .74 
for normals and patients, respectively. 

Overall Agreement Score (OAS). This index of 
acquiescence is derived from the technique employed 
by Couch and Keniston (1960). It is based upon the 
subject’s percentage of “true” responses to the 195 
non-F-Scale items. 
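
The scoring rules described above for Yeasaying Inconsistency, Naysaying Inconsistency, and the OAS can be sketched roughly as follows. This is a minimal illustration under stated assumptions: responses are coded as booleans (True for a "true" answer), each F Scale item arrives already paired with its reversal, and the function names and the example subject are hypothetical rather than taken from the study.

```python
from typing import Sequence, Tuple


def inconsistency_scores(pairs: Sequence[Tuple[bool, bool]]) -> Tuple[int, int]:
    """Return (YI, NI) for matched (item, reversal) responses.

    A true-true pair counts toward Yeasaying Inconsistency and a
    false-false pair toward Naysaying Inconsistency; mixed pairs are
    consistent and count toward neither score.
    """
    yi = sum(1 for item, reversal in pairs if item and reversal)
    ni = sum(1 for item, reversal in pairs if not item and not reversal)
    return yi, ni


def overall_agreement(responses: Sequence[bool]) -> float:
    """Percentage of 'true' responses over the non-F-Scale items."""
    return 100.0 * sum(responses) / len(responses)


# Hypothetical subject: 20 matched pairs and 195 non-F-Scale items.
pairs = [(True, True)] * 6 + [(False, False)] * 3 + [(True, False)] * 11
yi, ni = inconsistency_scores(pairs)
oas = overall_agreement([True] * 117 + [False] * 78)
print(yi, ni, oas)  # prints 6 3 60.0
```

Because each score is a count over the 20 pairs, YI and NI both fall in the 0 to 20 range mentioned in the text.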

Social Desirability Set. The 39-item SD scale of 
Edwards (1957) was included in the questionnaire. 
In order to achieve a balanced scale, however, our 
measure of Social Desirability was based on 18 items, 
9 keyed True and 9 keyed False.4

Content Scales. The five “balanced” content meas-
ures were as follows: Self-concept, actual (27 items)
and self-concept by social comparison (20 items),
both taken from the Tennessee Mental Health In-
ventory (Tennessee Department of Mental Health,
1956); abbreviated versions of the Paranoia (24
items) and Psychopathic Deviance (25 items) scales
from the MMPI; and Christie's F Scale (40 items).


RESULTS 


Table 1 reports the reliabilities and intercor- 
relations of response sets and balanced content 
measures in samples of college students and 
hospitalized schizophrenics. The correlations for 
students (above the diagonal) are quite similar 
to those for patients (below the diagonal). The 
correlations between OAS and
our balanced Social Desirability measure (SD18) are —.45
and —.43 for college students and schizophrenics,
respectively. These correlations are higher than 
those reported by Couch and Keniston and in- 


4 The nine items keyed True in Edwards’ 39-item
SD scale were retained as well as nine items keyed
False which did not overlap with the MMPI scales.
The correlations between Edwards’ SD and our
SD18 were +.89 and +.88 for students and patients,
respectively. These correlational findings indicate that
our SD measure purged of naysaying still measures
Social Desirability.

TABLE 2

ROTATED FACTOR LOADINGS OF BALANCED CONTENT AND
RESPONSE SET MEASURES FOR COLLEGE STUDENTS
AND HOSPITALIZED SCHIZOPHRENICS

              Factor I               Factor II
Scale     Students  Patients    Students  Patients

SC1        -06       .06         2.        E
SC2        .20       .19         .53       198
Pa         .07       .09        —.58      —.56
Pd         .07       .08        —.55      —.55
F          .33       .32         .23       .22
YI         .84       .85        —.02       .00
NI         =         =.173      —.09       =l
SD18      —.36      —.23         .78       .68
OAS        .86       .87         bear)    —.27


dicate that OAS is significantly negatively cor- 
related with social desirability for both of our 
samples. 

When our “purified” measure of the agreeing 
response set, Yeasaying Inconsistency, is cor- 
related with SD the correlations fall to —.16 
(ns) and —.22 (significant) for the normal and 
patient samples, respectively.5

High negative correlations were obtained in 
both samples between the two subscales of the 
MMPI and Social Desirability, while high posi- 
tive correlations were obtained for our two self- 
concept measures and Social Desirability. These 
findings for patients and students are consistent 
with prior research in that Social Desirability is 
positively related to a self-reported positive self-
image and the denial of symptomatology.

In order to further investigate the relation- 
ship between agreeing response set and Social 
Desirability, a factor analysis of the nine scales 
was carried out. These variables were intercor- 
related and the resulting correlational matrix 
was factor analyzed by the Thurstone complete 
centroid method. The number of factors
accepted for orthogonal rotation by the
quartimax method was determined by a pseudo-
latent-root criterion; that is, all centroids the


5 When Edwards’ 39-item SD scale is included in
the analysis, the results are as follows:

                      STUDENTS     PATIENTS
SD39 and OAS, r =     —.64         Sei
SD18 and OAS, r =     —.45         —.43
SD39 and YI,  r =     —.25         —.35
SD18 and YI,  r =     —.16 (ns)    —.22

These differences between correlations of balanced
and unbalanced measures of SD with measures of
acquiescence emphasize Taylor’s (1961) and Couch
and Keniston’s (1961) point that Edwards’ 39-item
SD confounds Social Desirability with naysaying.
These findings also indicate that as one moves from
unbalanced to balanced measures of response sets,
the sets may be truly independent of each other.
sum of whose loadings was 1.00 or more were
then rotated.6 Two factors met this criterion and
accounted for 50% and 47% of the total common 
variance in the college and patient samples, 
respectively. 
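
The retention rule described above can be sketched as follows. This is an illustrative reading, not the authors' procedure: the text says only "sum of whose loadings," and the sketch assumes the sum of squared loadings is meant, by analogy with a latent root. The loading matrix and the function name are hypothetical.

```python
from typing import List, Sequence


def retained_factors(loadings: Sequence[Sequence[float]],
                     threshold: float = 1.00) -> List[int]:
    """Return the column indices of factors to keep for rotation.

    `loadings` has one row per scale and one column per centroid factor.
    A factor is kept when its sum of squared loadings (an assumption; see
    the lead-in) reaches `threshold`.
    """
    n_factors = len(loadings[0])
    kept = []
    for j in range(n_factors):
        sum_of_squares = sum(row[j] ** 2 for row in loadings)
        if sum_of_squares >= threshold:
            kept.append(j)
    return kept


# Hypothetical unrotated loadings: four scales by three centroid factors.
example = [
    [0.8, 0.1, 0.2],
    [0.7, -0.3, 0.1],
    [0.2, 0.8, -0.2],
    [0.1, 0.7, 0.3],
]
print(retained_factors(example))  # prints [0, 1]
```

In this invented example the third factor has a sum of squared loadings of 0.18 and is dropped, so two factors would go forward to rotation.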

Table 2 reports the results for patients and
students. There is great similarity of factor
structure in the two samples with two strong
independent factors emerging. Factor I is defined
by high positive loadings on OAS and Yeasaying
Inconsistency and the other by a high loading on
Social Desirability. In both groups Social Desir-
ability is negatively loaded on the first factor,
and OAS is negatively loaded on the second
factor. Since the first factor is one defined by
high scores on agreeableness, these results would
seem to support Edwards and Walker (1961a)
and Taylor (1961) who concluded that OAS is
influenced by the set to endorse socially undesir-
able statements. The negative loading of OAS
on the second factor indicates that the Social
Desirability factor contains the tendency to dis-
agree. However, when our two purified measures
of agreeableness (YI) and Social Desirability
(SD18) are analyzed in Table 2, it appears that
agreeableness (Factor I) does have an element of
nonsocial desirability in it, but that Social Desir-
ability (Factor II) does not necessarily contain
agreeableness.7


DISCUSSION AND CONCLUSIONS


Couch and Keniston (1961) attempt to explain
the correlation of —.34 between OAS and SD
by proposing that Edwards’ measure of SD (39
items) reflects a naysaying response set since a
disproportionate number of the items (30) are
keyed false. An alternative interpretation sug-
gested by Edwards (Edwards & Walker, 1961a)
is that this correlation derives from the fact
that the OAS items have an obvious SD loading;
thus the measure of acquiescence really reflects
a negative SD set. The data derived from our
own research indicate that even when “purified”
measures of acquiescence (YI) and SD (bal-
anced SD18) are utilized we obtain a low but
statistically significant relationship between acqui-
escence and negative SD for one of our
samples (patients). Moreover, when the Couch
and Keniston measure of acquiescence (OAS) is
used these correlations swell more than twofold


6 Jacob Cohen, personal communication, New York
University.

7 When a third factor is extracted, it is defined by
a high loading (.40 and .68) on F for students and
patients, respectively, and relatively low loadings on
the MMPI subscales. SD loads .17 and .07 for our
two samples, which serves to reinforce our proposi-
tion that F content is independent of SD for our
groups.
for both samples. It would appear that Edwards
and Walker (1961b) and Taylor (1961) may be
more nearly correct when they propose that the
OAS confounds components of both acquiescence
and negative SD. The factor analysis provides
additional evidence in support of this interpreta- 
tion in that the two principal factors are acqui- 
escence and SD and the acquiescence factor loads 
highly on nonsocial desirability whereas the 
(balanced) SD factor is independent of our 
acquiescence measure (YI). 

At the same time it is important to note that 
to the extent that Couch and Keniston’s (1960) 
OAS scale does tap a pure agreeing response set 
we would expect their yeasayers who are charac- 
terized as “individuals with weak ego controls, 
who accept impulses without reservation, and who 
‘agree’ and easily respond to stimulus exerted on 
them” (p. 173) to also be high on negative SD. 
This appears to be a psychologically meaningful 
cluster. 

In summary, our data indicate that the OAS 
which ostensibly measures an agreeing response 
set also contains negative SD. SD18 also appears
to have a naysaying element in it. While we agree 
that future studies should be more careful to 
differentiate between the two response sets, there 
is a good deal of evidence which indicates that 
such response sets are deep seated, pervasive, 
and highly reliable for individuals. This being 
the case, we believe that Couch and Keniston's 
attempt to uncover the social emotional roots 
of such sets is likely to bear fruit. Our results
also indicate that truly balanced measures of 
acquiescence and Social Desirability might very 
well be orthogonal to each other and support 
both sets of authors’ contentions that their scales
are not necessarily associated with elements of 
the other response set. Perhaps future studies 
along these lines will put more emphasis on the 
utilization of carefully controlled social situa- 
tional tests in addition to the usual paper-pencil 
test. 


REFERENCES 


CHRISTIE, R., HAVEL, JOAN, & SEIDENBERG, B. Is the
F Scale irreversible? J. abnorm. soc. Psychol.,
1958, 56, 143-159.

COUCH, A., & KENISTON, K. Yeasayers and nay-
sayers: Agreeing response set as a personality vari-
able. J. abnorm. soc. Psychol., 1960, 60, 151-174.

COUCH, A., & KENISTON, K. Agreeing response set
and social desirability. J. abnorm. soc. Psychol.,
1961, 62, 175-179.

EDWARDS, A. L. The social desirability variable in
personality assessment and research. New York:
Dryden, 1957.

EDWARDS, A. L., & WALKER, J. N. A note on the
Couch and Keniston measure of agreement re-
sponse set. J. abnorm. soc. Psychol., 1961, 62,
173-174. (a)

EDWARDS, A. L., & WALKER, J. N. Social desirability
and agreement response set. J. abnorm. soc. Psy-
chol., 1961, 62, 180-183. (b)

SOLOMON, L., & KLEIN, E. A comparison of response
set tendencies in normal and schizophrenic samples.
Paper read at Eastern Psychological Association,
Philadelphia, April 1961.

TAYLOR, J. B. The “yeasayers” and social desirability:
A comment on the Couch and Keniston paper.
J. abnorm. soc. Psychol., 1961, 62, 172.

TENNESSEE DEPARTMENT OF MENTAL HEALTH.
Manual for the TDMH self-concept scale. Nash-
ville: TDMH, 1956. (Mimeo)


(Received October 9, 1961) 


CONFORMITY TO GROUP PRESSURE IN RELATION TO 
THE VALUE OF THE TASK MATERIAL 


G. M. VAUGHAN 1 AND G. L. MANGAN 2
Victoria University of Wellington, New Zealand 


This study was an investigation of the effect of varying the value content of 
a perceptual task under conditions of group pressure, when factual anchorage 
was held constant. 20 Ss, each with both high and low scores on the Allport- 
Vernon Study of Values, were exposed to a group pressure situation in which 
task materials were of both high and low values for each S. The results showed 
that group pressure is resisted when highly valued material (particularly 
economic) is utilized, and that, in general, susceptibility to influence increases
with pressure.


1 Thanks for experimental assistance are due to
Carmichael, R. Boshier, and W. Alexander.
2 Now at the University of Queensland, Australia.


The importance of task material variables as
determinants of conforming behavior in a situa-
tion of group pressure has been generally recog-
nized. One such variable, factual anchorage, has
been investigated by Festinger and Thibaut 
(1951) and by Blake, Helson, and Mouton 
(1956), with conflicting results. Festinger and 

TABLE 1
EIGHT STATEMENTS RELATING TO RELIGIOUS AND ECONOMIC VALUES

Item  Statement                                                   Negative    Exposure a
R1    The Bible should be regarded as revelation.                 f           1.5
R2    Theology must be considered a fruitful study.               fruitless   1.5
R3    It is a reasonable assumption that God created the world.   isn't       2.0
R4    The church can play a vital role in modern life.            minor       1.5
E1    Stock market reports provide useful reading.                            2.0
E2    Trade exhibitions fulfill a useful function.                useless     1.5
E3    People should invest their surplus capital profitably.      shouldn't   2.0
E4    Production efficiency is a matter of vital concern.         minor       2.0

a In seconds.

Thibaut found conformity pressure less effective
when a “discussion” (note writing) task related
to a social topic (a delinquency case study)
rather than to a factual one (football strategy).
On the other hand, Blake et al. found conformity
pressure more effective when the task material
consisted of socially anchored items from a mili-
tarism scale rather than factually anchored
items (counting metronome clicks, arithmetical
problems). A problem in both studies is that
two variables, task anchorage and task value,
are not treated independently, and it is difficult
to determine the contribution of each to the
results obtained.

There has been little investigation of task value
as an independent variable, although two studies
are suggestive. Marinho (1942) demonstrated
higher resistance to change in food preferences,
when the task material consisted of preferred
rather than nonpreferred food. Luchins and
Luchins (1961) have indicated that the subject’s
degree of expertise is negatively correlated with
conforming behavior.

The present research is directly concerned with
the effect of varying the value content of a per-
ceptual task under conditions of group pressure,
when factual anchorage is held constant. In
these circumstances, it might be expected that
resistance to group pressure will be maximized
when the task material is of high value to the
subject, and minimized when the task material
is of low value.

There were two hypotheses:


1. When the task material is highly valued by
the subject, there will be significantly fewer
yielding responses to group pressure than when
low value material is utilized.

2. There will be a positive relationship between
number of yielding responses and amount of
group pressure, the latter being measured by the
subject’s position in the response sequence.


METHOD

Material and Subjects


The experimental material consisted of six sets of
five statements, which were constructed so that their
content reflected religious, economic, aesthetic, politi-
cal, theoretical, or social values. Some of these state-
ments were based on items from the Revised Form
of the Study of Values (Allport, Vernon, & Lindzey,
1951); others were arbitrarily selected.

Eleven judges were asked to allot each of the 30
statements to one of the six value areas. Disagree-
ments between judges resulted in some statements
being omitted, and others being reworded. The
judging procedure was repeated for the second form
of the statements using a further 12 judges. On the
basis of their responses, it was possible to select
four statements in each of the six value areas which
satisfied a criterion of 90% agreement between the
second group of judges. The statements were then
typed in capital letters, mounted individually on
cards measuring 7” X 5”, and exposed by means of
an epidiascope to a further 25 judges. In this way
the exposure time necessary for each statement to
be correctly recognized by 90% of the judges was
determined. The statements in each category were
randomly allocated in item number from 1 to 4. The
four economic and four religious statements, with the
negative term utilized by the confederates, together
with the exposure times, are presented in Table 1.

Twenty male undergraduate volunteers were sub-
jects in this experiment. These were selected by the
following procedure. The Allport-Vernon Study of
Values was administered to beginning students in
psychology as part of a practical course in per-
sonality testing about 8 weeks before the present
investigation. From the 63 protocols available, 20
subjects were selected, each of whose profiles revealed
a high and low score in any two value areas. These
were mainly related to Religious and Economic
values. Subjects were allocated to the following
groups:

HIGH        LOW         GROUP   N
Religious   Economic    RE      9
Economic    Religious   ER      6
Political   Aesthetic   PA      3
Aesthetic   Political   AP      2

Mean scores for these subjects on high, low, and
remaining values were 53.8, 28.4, and 39.5, respec-
tively, with corresponding SDs of 4.2, 5.8, and 8.2.
A follow up of each subject’s intended occupation,
as indicated in enrollment cards, revealed that six
RE subjects planned to be ministers, and all nine
were active Protestants and members of student
religious bodies; two ER subjects intended to be
accountants, and two lawyers; and the three PA
subjects were studying law.

Scores on the shortened 28-item Taylor (1953)
MA scale were obtained for each subject. The mean
score for all subjects was relatively low, being 7.7
with an SD of 3.8.

The confederates were three selected undergraduate
male students. Before the experiment, these were 
fully instructed and practiced in general conduct 
during the sessions and the form of the negative 
statements they were expected to make. 


Procedure 


The three confederates and the subjects were 
instructed to report to the experimenter at a certain 
time, and all four were taken to the experimental 
room. The confederates arranged for the subject to 
sit at one end of the semicircle of chairs, which were 
located about 6 feet from the screen. 

The group was given the following instructions: 


I am going to project statements onto the
screen for short exposures and I want you to tell
me what they are. That's all you have to do. To
avoid confusion, it might be better if you wait
until I ask you for your response.


Using the epidiascope, each statement was exposed
on the screen for the predetermined period. The
order of response in each trial was rotated so that
the subject responded in each of the four positions
on two occasions.

Two trials were noncrucial in that the subject
responded first and the confederates agreed without
comment. In the remaining six trials, the following
procedure was adopted. In the trials in which the
subject responded second, the first confederate gave
the incorrect response. After the subject had re-
sponded, the remaining confederates gave the incor-
rect response. When the subject responded third
(two trials) and fourth (two trials) in the sequence,
the first confederate gave the correct response. He
was immediately assailed by the confederate(s) fol-
lowing him, but preceding the subject in the response
order. The first confederate appeared confused, then
conformed. In these trials, therefore, pressure was
exerted indirectly on the subject before he was
required to respond.

The response position of each subject in the
sequence for the first trial was randomly determined,
and abba sequences were utilized to order the items
by their value. For example, Subject 3, who showed
high religious, low economic values, was treated in
the following way:

Trial 1 2 3 4 5 6 7 8
Position in 
SEQUENCE gtApiar lameness ay 4. 172. 3 
Item 4E 1R 2R 3E 4R 1E 2E 3R 


so that on Trial 1 he responded fourth to Item 4 
economic. 

There were two experimenters. One conducted the 
experimental sessions, and like the confederates, was 
unaware of the value of the task material for the 
subject. The other, at the end of each session, 
recorded the subject's introspective account of the 
purpose and conduct of the experiment (e.g., whether 
the subject suspected collusion between the con- 
federates), and disclosed the true nature of the
experiment to the subject. 


RESULTS 


Results for subjects in all groups are shown 
in Table 2. The 20 subjects gave a total of 37 
(31%) out of a possible 120 yielding responses 
on crucial trials. Of these, 10.5 (18%) were made 
on high value statements and 26.5 (45%) on 
low value statements. This difference is highly 
significant (p= .004) by the sign test. 
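
For readers unfamiliar with the sign test named above, a minimal two-sided version is sketched here. The counts are invented for illustration and are not the study's data; the function name is likewise an assumption. Ties (subjects with equal yielding on both kinds of material) are taken to have been dropped before counting.

```python
from math import comb


def sign_test_two_sided(n_pos: int, n_neg: int) -> float:
    """Two-sided sign test p-value under H0: each sign has probability 1/2.

    n_pos and n_neg are the numbers of subjects whose difference went in
    each direction; tied subjects are assumed to be excluded already.
    """
    n = n_pos + n_neg
    k = min(n_pos, n_neg)
    # Probability of a split at least as extreme as k, in one tail.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2.0 * tail)


# Hypothetical counts: 12 subjects yielded more on low value material,
# 1 subject yielded more on high value material.
p = sign_test_two_sided(12, 1)
print(round(p, 4))  # prints 0.0034
```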

An analysis of the results in Table 2 in terms
of group differences shows that on high value
material RE subjects made 7.0 (26%) yielding
responses from 27 trials
while ER subjects made 0.5 (3%) yielding re-
sponses from 18 trials. This difference is margin-
ally significant (p = .07, two-tailed) by Fisher’s
exact test. The two groups differ little in per-
centage of yielding response on low value ma-
terial. In view of the small number of AP and
PA subjects available, little can be said of any
possible intergroup differences in these value
areas.

Results pertaining to the effects of increasing 


TABLE 2 


TOTAL NUMBER AND PERCENTAGE OF YIELDING
RESPONSES ON THREE HIGH AND THREE
LOW VALUE STATEMENTS BY SUBJECTS
IN FOUR VALUE GROUPS


         High value statements    Low value statements
Group      Total       %            Total       %

ER          0.5        3             7.0       39
RE          7.0       26            11.5       43
AP          1.0       17             2.0       34
PA          2.0       22             6.0       67


Note.—Five out of a possible 120 yielding responses were 
noncommittal, and were each allotted a score of .5 rather than 
1 (yielding) or 0 (independent). 

TABLE 3


PERCENTAGE OF YIELDING RESPONSES ON HIGH AND
LOW VALUE STATEMENTS BY SUBJECTS IN
THREE RESPONSE POSITIONS


              Position in response sequence
Statement        2       3       4         Q

High value      13       8      32       6.00*
Low value       23      50      60       9.46**

* p < .05.
** p < .01.


pressure, measured by the percentage of yielding 
responses made by all subjects in each of three 
response positions, for both high and low value 
material, can be seen in Table 3. 

Analysis of these results by Cochran's Q test 
shows a significant, positive relationship between 
number of yielding responses and position in 
response sequence, for both high value material 
(Q = 6.00, df = 2, p < .05) and low value ma-
terial (Q = 9.46, df = 2, p < .01). Inspection of
Table 3 reveals that the significance of this 
trend is largely determined by an increase in 
yielding between Response Positions 3 and 4 for 
high value material, and between Positions 2 
and 3 for low value material. The overall finding, 
that an increase in pressure is associated with 
higher yielding, is in line with that previously 
reported for low anxious subjects (Mangan, 
Quartermain, & Vaughan, 1959). Beyond this, 
MA scale scores in the present study, ranging 
from 2 to 14, did not discriminate between 
subjects on conformity. 
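
As a rough guide to the statistic used above, Cochran's Q for k related dichotomous measures can be computed as in the sketch below. The data matrix is hypothetical, and the sketch assumes each subject contributes a single yield (1) or no-yield (0) entry per response position; how the study collapsed its two trials per position and its half-point noncommittal responses is not stated, so this is not a reconstruction of the reported values.

```python
from typing import Sequence


def cochrans_q(table: Sequence[Sequence[int]]) -> float:
    """Cochran's Q for a subjects-by-conditions table of 0/1 entries.

    Q = (k - 1) * (k * sum(C_j^2) - N^2) / (k * N - sum(R_i^2)),
    where C_j are column totals, R_i row totals, and N the grand total.
    Under the null hypothesis Q is referred to chi-square with k - 1 df.
    """
    k = len(table[0])
    col_totals = [sum(row[j] for row in table) for j in range(k)]
    row_totals = [sum(row) for row in table]
    grand_total = sum(row_totals)
    denominator = k * grand_total - sum(r * r for r in row_totals)
    if denominator == 0:
        raise ValueError("Q is undefined when every row is all 0s or all 1s")
    numerator = (k - 1) * (k * sum(c * c for c in col_totals) - grand_total ** 2)
    return numerator / denominator


# Hypothetical data: six subjects, yielding (1) or not (0) in
# Response Positions 2, 3, and 4.
yielding = [
    [0, 0, 1],
    [0, 1, 1],
    [0, 0, 1],
    [0, 1, 1],
    [0, 0, 1],
    [1, 1, 1],
]
q = cochrans_q(yielding)
print(round(q, 2))  # prints 7.6
```

With three positions the statistic has 2 degrees of freedom, matching the df reported in the text.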


DISCUSSION


The overall amount of conformity shown by 
all subjects with both types of material—31%— 
agrees well with that reported in studies using a 
similar procedure (Mangan et al., 1959, 1960). 
The first hypothesis, that there will be signifi- 
cantly fewer yielding responses on high than on 
low value material, when factual anchorage is 
held constant, is supported. This applies more 
particularly to subjects with high economic 
values, who made 3% yielding responses; those 
subjects maintaining high religious values show 
more conformity (26%) to group pressure on 
high value material.

This finding of higher resistance to group pres- 
sure among ER than among RE subjects on 
valued material might be anticipated from re- 
sults previously reported on the relationships be- 
tween measures of Value, Authoritarianism, and 
Ascendance-Submission. Recent reviews by Mann
(1959) and Blake and Mouton (1961) have
summarized evidence of significant positive cor-
relations between Authoritarianism and yielding
to pressure, and between Ascendance and resist-
ance to pressure. Furthermore, positive correla-
tions have been reported between Ascendance and
Economic Value (Evans, 1952), and between 
Authoritarianism and Religious Value (Jones, 
1958). In a group pressure situation involving 
highly valued task material, therefore, it would 
be reasonable to expect subjects valuing economic 
material to be relatively ascendant and resistant 
to influence, and those valuing religious material 
to be relatively authoritarian and susceptible to 
influence. 

The second hypothesis, that there will be a
positive relationship between number of yielding
responses and amount of group pressure, is sup-
ported by the overall results shown in Table 3.
Except for ER subjects, who made only 3%
yielding responses on high value material, it is
clear that more pressure is required to induce
conforming behavior when the task material is of
high rather than low value for the subject. At
the same time, there is less conformity (23%)
with low value material under conditions of low
pressure (Response Position 2), than with high
value material under conditions of high pressure
(32%, Response Position 4). Appropriate ma-
nipulation of a crucial variable, in this case
amount of pressure, can result in higher sus-
ceptibility to influence when the task material
is of high rather than low value.

Bearing in mind the small size of the samples
used in this study, the following conclusions were
drawn:

1. Resistance to group pressure increases with
the value of the task material. This resistance is
particularly marked when the highly valued ma-
terial is of an economic rather than a religious
nature.

2. Susceptibility to influence increases with
pressure. Maximum pressure, however, is re-
quired to induce appreciable conformity when
material is highly valued.

There is need for further investigation of the
relationship between task value and conformity,
utilizing larger samples and examining the
effects of group pressure upon high-low value
combinations other than ER and RE.
MATERNAL RESPONSE TO CHILDHOOD AGGRESSION
AND SUBSEQUENT ANTI-SEMITISM


DONALD WEATHERLEY
University of Colorado


2 measures of anti-Semitism were obtained from 39 college women. Information
concerning maternal handling of their childhood aggression was obtained from
their mothers. The results showed that relatively stern maternal discipline
toward childhood aggressive behavior was associated with a relatively high
level of anti-Semitism in the women.


The role of displaced aggression in the origin
and maintenance of ethnic prejudice has been
emphasized by a number of theorists (e.g.,
Dollard, Doob, Miller, Mowrer, & Sears, 1939)
and supported in a number of empirical studies
(e.g., Berkowitz, 1959; Miller & Bugelski, 1948;
Weatherley, 1961). It is reasonable to assume
that individual differences in the readiness to dis-
place aggression onto scapegoats largely originate
in childhood, as an outgrowth of parent-child
interactions. Particularly pertinent to the scape-
goat theory is the manner in which the socializa-
tion of aggression is approached by parents.
Harsh, restrictive parental handling of childhood
aggression is likely both to generate resentment
toward authority and foster the development of
internal barriers against direct, overt expressions
of aggression. Hence the child learns to express
aggression in indirect forms (e.g., prejudice)
against substitute, “safe” objects (e.g.,
minority group members).

That harsh discipline is a factor in the etiology
of ethnic prejudice is suggested by the retro-
spective accounts of childhood obtained from
individuals characterized as authoritarian in
orientation (Adorno, Frenkel-Brunswik, Levinson,
& Sanford, 1950). Such data, however, are not
entirely convincing; the extrapunitive tendencies
associated with authoritarianism may well result
in systematic distortions in the recollection of
childhood treatment. Only two studies known to
the author have investigated child rearing in-
formation obtained directly from parents of
intolerant children (Frenkel-Brunswik, 1948;
Harris, Gough, & Martin, 1950). Their results
support the general principle that rigid, stern
discipline is related to the development of preju-
dice, but these studies bear only incidentally on
the question of parental reaction to childhood
aggression.

In the present study, data specifically concern-
ing maternal handling of childhood aggression
were obtained from mothers of college girls
and related to two measures of anti-Semitism
obtained from the girls themselves.


METHOD


Thirty-nine non-Jewish undergraduate college 
women enrolled in elementary psychology courses 
and their mothers served as subjects. 


Daughter Variables 


Two measures of anti-Semitism were obtained 
from the daughters: Levinson Anti-Semitism (A-S) 
scale scores and scores based on an adaptation of 
the semantic differential technique (Osgood, Suci, & 
Tannenbaum, 1957). Both scales were administered 
during regular class periods, the semantic differential 
approximately 5 weeks earlier than the A-S scale. 

The semantic differential consisted of four names 
which were to be rated on a list of six seven-step 
bipolar scales (e.g., good-bad). Four of these bipolar 
scales, used in the present analysis, consisted of items 
heavily loaded on the evaluative dimension identified 
by Osgood and his associates (1957). 

Two of the names rated were Jewish-sounding 
(Samuel Goldblatt and Herbert Rosen); two were 
non-Jewish (James Brooks and Kenneth Taylor). 
The names were selected on the basis of their 
relative “Jewishness” as rated by college students 
in a previous study (Weatherley, 1961). The order 
of presentation of the four names was systematically 
varied to control for possible order effects.

The sum of each subject’s ratings on the seven- 
step evaluative scales was obtained for the Jewish 
and non-Jewish sounding names. This resulted in 
scores which reflected the subject’s readiness to 
attribute negative connotations to Jewish and non- 
Jewish sounding names. 


Mother Variables 


The mothers were contacted by mail and asked 
to complete a four-item questionnaire concerning 
the way in which they reacted to their daughters’
aggression in childhood.1 Three of the items are
pertinent to this study. All were rationally con-
structed multiple-choice questions.

Two items dealt with the extent to which the 
mother permitted her daughter to express verbal 


1 A copy of this questionnaire may be obtained
from the author.


TABLE 1


CORRELATIONS BETWEEN MEASURES OF MOTHERS’
RESPONSES TO CHILDHOOD AGGRESSION AND
DAUGHTERS’ ANTI-SEMITISM MEASURES


Mother variables    Daughter variables           r

Permissiveness      A-S scale score            -.247
Permissiveness      Jewish name evaluation     -.156
Punitiveness        A-S scale score            +.256
Punitiveness        Jewish name evaluation     +.328*
Sternness           A-S scale score            +.333*
Sternness           Jewish name evaluation     +.338*

Note. N = 39.
* Significant at the .05 level of confidence.
aggression. One of these items pertained to aggres-
sion toward other children, the other to aggression
toward parents. The alternatives for each question
were weighted in terms of the degree of permissive-
ness they implied; the average value of the alterna-
tives selected by each mother on these two questions
constituted her permissiveness score.

The third item concerned the severity of punish-
ment the daughters received for expressing pro-
hibited aggression. A mother’s choice of alternatives
on this item yielded a punitiveness score. Punitiveness
and permissiveness scores were unrelated (r = .078).

Each mother’s permissiveness and punitiveness
scores were combined to produce a more general index
of disciplinary attitudes toward aggression. This was
the sternness score, obtained by subtracting each
mother’s permissiveness score from her punitiveness
score. Thus high scores on the sternness variable
indicated relatively high punitiveness combined with
relatively low permissiveness.

Ninety-eight percent of the questionnaires sent
to the mothers were completed and returned.
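
The scoring rules described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the item weights in the example are hypothetical, since the questionnaire's actual alternatives and weights are not reproduced in the report.

```python
def permissiveness_score(item_other_children, item_parents):
    """Average of the weighted alternatives chosen on the two permissiveness items."""
    return (item_other_children + item_parents) / 2


def sternness_score(punitiveness, permissiveness):
    """Sternness = punitiveness minus permissiveness, as described in the text."""
    return punitiveness - permissiveness


# Hypothetical mother: weights of 2 and 4 on the permissiveness items,
# and a weight of 4 on the punitiveness item (illustrative values only).
perm = permissiveness_score(2, 4)
stern = sternness_score(4, perm)
```

Because sternness is a simple difference, a mother scores high on it either by punishing severely, by permitting little, or both.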


RESULTS AND DISCUSSION


Relationships between mother and daughter
variables were assessed by the Pearson r. As can
be seen in Table 1, the degree of permissiveness
reported by mothers was not significantly related
to either index of anti-Semitism obtained from
the daughters, although the correlations were
negative in both cases.

Maternal punitiveness toward aggressive be-
havior was positively related to both indices of
anti-Semitism obtained from the daughters; in
the case of the daughters’ evaluation of Jewish
sounding names, the relationship was statistically
significant.

The sternness variable, derived from the
mothers’ permissiveness and punitiveness scores
combined, was significantly related to both meas-
ures of anti-Semitism. The more stern the ma-
ternal discipline toward aggression, the more
likely was the daughter to agree with unfavorable
statements about Jews on the A-S scale and to
attribute negative connotations to Jewish
sounding names.
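
A rough check on the significance markers in Table 1 can be made by converting each r to a t statistic with N - 2 = 37 degrees of freedom. The sketch assumes a two-tailed test and a tabled critical value of about 2.026 for df = 37; both are assumptions, since the report does not state how significance was assessed. Four of the coefficients from Table 1 are used.

```python
import math

N = 39
T_CRITICAL = 2.026  # assumed tabled value: two-tailed .05 level, df = 37


def t_from_r(r, n):
    """Convert a Pearson r based on n pairs to a t statistic with n - 2 df."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


coefficients = {
    ("Permissiveness", "A-S scale score"): -0.247,
    ("Permissiveness", "Jewish name evaluation"): -0.156,
    ("Punitiveness", "Jewish name evaluation"): 0.328,
    ("Sternness", "Jewish name evaluation"): 0.338,
}

# True where |t| exceeds the assumed critical value.
significant = {
    pair: abs(t_from_r(r, N)) > T_CRITICAL for pair, r in coefficients.items()
}
```

Under these assumptions the two permissiveness coefficients fall short of the critical value while the two Jewish name evaluation coefficients for punitiveness and sternness exceed it, which agrees with the pattern reported in the text.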

It should be noted that the tendency for
daughters treated relatively punitively and sternly
to make more negative evaluations of Jewish
sounding names cannot be interpreted as reflect-
ing a generalized tendency to make negative
evaluations. There was no significant relationship
found between the daughters’ evaluations of non-
Jewish sounding names and the mothers’ puni-
tiveness scores (r = -.065), sternness scores
(r = +.048), or permissiveness scores (r =
-.234).

The findings of this study lend support to the
general thesis that important roots of prejudice
lie in childhood, in the nature of the parent-child
interactions experienced. More specifically, the 
data fit neatly into a displaced aggression inter- 
pretation of prejudice. Child rearing practices 
likely to lead to the development of a strong 
readiness to displace hostility—i.e., relatively 
punitive, nonpermissive maternal reactions to 
childhood aggression—are here shown to be 
related to subsequent anti-Semitism in the chil- 
dren. These results complement other recent 
findings (Berkowitz, 1959; Weatherley, 1961) 
which have shown a relationship between an 
individual’s readiness to displace aggression and 
his level of prejudice. 

Nevertheless, the present data obviously are 
not decisive evidence for the validity of the 
scapegoat theory. It may well be, for example, 
that stern discipline toward childhood aggression 
is part of a pattern of authoritarian ideology in 
mothers. Hence it is possible that daughters of 
relatively stern mothers are more anti-Semitic 
not primarily because of the way in which they 
were disciplined for aggressive behavior, but be- 
cause in identifying with their mothers they have 
absorbed an authoritarian, anti-Semitic set of 
attitudes.
BEHAVIORAL VARIABILITY OF PROCESS AND REACTIVE
SCHIZOPHRENICS IN A BINARY GUESSING TASK


MARTIN ZLOTOWSKI 1 AND PAUL BAKAN
Michigan State University 


Process and reactive schizophrenics differ in run producing behavior when 
attempting to generate a random binary series of heads and tails. Reactive 
schizophrenics manifest more response variability than process schizophrenics. 
Evidence for this is a lower correlation for reactives between number of runs 
in the 1st and 2nd ½ of a series of 300 binary responses. It is of interest to note
that reactives are more variable than a normal undergraduate sample whereas 
process schizophrenics are not (Bakan, 1960). The process group produced 
significantly more runs than the reactive group. This difference appears due 
to a greater tendency for process schizophrenics to produce runs of 1 (simple 
alternations), a form of response stereotypy. 


1 Now at Boston University.


In recent years there has been considerable
interest in a differentiation of schizophrenia along
a process-reactive continuum. The typical process
schizophrenic is said to have an early and
insidious onset, apparently independent of pre-
cipitating stress. The psychosis appears to be a
culmination of a lifelong marginal adjustment in
which there has been a tendency to withdraw
from human contacts. The typical reactive schizo-
phrenic may be characterized as having an
abrupt, stormy onset of psychosis following a
severe stressful situation. The premorbid per-
sonality is usually described as essentially normal
or perhaps neurotic.
Demonstration of behavioral or physiological 
differences between process and reactive schizo- 
especially the massive social withdrawal of
hospitalized chronic schizophrenic patients. 
Several investigators have attempted to modify 
the apparently stable, refractory adjustment of 
chronic schizophrenics. Of special interest is the 
work of Peters and Jenkins (1954) who reported 
significant improvement in the ward behavior 
of chronic schizophrenics to whom a guided 
problem solving technique was being adminis- 
tered. Apparently, as they achieved some mastery 
in the sphere of learning, their behavior in 
other spheres also changed. Similarly, Tilton 
(1956) was able to demonstrate improvement 
in general level of functioning after motor and 
verbal learning experiences. Spohn and Wolk 
(1957) found that withdrawn, deteriorated sub- 
jects improved the level of their social par- 
ticipation in problem solving groups when the 
members were heterogeneous in level of mental 
health. Each of these studies demonstrated 
experimentally that chronic schizophrenics re- 
main responsive to the external environment 
and are capable of benefiting from several types 
of sustained experience. 

The present study is especially concerned with 
the effects of certain types of social experience 
upon social withdrawal. Specifically, it investi-
gates the hypothesis that extended experience
in group problem solving situations would tend 
to improve levels of social participation in 
socially more complex and challenging situations. 
It was further hypothesized that improvement in 
social participation would be manifest in rela-
tion to individuals other than those with whom
subjects would have extended social experience 
in the experimental situation, and it would be 
manifest in social situations other than the 
criterion group problem solving situations. 


METHOD 
Subjects 


Subjects were 32 hospitalized male chronic schizo- 
phrenic patients on closed wards, rated by ward 
psychologists using the Montrose Adjustment 
Rating Scale (Rackow, Napoli, & Klebanoff, 1953) 
as deteriorated and withdrawn. All had been hos- 
pitalized 2 years or longer. 


Procedure 


To all 32 subjects, working in four-man groups, 
eight criterion group problem solving tasks were 
administered. Each of these tasks required participa- 
tion by all for its solution. Groups were composed 
of subjects who had no prior social experience with 
one another. Four of the problem solving tasks re- 
quired little or no verbal communication. In one of 
these, for example, subjects were instructed to 
copy from a model a square figure. Each was to 
draw one side of the square using either a blue or 
a red pencil that each had been given but the 
final product was to be all of one color. A second
set of tasks, administered in the same session and
following the motor tasks, required verbal com-
munication; for example, subjects were to discuss
their preferences among desserts and to arrive at
agreement on two desserts which all of them liked.

The level of social participation in each of the
tasks was rated by four experienced clinical psy-
chologists observing subjects through a one-way
vision screen and employing a social participation
rating scale especially devised for the purpose of
this research. The scale consists of six major rating
categories: organizing (or leadership) behavior,
mediating (or independent cooperative) behavior,
accommodating behavior (passive cooperation to
the lead of another patient), blocking (passive
noncooperation), withdrawing (physical withdrawal
from the group; refuses to participate), and
finally, interfering behavior (diverts, distracts others,
or urges noncompliance). The categories had been
rated on an 11-point scale by 12 staff psychologists
according to the degree to which each reflected a
contribution to the achievement of the group objec-
tives in the type of group tasks employed in this
experiment. The median rating for each category was
assigned as its numerical weight. Ratings by the four
raters used in the present research on a separate sam-
ple of 16 subjects were correlated and yielded an
interrater reliability coefficient of .88 (Pearsonian r).

A mean social participation rating in all of the
eight criterion group tasks was obtained for each
of the 32 subjects. Mean participation ratings were
at the level of passive noncooperation and vari-
ability in the entire sample was low.

An experimental and control group of 16 sub-
jects each were then selected in such a fashion
as to be equated with respect to mean and variance
of social participation rating scores in the criterion
tasks, age, education, severity of illness, length of
hospitalization, and level of social interaction on
the ward as rated by psychiatric aides.2

Subjects in the experimental groups then par-
ticipated in three group problem solving sessions
per week for 4 consecutive weeks. Working with
the same four-man groups for the entire experience
period, subjects were instructed at each session to
solve five group problems in succession. Group
tasks had been developed for this purpose.
The tasks required motor manipulation with little
or no verbal communication. For example, one
series of tasks consisted of simple symmetrical
geometric figures that had been cut into four
isomorphic parts. Each subject in the four-man
group was given a part. A model of the figure
was provided and subjects were instructed to
reconstitute the figure by putting their individual
pieces together. The level of social participation in
these sessions was high (characteristically in the
form of “independent cooperation”) and signifi-
cantly greater than the level in the criterion tasks
on first administration.

At the end of the experience period, level of
social participation in the criterion tasks for both
experimental and control subjects (who had no
intervening social experience) was rated again.
Ratings were obtained in two post-experimental
sessions, first in a criterion tasks session with
“stranger” four-man groups (subjects who had
never participated together) and then in a second
criterion tasks session with “familiar” four-man
groups, composed of subjects who had worked
together. Finally, psychiatric aides again rated all
subjects for level of socialization on the ward.


2 A 20-item ward socialization scale developed
by the authors was used to rate all patients. The
scale yields mean interrater reliability coefficients
of .80 and split-half reliability coefficients cor-
rected for attenuation of .87.


RESULTS 


Mean social participation ratings and standard
deviations for control and experimental groups
in the pre-experience criterion session and in the
two post-experience criterion sessions are shown
in Table 1.

An analysis of variance appropriate to re-
peated measurements in two independent groups
was carried out. The F ratio (3.51, df = 2) for
the interaction of criterion sessions by treatment
group is significant between the .05 and .01 level
of confidence, suggesting that level of social par-
ticipation in the experimental group improved
to a greater extent than that of the control
group.

A further analysis by t test indicates that
improvement in level of social participation
both in the stranger and in the familiar criterion
sessions is significantly greater in the experi-
mental group, though differential improvement
between these two conditions in the experimental
group could not be demonstrated.

Mean gain scores in level of social par-
ticipation (with scores in the two postcriterion
sessions combined) were calculated for the
motor and verbal tasks separately and are
shown in Table 2. An analysis of differences by
t test indicates that the experimental group
improves to a significantly greater extent than
the control group in both the motor and verbal
group problem solving tasks. This finding mili-
tates against the interpretation that it was
primarily problem solving that was facilitated in
the experience tasks. None of the latter required
or elicited verbal interaction while a portion of
the criterion tasks on which patients improved
called for and elicited verbal exchange in the
post-experimental sessions. Finally, an analysis
of aide ratings of level of socialization on the
ward before and after the experience period
indicates that no significant changes could be
demonstrated in the experimental group.


TABLE 1


MEANS AND STANDARD DEVIATIONS OF SOCIAL
PARTICIPATION RATINGS IN THREE
CRITERION SESSIONS


                 Precriterion      Stranger         Familiar
Treatment          session         criterion        criterion
group                               session          session
                  M      SD        M      SD        M      SD
Experimental    5.03    .74      5.73   1.07      5.57    .88
Control         5.13    .56      5.05   1.15


TABLE 2


MEAN GAIN SCORES, STANDARD DEVIATIONS, AND
CRITICAL RATIOS OF SOCIAL PARTICIPATION
RATINGS IN MOTOR AND VERBAL TASKS


                      Motor tasks              Verbal tasks
Treatment group     M      SD      t         M      SD      t
Experimental       .76    1.49   2.23*      .60    1.51   2.80*
Control           -.10    1.53             -.54    1.62


* Significant at .05 level.


DISCUSSION 


It will have been noted that the magnitude 
of improvement in level of social participation 
in the experimental group was small. On the 
average, patients changed from a pattern rang-
ing from nonparticipation or total withdrawal to
a pattern of passive cooperation.

The results of this experiment do not suggest 
that the group problem solving techniques be 
employed as a form of group psychotherapy 
for inaccessible chronic schizophrenics. The 
results are regarded, however, as of theoretical 
interest. They support the view that situational 
determinants may modify social withdrawal in 
chronic schizophrenics. The social environment of 
the typical NP hospital ward does not naturally 
provide situations which facilitate social inter-
action; our results suggest that perhaps it
should.

The present experiment, of course, throws no 
light on the psychological processes modifying 
the withdrawal tendencies of patients. The isola- 
tion of such processes and their precise relation 
to withdrawal patterns in chronic schizophrenics 
is the subject of further experiments now in 
progress. 
EFFECT OF EFFORT ON BEHAVIORAL RIGIDITY IN A LUCHINS
WATER JAR TASK 


KENNETH E. KNIGHT 


Carnegie Institute of Technology 


The experiment consisted of a paper-and-pencil problem solving task to test 
if the effort required to obtain 1 solution influences the S's ability to solve 
similar problems. The conclusions are that the greater the effort required to 
develop a solution procedure: (a) the slower the change to another more 
appropriate solution procedure, (b) the greater the propensity of change 
during extinction to an alternative decision procedure that is similar to the 
learned procedure, and (c) the less the propensity to recognize and use a 
trivial decision procedure that is also feasible. The results were significant at 
the .05 level. The Ss were 46 male undergraduate students. 


In a recent article, Festinger (1961) has 
argued that persistence in learned behavior is 
affected by the effort required during acquisition 
of that behavior. Specifically, he has suggested 
that the dissonance created by effort in learning 
is resolved or reduced by the development of 
extra preference for the learned behavior. The 
hypothesis provides at least a partial explana- 
tion for one conspicuous persistence phenomenon 
in human behavior, the Einstellung effect. Under 
the Festinger hypothesis, individual variations in 
problem solving rigidity should be partially re- 
lated to the effort expended in developing the 
strategy represented by the “set.” To test the 
proposition, we consider three specific predictions 
about the behavior of human subjects in a 
standard problem solving task: 

1. The greater the effort required to develop
a solution procedure, the slower the change in 
that procedure when it is inappropriate. 

2. The greater the effort required to develop 
a solution procedure, the greater the propensity 
to change during extinction to an alternative 
decision procedure that is similar to the learned 
procedure.

3. The greater the effort required to develop a 
solution procedure, the less the propensity to 
recognize and use a trivial decision procedure 
that is also feasible. 
METHOD


The experimental task was a modification of
the Luchins (1942) water jar test. For each ex-
perimental problem, the subject was presented
with five hypothetical water jars. Two of the jars
were fixed, three were movable. Of the fixed jars,
one, “F,” was filled with water, the other, “E,”
was empty. The three movable jars were of three
different sizes and were labeled “L” (large), “M”
(medium), and “S” (small). The sizes of the
various jars varied from one problem to another.
Subjects were told to obtain a given amount of
water in one of the fixed jars. In accomplishing
this, subjects were allowed to use only full L, M,
and S jars and no more than three of any one
size on any given trial.

[...] as quickly as possible with a time limit of [...]
minutes on Trial 1 (a training trial) and 2 [...]
on each succeeding trial. The problems for
Trials 1-21 are shown in Table 1. For each problem
a solution procedure is also indicated. During the
21 trials three solution procedures were [...]:

Procedure I: take L - M + 3S from F and place
into E; Procedure II: take L + S from F and
place into E; Procedure III: subtract S from F;
Procedure IV: do nothing, the solution is already
in F.

As is indicated in Table 1, only Procedure I was
feasible on Trials 1-7. Procedures I, II, and III
were feasible on Trials 8-11. Procedures II and III
were feasible on Trials 12-15. Only Procedure III
was feasible on Trials 16-18. Procedures I, II, and
III were feasible on Trials 19-20. Procedures I, II,
and IV were feasible on Trial 21. We assume
Procedure I to be more similar to Procedure II
than to Procedure III. The procedures were not
given to the subjects but had to be developed by
them in the course of solving problems.


TABLE 1

EXPERIMENTAL PROBLEMS AND FEASIBLE SOLUTIONS


              Size of water jars          Target
Trial       F     E     L     M     S    amount of   Feasible procedures*
                                           water
1 Low     1000    0    300    10     1      293      I
1 High    1000    0    371   247   125      499      I
2         1000    0    200    20     1      183      I
3         1000    0    400   180   151      673      I
4         1000    0    730   220   120      870      I
5         1000    0    620   433   261      970      I
6         1000    0    570   400   150      620      I
7         1000    0    930   650   220      940      I
8         1000    0    580   420   210      790      I, II, III
9         1000    0    680   320   160      840      I, II, III
10        1000    0    760   240   120      880      I, II, III
11        1000    0    350   335   325      675      I, II, III
12        1000    0    480   390   260      740      II, III
13        1000    0    640   358   180      820      II, III
14        1000    0    820   340    90      910      II, III
15        1000    0    530   490   235      765      II, III
16        1000    0    870   532   130      870      III
17        1000    0    720   340   185      815      III
18        1000    0    880   620   450      550      III
19        1000    0    680   320   160      840      I, II, III
20        1000    0    730   270   135      865      I, II, III
21        1000    0    680   640   320     1000      I, II, IV
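
The feasibility entries in Table 1 follow directly from the arithmetic of the four procedures, and a short sketch makes the check explicit. The function name `feasible_procedures` and the selection of rows are illustrative choices rather than part of the original report; the jar sizes and targets are copied from the table.

```python
def feasible_procedures(F, E, L, M, S, target):
    """Return the procedures whose resulting amount equals the target.

    Procedures I and II put water into the empty jar E;
    Procedures III and IV leave the target amount in the full jar F.
    """
    amounts = {
        "I": L - M + 3 * S,   # take L - M + 3S from F and place into E
        "II": L + S,          # take L + S from F and place into E
        "III": F - S,         # subtract S from F
        "IV": F,              # do nothing; the solution is already in F
    }
    return [name for name, amount in amounts.items() if amount == target]


# Selected rows of Table 1: (trial label, F, E, L, M, S, target).
problems = [
    ("1 Low", 1000, 0, 300, 10, 1, 293),
    ("8", 1000, 0, 580, 420, 210, 790),
    ("12", 1000, 0, 480, 390, 260, 740),
    ("16", 1000, 0, 870, 532, 130, 870),
    ("21", 1000, 0, 680, 640, 320, 1000),
]

results = {label: feasible_procedures(*jars) for label, *jars in problems}
```

For these rows the computed sets match the feasible procedures listed in the table: Procedure I alone on the training trial, all three on Trial 8, II and III on Trial 12, III alone on Trial 16, and I, II, and IV on Trial 21.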
Subjects were assigned randomly to two treat-
ments. The treatments differed only in the problem
given to the subjects in the first (training)
trial. In the Low Effort treatment, the problem (as
indicated in Table 1) was a simple one and could
be solved with little or no effort. In the High
Effort treatment, the problem was more difficult
and required considerable effort. In both treat-
ments, the solution procedure was the same.

Subjects were 46 male undergraduate students at
Carnegie Institute of Technology. The task was
administered in the form of a paper-and-pencil
test to groups of subjects in classroom situations
with (as nearly as possible) equal numbers of
subjects in each condition in each class. Twenty-
four subjects were given the Low Effort treatment;
22 were given the High Effort treatment.
Each subject recorded both his own solution to
each problem and the time taken to reach a
solution. The test was proctored to insure in-
dependence of response.


RESULTS 


Three predictions were made from the
Festinger hypothesis with respect to differences
between behavior of subjects in the Low Effort
and High Effort treatments.


* Procedure I: L - M + 3S; Procedure II: L + S; Procedure III: F - S; Procedure IV: F.


Prediction 1 


Subjects who acquired Procedure I under the 
Low Effort treatment will change to a new 
procedure sooner than will subjects who 
acquired Procedure I under the High Effort 
treatment. 

We test the prediction by ranking all sub- 
jects in terms of the order in which change 
from Procedure I occurred. As indicated above, 
Procedure I is required in Trials 1-7, feasible 
on Trials 8-11, and not feasible on 12-18. Under 
the null hypothesis of no difference between 
the two treatments, the mean ranking should 
be the same (23.5). In fact, the mean rank 
for the Low Effort treatment is 18.0 and the 
mean rank for the High Effort treatment is 
29.0. This difference is in the predicted direction 
and significant at the .05 level by the Wilcoxon 
two-sample test. 
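
The null value of 23.5 is simply the midpoint of the ranks 1 through 46. The sketch below reproduces it and, as a rough check only, applies the usual normal approximation to the rank sum. It assumes group sizes of 24 (Low Effort) and 22 (High Effort) as given in the report, treats the reported mean rank of 18.0 as exact, and ignores ties; the author may well have used exact tables instead, so the z value is illustrative rather than a reconstruction of the original test.

```python
import math


def expected_mean_rank(n_total):
    """Mean of the ranks 1..n_total, the expected mean rank under the null."""
    return (n_total + 1) / 2


def rank_sum_z(n1, n2, mean_rank_1):
    """Normal approximation to the Wilcoxon rank-sum statistic for group 1."""
    n_total = n1 + n2
    w = n1 * mean_rank_1                      # observed rank sum, group 1
    expected_w = n1 * (n_total + 1) / 2       # rank sum expected under the null
    sd_w = math.sqrt(n1 * n2 * (n_total + 1) / 12)
    return (w - expected_w) / sd_w


null_mean = expected_mean_rank(24 + 22)
z = rank_sum_z(24, 22, 18.0)
```

A negative z here means the Low Effort group held the lower (earlier) ranks, that is, changed from Procedure I sooner, which is the predicted direction.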


Prediction 2 


Subjects who acquired Procedure I under
the Low Effort treatment will be more likely 
to use Procedure III on Trial 15 than will 
subjects who acquired Procedure I under the 
High Effort treatment. 
Of the 24 subjects in the Low Effort treat-
ment who solved the problem on Trial 15, 4
used Procedure III. Of the 22 subjects in the
High Effort treatment who solved the problem
on Trial 15, none used Procedure III. This dif-
ference is in the predicted direction and signifi-
cant at the .01 level by the t test.


Prediction 3 


Subjects who acquired Procedure I under the 
Low Effort treatment will be more likely to 
use Procedure IV on Trial 21 than will subjects 
who acquired Procedure I under the High 
Effort treatment. 

All subjects solved the problem on Trial 21. 
Of the 24 subjects in the Low Effort treatment, 
18 used Procedure IV. Of the 22 subjects in the 
High Effort treatment, 7 used Procedure IV. 
The results are in the predicted direction and 
significant at the .05 level by the t test.


Discussion 


The results uniformly support the Festinger 
hypothesis that effort required during the 
acquisition of behavior is positively related to 
persistence in that behavior. Compared to sub- 
jects who exerted little effort in finding a solution 
procedure, subjects who discovered the same 
procedure in the course of solving a difficult 
problem were slower to change that procedure 
to alternate feasible procedures, when forced to
change were more likely to change to a pro-
cedure similar to the original one, and when
confronted with a problem for which there was
a trivial solution procedure were more likely
to use the original or similar procedure rather
than the trivial procedure.

The results suggest that the Festinger hy-
pothesis may have some significant implications
for the study of behavioral rigidity. For example,
the hypothesis provided a partial explanation
both for the persistent low correlations found
between performance on water jar tests and
other measures of personal rigidity and for the
apparent modest negative relationship between
intelligence and problem solving rigidity in a
water jar situation (Levitt, 1956). Where all
subjects are presented with the same training
problem, the amount of effort expended in
acquisition of the set response is probably more
related to intelligence than to other general
individual attributes.


BIRTH ORDER AND REACTIONS TO FRUSTRATION *


DAVID C. GLASS, MURRAY HORWITZ, IRA FIRESTONE, AND JOEL GRINKER
Research Center for Human Relations, New York University


A reanalysis of an experiment on the relation between frustration and ag-
gression, employing 108 male undergraduates from New York University.
The findings indicate that later born Ss react with greater annoyance to a
frustrating agent than 1st born Ss. These results appear to contradict those
reported in a previous study which found that 1st born males from Yale
University react more strongly than later born males to frustrating acts
by others. This discrepancy between New York and Yale college students
in the domain of hostile behavior appears to parallel a similar inconsistency
in the domain of affiliative behavior as reported in 2 recent studies. We
suggest the consideration of sociocultural factors in any effort to interpret
these differences, since we regard birth order, like birthplace, etc., as an
“ecological” variable rather than a psychological one.


This paper presents some findings on the
effects of birth order which apparently con-
tradict those reported in a recent study by
Dittes (1961). Dittes found that first born
persons react more strongly than later born
persons to what would appear to be frustrating
acts by others. A reverse relationship was found
in the present study.

The data reported here were obtained from
a reanalysis of an experiment on the relation
between frustration and aggression (Glass, 
Horwitz, Cohn, & Giniger, 1961). Groups of sub- 
jects were repeatedly frustrated by an accomplice 
of the experimenter without being informed as to 
the reasons for the frustrating acts. Under these 
conditions, subjects attributed to the frustrating 
agent an obligation to inform them of his rea- 
sons. In one condition (Nonopenness), the frus- 
trating agent stated he opposed revealing his rea- 
sons; in another condition (Openness), he stated 
that he favored revealing his reasons. The data 
showed that subjects expressed greater hostility 
to the frustrating agent in the Nonopenness 
condition than in the Openness condition. 

Table 1 presents the subjects’ responses, 
broken down according to whether the subjects 
were first born children (including only children) 
or were later born. Row A shows mean responses
to an item asking how much annoyance sub- 
jects felt toward the frustrating agent. The 
data indicate that the increase in annoyance
from Openness to Nonopenness conditions tends
to be greater for later born subjects than for
first born subjects. The interaction F approaches
statistical significance (p < .10). Subjects were
also asked to rate their impressions of the
frustrating agent on a list of favorable and
unfavorable adjectives. The interaction effect of
birth order and openness, although in the same
direction as that found for subjects’ self-ratings
of annoyance, does not approach statistical
significance.

In an attempt to determine the basis for the
differential reactions of annoyance to the frus-
trating agent, we considered whether first borns
and later borns had different perceptions of how
the frustrating agent had acted. Subjects were
asked to respond to two questions: to what
extent the frustrating agent appeared to want
to reveal or not reveal the reasons for his frus-
trating actions; and to what extent he expressed
himself as wanting to reveal or not reveal his
reasons.2 Indices of the combined responses to
these questions are given in Table 1, Row B.
The data indicate that the interaction ob-
served for annoyance is paralleled by differential
perceptions of the frustrating agent’s nonopen-
ness. The relative increase in perceived
nonopenness of the frustrating agent was greater
from Openness to Nonopenness conditions for
later borns than for first borns. The interaction
F is significant beyond the .05 level. These
results suggest that the differences in degree
of annoyance between later borns and first
borns are due to differences in how they perceive
the same objectively nonopen act. For some
undetermined reason, later born subjects in-
terpret the same act as signifying greater
motivation on the part of the frustrating agent
to be nonopen than do first borns, which cor-
responds with their differential annoyance.


TABLE 1
MEAN RESPONSES TO FRUSTRATION

                              Openness               Nonopenness
                          First      Later        First      Later
                          born       born         born       born
                         (N = 40)   (N = 18)     (N = 35)   (N = 15)

A. Self-ratings of         7.25       7.17         6.23       4.80 b
   annoyance a
B. Perceptions of         14.76      16.28         5.23       4.00 d
   nonopenness c

a The lower the score, the more the annoyance.

b Open versus Nonopen, F = 14.54, p < .005; First born
versus Later born, F = 1.62, ns; Interaction, F = 2.80,
p < .10. Within Openness: First born versus Later born,
t = .27, ns; Within Nonopenness: First born versus Later
born, t = 2.41, p < .05.

c The lower the score, the greater the degree of perceived
nonopenness.

d Open versus Nonopen, F = 371.47, p < .001; First born
versus Later born, F = .23, ns; Interaction, F = 5.54, p < .05.
Within Openness: First born versus Later born, t = 1.90,
[significance level illegible]; Within Nonopenness: First born
versus Later born, t = 1.34, ns.
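
The interaction described above is a difference of differences among the four cell means, and the arithmetic can be sketched as follows. The cell means are taken from Table 1; three of them are garbled in the scan and are read here as 7.17, 4.80, and 4.00, which is an assumption. The contrast is an unweighted one on the means and is not the F test reported in the table footnotes.

```python
# Cell means from Table 1 (lower score = more annoyance / more perceived nonopenness).
# Assumption: the garbled cells are read as 7.17, 4.80, and 4.00.
TABLE_1 = {
    "annoyance": {
        ("open", "first"): 7.25, ("open", "later"): 7.17,
        ("nonopen", "first"): 6.23, ("nonopen", "later"): 4.80,
    },
    "perceived_nonopenness": {
        ("open", "first"): 14.76, ("open", "later"): 16.28,
        ("nonopen", "first"): 5.23, ("nonopen", "later"): 4.00,
    },
}


def drop(cells, birth):
    """Fall in mean score from the Openness to the Nonopenness condition."""
    return round(cells[("open", birth)] - cells[("nonopen", birth)], 2)


def interaction_contrast(cells):
    """Later-born drop minus first-born drop (difference of differences)."""
    return round(drop(cells, "later") - drop(cells, "first"), 2)


for measure, cells in TABLE_1.items():
    print(measure, drop(cells, "first"), drop(cells, "later"),
          interaction_contrast(cells))
```

On both measures the drop is larger for later borns (2.37 against 1.02 for annoyance, 12.28 against 9.53 for perceived nonopenness), which is the direction of the interaction the text reports.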


Discussion 


A number of recent studies of birth order 
have investigated the effects of threat of shock 
upon affiliative behavior (Gerard & Rabbie, 
1961; Sarnoff & Zimbardo, 1961; Schachter, 
1959). Schachter reports that when subjects 
were threatened with electric shock, first born 
females chose to be with others more often 
than later born females. Sarnoff and Zimbardo 
report parallel findings for male subjects. The 
latter results are at odds, however, with the 
findings of Gerard and Rabbie. These authors 
found that threat of shock induced stronger 
affiliative tendencies among first born females 
than among later born females, but that the 
relationship was reversed for males. In the 
Sarnoff and Zimbardo experiment, subjects were 
undergraduates at Yale University; in the Gerard 
and Rabbie experiment, subjects were from 
Brooklyn College. We suggest that one might 
seek to reconcile the discrepant findings by 
considering possible sociocultural differences. 
This suggestion is reinforced by the apparent 
discrepancy between the results of the present 
study performed with New York University 
students and Dittes’ (1961) study performed 
with Yale University students. 


2 No reliable birth order effects appeared with
regard to other perceptions of the frustrating agent
apart from his openness.

In the present study, a frustrating agent
withheld information which the subjects felt he
was obligated to divulge. Under these circum-
stances, later born subjects expressed greater
annoyance than first born subjects. In Dittes’
study, subjects were informed that they were 
negatively evaluated by a group of peers. First 
born subjects were then found to express greater 
negative evaluations of the group than later 
borns. The two studies differ in experimental 
manipulations and measures.3 However, both may
be viewed as concerned with the effects of ordinal 
position on the expression of hostility toward 
frustrating agents. On this interpretation, there 
is an apparent inconsistency between New York 
and Yale college students in the domain of 
hostile behavior, which seems to parallel the 
inconsistency found in the domain of affiliative 
behavior (Gerard & Rabbie, 1961; Sarnoff & 
Zimbardo, 1961). With respect to experimental 
manipulations which produce either affiliation 
or hostility, first born males from New York 
colleges react less strongly than later borns, 
while the reverse is true for Yale students.

We suggest the consideration of sociocultural 
factors in any effort to interpret these differences, 
since we do not regard birth order as a psy- 
chological variable. First and later born children 
may be subject to differential child rearing pat- 
terns among different socioeconomic groups. 
Moreover, the current social situations facing
first born or later born college students may
differ under different sets of socioeconomic con-
ditions. We suggest that birth order, like birth-
place, income, or education, is an “ecological”
variable rather than a psychological one. As
such, birth order would not be expected to
produce consistent psychological effects among
samples of subjects which differ in other popula-
tion characteristics. As with any ecological
variable, birth order may induce similar psy-
chological experiences, and therefore similar
psychological effects, within the same sociological
grouping. However, it would be surprising to
find that birth order, any more than income,
etc., induced similar psychological experiences
among different sociological groupings. It would
therefore be equally surprising if birth order
showed uniform effects among persons from
different sociological backgrounds.


3 In a personal communication Dittes proposes that
the interpretation of the discrepant results might be
sought in possible differences in motives that were
frustrated. He suggests that the dependency motive
was frustrated in his study but not in ours.


A METHODOLOGICAL NOTE ON THE CLINE AND RICHARDS'
STUDIES ON ACCURACY OF INTERPERSONAL PERCEPTION


WILLIAM F. O'CONNOR
Civil Aeromedical Research Institute, Oklahoma City


A series of studies reported by Cline and
Richards (1960, 1961) offer evidence for the
existence of a global ability to judge others. The
authors (1961) note that their findings are at
variance with previous research which had failed
to find any generality or global ability to judge.

Cline and Richards in these studies used
sound color films of interviewees to serve as
objects to be rated and judged. From an original
pool of 20 technically acceptable films (each
film representing an interviewee), the number of
films was reduced to 10 (1960) and then
to 6 films (1961). The technique for reducing
the number of films and its rationale is given
as follows:


Selection was made on the basis of the extent to which
films discriminated good judges from poor judges
and thus closely approximates standard item analysis.
Selection on this basis does, of course, to some
extent “stack the cards” in favor of a general trait.
This criticism can be made, however, of any internal
consistency item analysis (1960, p. 1).


The authors, though recognizing the introduc- 
tion of bias, employ a faulty logic in its defense. 
What the authors choose to regard as a “bias” 
in the standard use of internal consistency item 
analysis is the very purpose of the method, 
i.e., that through maximization of item-total
covariance values a homogeneous scale of items
may be had. As the aim of their studies was to
see if a general ability to judge did in fact exist,
the use of a technique, the aim and effect of
which is to build a general factor, becomes a 
critical error in method. 
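
The point can be illustrated with a small simulation, offered only as a sketch under stated assumptions and not as a reconstruction of the Cline and Richards data. Twenty mutually independent "films" are generated for 50 hypothetical judges, so no general factor exists by construction. The six films with the highest item-total correlations are then retained in a single step (the original reduction went from 20 to 10 to 6), and the mean inter-item correlation is compared before and after selection. All sample sizes, the normal scores, and the function names are assumptions.

```python
import random
from itertools import combinations


def avg(values):
    values = list(values)
    return sum(values) / len(values)


def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    mx, my = avg(x), avg(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def mean_inter_item_r(items):
    """Average correlation over all pairs of items (a simple homogeneity index)."""
    return avg(pearson(a, b) for a, b in combinations(items, 2))


def simulate(n_judges=50, n_items=20, n_keep=6, n_reps=100, seed=0):
    rng = random.Random(seed)
    before, after = [], []
    for _ in range(n_reps):
        # Independent item scores: there is no general ability in these data.
        items = [[rng.gauss(0.0, 1.0) for _ in range(n_judges)]
                 for _ in range(n_items)]
        totals = [sum(item[j] for item in items) for j in range(n_judges)]
        # Keep the items that best "discriminate" high from low total scorers.
        ranked = sorted(items, key=lambda item: pearson(item, totals), reverse=True)
        before.append(mean_inter_item_r(items))
        after.append(mean_inter_item_r(ranked[:n_keep]))
    return {"all_items": avg(before), "selected": avg(after)}


result = simulate()
print(result)
```

Averaged over replications, the selected items correlate positively with one another although the full pool does not, because selection on item-total correlation capitalizes on chance covariation. Cross-validating the selected items on a fresh sample of judges would be one way to separate such an artifact from a genuine general factor.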

A similar logic could be employed to support
a directly opposite approach. The use of item
analysis procedures which minimize item-total
covariance values would admittedly introduce
a bias in favor of finding specificity in judging
ability. The use of Cline and Richards’ logic,
however, would permit justification here on the
basis that the bias so introduced can be said
to exist in any item analysis where the aim is
to develop independent scales.


CLINE AND RICHARDS' REPLY TO O'CONNOR'S
METHODOLOGICAL NOTE


VICTOR B. CLINE AND JAMES M. RICHARDS, Jr.
University of Utah


In a recent comment on Cline and Richards'
studies of person perception, O'Connor (1963)
has identified a point which was inadequately
discussed in the original paper on accuracy of
interpersonal perception (Cline & Richards,
1960) published in this journal.

Previous investigators had found little or no
generality of accuracy of interpersonal percep-
tion, and from these earlier results, some had
drawn the conclusion that it is impossible to
construct a homogeneous scale of accuracy of
interpersonal perception. The authors, however,
were convinced that the failure to find a homo-
geneous scale was a result of the methodology
rather than being inherent in the subject
matter. In particular, previous investigators,
after obtaining an initial sample of persons to
be judged, then used this sample without further
selection. However, it is well known that even
for characteristics where it is well established
that it is possible to construct a homogeneous
scale (for example, vocabulary knowledge or
numerical facility), such a scale cannot be
constructed from unselected items. It is neces-
sary to do an item analysis in order to define
a homogeneous scale. Since no such item selec-
tion was done by previous investigators of ac-
curacy of interpersonal perception, the authors
felt it was hardly surprising that no homogeneous
scale was obtained.

The authors therefore used a different procedure 
which involved sound-color movies of inter- 
viewees as persons to be judged. From an initial 
pool of 20 films, each film representing one 
interviewee, the number of films used was re- 
duced first to 10 and then to 6 by a procedure 
approximating item analysis, in which each 
interview film was treated as an item. Spe-
cifically, the criterion used was the degree to
which films discriminated good judges from 
poor as defined by a total accuracy of judgment 
score, or, in other words, a criterion comparable 
to an item-test correlation. Results of this in- 
dicate that when such a selection procedure is 
used, a relatively homogeneous scale of ac- 
curacy of interpersonal perception does appear. 

The authors are of the opinion that these 
results indicate that a homogeneous scale of ac- 
curacy of interpersonal perception will be found 
if a methodology appropriate to the definition 
of a homogeneous scale is used. O'Connor is of 
course correct that items could be selected to 
define a heterogeneous scale. The authors do not 
feel that this possibility invalidates either their 
procedure or their conclusions, since, as 
O’Connor points out, such a procedure could
also produce a heterogeneous scale of vocabulary
knowledge. 

The authors suspect that the underlying diffi- 
culty is that their procedure is necessary but not 
sufficient for the definition of a meaningful 
homogeneous scale. In other words, the authors 
succeeded in establishing a reliable scale, but 
it is still to be demonstrated that their instru- 
ment has “construct validity” (American Psy- 
chological Association, American Educational 
Research Association, & National Council on 
Measurements Used in Education, 1954). In- 
vestigations relevant to this point are currently 
in progress both by the authors and by other 
investigators using their films and judging 
instruments.


INSTITUTIONALIZATION AND THE EFFECTIVENESS OF
SOCIAL REINFORCEMENT:
A THREE-YEAR FOLLOW-UP STUDY 1


EDWARD ZIGLER AND JOANNA WILLIAMS
Yale University


49 familial retarded children who had been tested on a simple 2-part satiation
game 3 years earlier were retested on the same game. The children were
divided into high and low preinstitutional social deprivation groups, and 2
reinforcement conditions were employed. On the original testing a positive
relationship was found between social deprivation experienced and the effec-
tiveness of social reinforcement (p < .05), with no significant reinforcement
effects being found. On retesting, significant reinforcement condition effects
were found (p < .05), but no significant preinstitutional deprivation effects.
An examination of the differences in the children’s performance in the 2 testings
revealed a significantly greater enhancement in the effectiveness of social
reinforcement for the low than for the high socially deprived Ss (p < .01).


Zigler (1958, 1961b) examined the rela-
tionship between the amount of preinstitu-
tional social deprivation experienced by a
group of familial retardates and their socially
reinforced performance on a simple, open-
ended, two-part satiation task. The more
socially deprived children played the game
longer, more frequently made the maximum
number of responses allowed, showed a
greater increase in time spent on Part II over
that spent on Part I, and tended to make
fewer errors than the less socially deprived
children. These findings were interpreted as
supporting the view that the perseveration
observed in institutionalized retardates is due
to the enhanced effectiveness of social rein-
forcers (Stevenson & Zigler, 1957; Zigler,
Hodgden, & Stevenson, 1958) rather than to
an inherent cognitive rigidity (Kounin, 1941;
Lewin, 1936).


1 This research was supported by Research Grant
[number illegible] from the National Institute of Mental
Health, United States Public Health Service.
The study was made possible through the co-
operation of Malcolm J. Farrell, Superintendent, and
[illegible] of the Walter E. Fernald State School at
[illegible], Massachusetts. Thanks are extended to
Catherine E. Chipman, principal psychologist, and
[illegible] H. Davidson, staff psychologist, for their
conscientious work in rating the subjects on the
social deprivation scale. The authors would like to


No relationship was found by Zigler be- 
tween the major measure of social reinforcer 
effectiveness, i.e., amount of time spent on the 
game, and length of institutionalization, per- 
haps the most often used measure of social 
deprivation. It is possible that a relationship 
between these two variables was obscured by 
the effects of a third variable, MA. Since MA 
was found to be negatively related to the ef- 
fectiveness of the social reinforcers dispensed, 
while being positively related to length of 
institutionalization, a general enhancement in 
the effectiveness of social reinforcers with 
longer institutionalization may have been 
concealed. 

Although the notion that institutionaliza- 
tion results in an enhanced effectiveness of 
social reinforcers is a common one, there are 
alternative views as well. The argument can 
be made that since institutional living pro- 
vides a great number of social contacts, the 
effectiveness of social reinforcers would be de- 
creased. This prediction can also be generated 
from a quite different view of the nature of 
institutionalization. For instance, Gewirtz 
(1957) has noted the possibility that institu- 
tionalization represents a condition in which 
social reinforcers are not paired with the 
child's responses sufficiently often to reinforce, 
i.e. condition, those responses. The result 
would thus be a passivity or lack of respon-
sivity in the child.

A study by Clarke and Clarke (1954)
suggests that the effects of institutionalization
may be even more complex. They discovered 
that the greatest increase in mental test scores 
following institutionalization occurred in those 
retarded children who came from the worst 
homes. To the extent that the effects of the 
tester-testee interaction are reflected in the 
test scores, this finding supports the argument 
(Zigler, 1958, 1961a) that institutionalization 
constitutes extreme social privation only for 
children with relatively good preinstitutional 
histories. For such children prolonged institu- 
tionalization would result in the enhanced ef- 
fectiveness of social reinforcers. On the other 
hand institutionalization may ameliorate the 
effects of extreme preinstitutional social de-
privation, resulting in the decreased effective-
ness of social reinforcers. 

The present study is a longitudinal investi- 
gation of the proposition that some systematic 
change in the effectiveness of social reinforce- 
ment occurs with longer institutionalization. 
All of the available subjects used by Zigler 
(1961b) were retested on the same two-part 
satiation task. They received the same sched- 
ule of social reinforcement they had received 
during original testing, approximately 3 years 
earlier. 

METHOD 
Subjects 


Forty-nine of the 60 children tested by Zigler 
(1961b) were available for retesting. All subjects 
were diagnosed as familial retardates and were free 
of any gross motor or sensory disturbances. (See the
earlier article for a complete description of the 
sample and the selection procedure.) The character- 
istics at the time of original testing of those 49 
subjects and the 11 children who were no longer 
available are presented in Table 1. 

As can be seen, the two groups are quite compa- 
rable on all variables examined. Furthermore, the
two groups performed quite similarly in terms of the
major dependent variable, the length of time spent 
on the game. 


Experimenter 


The experimenter at the time of the original test- 
ing was an advanced male graduate student in psy-
chology with considerable experience in testing sub-
jects in experimental problems and working with
children in nursery and clinical settings. The experi-
menter in the present study was an advanced female
graduate student in psychology with approximately
the same amount of experience with children. Data
have been presented (Gewirtz & Baer, 1958) which
indicate that the effectiveness of social reinforcement
is affected by the relationship of the sex of the
experimenter to that of the subject. Therefore a Sex
of Experimenter X Sex of Subject analysis on the
total time score was done, both on the original test
data and the retest data. No significant sex inter- 
action effects were found in either set of data. To 
date, a number of similar studies done by both male 
and female experimenters (Green & Zigler, 196; 
Shallenberger & Zigler, 1961; Zigler et al, 1958) 
have resulted in consistent findings. 


Experimental Game (Marble-in-the-Hole) 


The game was very simple, repetitive, and 
monotonous. It was constructed to reduce relatively 
quickly the subject’s desire merely to “play the 
game,” thus allowing the assumed difference in 
motivation for interaction with the experimenter to 
become the dominant variable in determining per-
formance. The subject’s task was to insert a marble
of one color into one hole and a marble of another 
color into a second hole of an enclosed box. The 
apparatus included: a green box 9 X 6 X 6 inches 
with two small square holes along the middle of the 
lid of the box and separated by 2 inches; 400 
marbles, 200 blue and 200 orange; a red open con- 
tainer 3 X 4 X 4 inches to hold the marbles. 

Part I and Part II differed only in that the
appropriate holes for the colors were reversed. The
subject played the game until he indicated that he
wished to stop either by telling the experimenter
that he wanted to stop or by not inserting a marble
for 30 seconds. The experimenter presented Part II
as “another game,” and the subject played again
until satiated.


TABLE 1

COMPARISON AT THE TIME OF ORIGINAL TESTING OF RETESTED CHILDREN WITH THOSE NO
LONGER AVAILABLE

                                                           Mean number                 Mean Social
                                   Mean    Mean   Mean IQ  of months     Mean CA a     Deprivation
Group                          N   CA a    MA a   upon ad- institu-      [header partly score
                                                  mission b tionalized   illegible]

Retested in present study     49   [remaining values illegible]
Not included in present       11   [illeg.]  6.2    56.4     21.0          9.5         [illegible]
  study

a In years.
b 1937 revision of the Stanford-Binet.


TABLE 2

COMPOSITION OF THE HIGH AND LOW SOCIALLY DEPRIVED GROUPS AT RETESTING

                                                      Mean number of     Mean CA
Group     N    Mean CA a    Mean MA a    Mean IQ b    months institu-    upon
                                                      tionalized         admission

High     26      13.0       [illegible]     55.3          55.2             8.4
Low      23      13.8          6.8          50.0          59.7             8.9


Procedure 


Performance variable. The subjects were tested 
individually, and Zigler’s procedure was followed 
closely. Each subject played one part of the game 
under a Nonsupport (NS) condition and the other 
part under a Support (S) condition. In the NS con-
dition the instructions were given, and the experi-
menter did not interact further with the subject
during the game. In the S condition the instructions
were given, and the experimenter continued after this
to interact with the subject. This interaction included
nonverbal support such as smiles and nods, and
verbal reinforcement in the form of supportive com-
ments about the subject’s performance, one after
every tenth marble was inserted. (See Zigler, 1961b,
for detailed description of procedure.)

Each subject received the same reinforcement se-
quence, support to nonsupport (S-NS) or nonsup-
port to support (NS-S), as he had received during
original testing. The experimenter recorded the
number of responses and the total time spent on
each part of the game.

Social deprivation variable. The social deprivation
scores employed were computed at the time of
original testing. The score for each subject was the
[illegible] of ratings made by two experienced psy-
chologists on the basis of the subject’s social history.
The possible range of scores was from 1 (extreme
of “[illegible] protected”) to 60 (extreme of “very
[illegible]”). At the time of original testing, the
median of the deprivation scores was used to divide
the subjects into “high social deprivation” (HD) and
“low social deprivation” (LD) groups. Of the 49
subjects retested, 26 were in the HD and 23 were
in the LD group. The composition of these groups
at the time of retesting is presented in Table 2.


a In years.
b 1937 revision of the Stanford-Binet given during the year preceding retesting.


RESULTS


The time spent on the game by the 49
subjects, divided into deprivation groups and
reinforcement conditions, during original
testing and retesting is shown in Figure 1.
Analyses were conducted on the subjects’
performance on original testing, on retesting,
and on the difference in the subjects’ perform-
ance between the two testings. In all Rein-
forcement Condition X Deprivation Group
analyses, three subjects in the NS-S, HD 
group, one subject in the S-NS, HD group, 
and one subject in the S-NS, LD group were 
randomly dropped so that there would be an 
equal number of subjects in each cell. 
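
The cell sizes and error degrees of freedom implied by this step can be worked through in a few lines. The sketch assumes that no subject was dropped from the NS-S, LD cell, since the text mentions none; the cell sizes before dropping are inferred from the counts, not reported directly.

```python
# Subjects dropped per cell, as stated in the text.
# Assumption: none dropped from NS-S, LD, because the text mentions no such drop.
DROPPED = {
    ("NS-S", "HD"): 3,
    ("S-NS", "HD"): 1,
    ("S-NS", "LD"): 1,
    ("NS-S", "LD"): 0,
}
N_RETESTED = 49

n_cells = len(DROPPED)
n_analyzed = N_RETESTED - sum(DROPPED.values())
per_cell = n_analyzed // n_cells
df_within = n_analyzed - n_cells

# Implied cell sizes before dropping (an inference, not a reported figure).
before = {cell: per_cell + k for cell, k in DROPPED.items()}
hd_total = sum(n for (_, group), n in before.items() if group == "HD")
ld_total = sum(n for (_, group), n in before.items() if group == "LD")

print(n_analyzed, per_cell, df_within)  # 44 11 40
print(hd_total, ld_total)               # 26 23
```

The 44 analyzed subjects give 11 per cell and 40 within-cell degrees of freedom, which agrees with the df of 1/40 reported for the F ratios, and the implied group totals of 26 and 23 agree with the HD and LD counts given under Procedure.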


Original Testing 


To determine whether the performance of
the 49 subjects on the first testing was compa-
rable to that of the original sample of 60
subjects, a Deprivation Group X Reinforce-
ment Condition analysis was done for these
49 subjects on time spent on Part I, on Part
II, increase in time from Part I to Part II,
and total time during original testing. The
HD group spent significantly more time on
the total game (F = 6.17, df = 1/40, p <
.025) and on Part II of the game (F = 7.16,
p < .025) than did the LD group. Although
not significant, there was a tendency (F =
3.40, p < .10) for HD subjects to increase
more from Part I to Part II than LD sub-
jects. Further analyses revealed that there
was a reliable increase in time from Part I
to Part II in both the NS-S (t = 3.62, p <
.001) and the S-NS conditions (t = 2.07,
p < .05). The effect of reinforcement condi-
tions was not significant, nor was the inter-
action between deprivation group and rein-
forcement condition. Thus the performance
of the 49 subjects was quite similar to that
reported earlier for the 60 subjects.


Retesting 


The S-NS group spent significantly more
time on the total game (F = 4.08, p = .05)
and on Part I alone (F = 7.18, p < .025)
than did the NS-S group. The decrease in
time spent from Part I to Part II of the game
was significantly greater (F = 5.83, p < .025)
for the S-NS than for the NS-S group. Fur-
ther analyses revealed that there was a reli-
able decrease in time from Part I to Part II
for the S-NS group (t = 3.04, p < .001),
but not for the NS-S group (t = 1.82,
p < .10). Since during retesting so many sub-
jects in the S-NS group played both parts of
the game to completion, it is possible that
the decrease from Part I to Part II reflects
nothing more than improved motor skill dur-
ing Part II of the game. No significant de-
privation effect was found, nor was there a
significant interaction between reinforcement
condition and deprivation group. Thus
whereas during original testing all significant
effects were related to deprivation groups,
during retesting all significant effects were
related to reinforcement condition.


Fig. 1. Time spent on the game by each group during original and retesting.


TABLE 3
PERFORMANCE OF THE TWO DEPRIVATION GROUPS ON THE EXPERIMENTAL TASK

                              S-NS condition                         NS-S condition
                        Part I   Part II  Total    Number of    Part I   Part II  Total    Number of
                        mean     mean     mean     subjects     mean     mean     mean     subjects
                        time a   time a   time a   completing   time a   time a   time a   completing
                    N                              both parts                              both parts
Original testing
  High deprivation  11    531      696     1227        2          399      613     1012        2
  Low deprivation   11    392      494      886        3          192      280      472        1
Retesting
  High deprivation  11    730      542     1272        6        [values illegible]
  Low deprivation   11    968      661     1629        8          370      710     1080        8
Difference between
testings
  High deprivation  11    199     -155   [illegible]   4        [values illegible]
  Low deprivation   11    576      167      743        5        [values illegible]

a In seconds.




Difference between the Two Testings 


Difference scores on Part I, Part II, and
total time spent on the game were computed
for each subject by subtracting the time spent
on the original testing from that spent on the
retesting. These difference scores for the HD
and LD groups are presented in Figure 2.

A Reinforcement Condition X Deprivation
Group analysis of variance was run on each
of these difference scores. The difference
scores of the LD group were significantly
greater for total time (F = 9.10, p < .005),
Part I alone (F = 5.83, p < .025), and Part
II alone (F = 8.65, p < .001) than those of
the HD group.
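
As a worked illustration of the difference scores, the sketch below subtracts the original-testing means from the retest means for the S-NS condition, using the values that are legible in Table 3. Because a mean of individual differences equals the difference of the two means, these group figures summarize the per-subject scores; the per-subject data themselves are not available here, and a printed entry may differ by a unit because the table means are rounded.

```python
# Mean times in seconds for the S-NS condition, as legible in Table 3.
ORIGINAL = {"HD": {"part1": 531, "part2": 696}, "LD": {"part1": 392, "part2": 494}}
RETEST = {"HD": {"part1": 730, "part2": 542}, "LD": {"part1": 968, "part2": 661}}


def difference(group):
    """Retest mean minus original mean for each part, plus their sum."""
    diff = {part: RETEST[group][part] - ORIGINAL[group][part]
            for part in ("part1", "part2")}
    diff["total"] = diff["part1"] + diff["part2"]
    return diff


for group in ("HD", "LD"):
    print(group, difference(group))
```

In this condition the LD group gained time on both parts (576 and 167 seconds), while the HD group gained on Part I and lost on Part II, the same pattern as the group comparison reported above.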
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completion, it is possible that . 


Fig. 2. Time spent on the game during retesting minus time
during original testing by the HD and LD groups.


Further analysis showed that for the LD
group there was a reliable increase in time
from the original testing to the retesting for
Part I (t = 4.81, p < .001), and total time
(t = 3.89, p < .001), but not for Part II
(t = 2.00, p < .10). For the HD group, the
increase was reliable for Part I (t = 2.15,
p < .05), but the decrease was not reliable
for Part II (t = 1.15) nor for total time
(t < 1).

These difference scores were unrelated to 
the reinforcement conditions, and there were 
no interaction effects between deprivation 
group and reinforcement condition. 

The change in performance during the two 
testings was examined further by making a 
comparison of the number of subjects in each 
deprivation group who inserted all 800 
marbles. In the LD group 4 subjects made 
the maximum number of responses allowed 
during the first testing, while 16 subjects did 
so during retesting. A test for the comparison 
of percentages in matched samples (Cochran, 
1950) revealed that this difference was sig-
nificant (χ² = 9.3, p < .01). Comparable fig-
ures for the HD group were 4 during original
testing and 12 during retesting, a difference 


TABLE 4
ANALYSIS OF VARIANCE SUMMARIES FOR TIME SCORES

Source                                     MS     df    F

Original testing
  Part I       Deprivation (D)        330,777      1   3.39
               Reinforcement (R)      302,949      1   3.11
               D × R                   12,957      1
               Within                  97,471     40
  Part II      D                      787,922      1   7.16*
               R                      244,211      1   2.22
               D × R                   46,996      1
               Within                 110,068     40
  Total time   D                    2,140,615      1   6.17*
               R                    1,091,790      1   3.14
               D × R                  109,102      1
               Within                 347,179     40
Retesting
  Part I       D                      257,193      1   1.72
               R                    1,073,906      1   7.18*
               D × R                   79,506      1
               Within                 149,581     40
  Part II      D                       84,394      1
               R                      156,605      1   1.86
               D × R                   11,360      1
               Within                  84,120     40
  Total time   D                      636,243      1
               R                    2,050,705      1   4.08*
               D × R                  150,696      1
               Within                 503,048     40
Difference between two testings
  Part I       D                    1,171,319      1   5.83*
               R                      236,085      1
               D × R                   28,154      1
               Within                 200,977     40
  Part II      D                    1,388,049      1   8.65**
               R                        9,691      1
               D × R                   12,146      1
               Within                 160,489     40
  Total time   D                    5,110,909      1   9.10**
               R                      149,878      1
               D × R                    3,352      1
               Within                 561,665     40

* p < .05.
** p < .01.


which is also significant (χ² = 8.00, p < .01).
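
For two occasions, a test for the comparison of percentages in matched samples reduces to the uncorrected McNemar statistic, which depends only on the subjects who changed category. The sketch below illustrates that computation for the HD group. It rests on an assumption not stated in the text: that the 4 subjects who completed the game at original testing also completed it at retesting, so that 8 subjects changed in one direction and none in the other. The function name is illustrative.

```python
def mcnemar_chi_square(gained: int, lost: int) -> float:
    """Uncorrected McNemar chi-square (1 df) for two matched proportions.

    gained: subjects who met the criterion only on the second testing
    lost:   subjects who met the criterion only on the first testing
    """
    discordant = gained + lost
    if discordant == 0:
        raise ValueError("no subject changed category; the statistic is undefined")
    return (gained - lost) ** 2 / discordant


# HD group: 4 completers at original testing, 12 at retesting.
# ASSUMPTION: the 4 original completers also completed at retesting,
# so 8 subjects gained the criterion and none lost it.
hd_chi_square = mcnemar_chi_square(gained=8, lost=0)
print(f"HD group: chi-square = {hd_chi_square:.2f}")
```

Under this assumption the statistic equals the reported value of 8.00. The LD value cannot be checked in the same way, because the number of subjects changing in each direction is not reported.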

These data indicate that while both the 
HD and LD groups increased during the 
second testing, this increase was more pro- 
nounced for the LD than the HD group. 
Since there was an equal, though small, num- 
ber of subjects in each of these groups who 
played the game to completion during original 
testing, the greater increase by the LD group 
during retesting cannot be attributed to an 
initial difference in the number playing the
game to completion. 


Other Findings 


At the time of original testing, an r of
−.459 (p < .01) was found between MA and
the time spent on the game for the 49 re-
tested subjects. This r did not differ signifi-
cantly from the r of −.369 (p < .02) found
for all 60 subjects (z < 1). During retesting,
the r between MA and total time had dropped
to −.240 and was no longer significant (p
> .10). This drop from −.459 to −.240 was
not significant (z = 1.02). Analyses were also
run to examine the effect of MA during re-
testing on time spent on Part I, Part II, de-
crease from Part I to Part II, and the rela-
tionship of MA to the increase from the first
to second testing. In no instance was a sig- 
nificant MA effect found, nor were there any 
significant interaction effects involving MA. 
The subjects’ present IQs were compared to 
those obtained at the time of institutionaliza- 
tion. The IQs of 4 subjects went up, 6 re- 
mained the same, and 39 dropped between 
the time they entered the institution and the 
present study. The mean decrease of 5.86 
points was reliable (t = 8.32, p < .001). The
mean decrease of the LD (X̄ = −6.04) and
HD groups (X̄ = −5.69) did not differ sig-
nificantly (t < 1). The distribution of changes
in IQ scores was divided at the median
(−5.5) to form a small decrease (N = 24)
and a large decrease (N = 25) group. In the
small decrease group, 11 had been run under 
the NS-S and 13 under the S-NS condition; 
comparable figures for the large decrease 
group were 14 and 11. Two S-NS subjects in 
the small decrease and three NS-S subjects 
in the large decrease group were randomly 
dropped, in order to equate the number of 
subjects in each cell. IQ decrease Group X 
Reinforcement Condition analyses of variance 
were run on the retest data on total time, time 
on Part I, time on Part II, decrease in time 
from Part I to II, and on the difference be- 
tween the two testings in total time, time 
on Part I, and time on Part II. The analyses 
of the retest data revealed, in addition to 
the previously found significant reinforcement 
condition effects, that the large decrease group 
spent significantly more time on the total 
game (F = 7.83, p < .01); significantly more 
time on Part II alone (F = 4.38, p < .05); 
and tended to spend more time on Part I 
alone (F = 3.52, p < .10) than did the small 
decrease group. No significant interaction ef- 
fects between IQ decrease group and rein- 
forcement condition were found. No signifi- 
cant effects were found in the analyses involv-
ing the difference between the two testings
data. However, there was a slight tendency
for the large decrease group to show a greater
increase in total time spent on the game be-
tween the two testings than the small de-
crease group (F = 2.02, p < .20).
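
The comparison of the two correlations between MA and time on the game, reported earlier in this section, is the kind of test usually carried out with Fisher's r-to-z transformation. The report does not say which formula was used, so the following is only a sketch of the textbook version for independent samples. Because the two correlations here come from the same 49 children, this version is not expected to reproduce the reported z of 1.02; the function names are illustrative.

```python
import math


def fisher_z(r: float) -> float:
    """Fisher's r-to-z transformation (inverse hyperbolic tangent)."""
    return math.atanh(r)


def z_independent(r1: float, n1: int, r2: float, n2: int) -> float:
    """z for the difference between two correlations from independent samples."""
    standard_error = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    return (fisher_z(r1) - fisher_z(r2)) / standard_error


# Correlations between MA and total time at retesting and at original testing
# (N = 49 both times). ASSUMPTION: treated as independent samples, which they
# are not, so the result is illustrative only.
z_drop = z_independent(-0.240, 49, -0.459, 49)
print(f"z = {z_drop:.2f}")
```

A test that allows for the dependence between the two correlations would also require the correlation between time scores on the two testings, which is not reported.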


Discussion 


The results of the original testing clearly 
supported the view that the effectiveness of 
adult attention and approval is related to the 
degree of social deprivation experienced prior 
to institutionalization. Upon first examination 
of the children’s performance on the same 
game 3 years later, this positive relationship 
between preinstitutional social deprivation and 
the effectiveness of social reinforcement ap- 
pears to have vanished. All the significant ef- 
fects found during retesting appear to be the 
result of the reinforcement conditions, while 
during original testing, performance was un- 
related to the reinforcement conditions. The 
nature of this difference is most clearly shown 
in the children’s performance on Part I, 
which, unlike Part II, is not obscured by the 
interaction effects of two different reinforce- 
ment conditions (Stevenson & Snyder, 1960). 

On Part I during retesting the effectiveness 
of the S condition is clearly greater than that 
of the NS condition. This finding only during 
retesting becomes understandable if one as- 
sumes that 3 additional years of institution- 
alization increase the motivation for all so- 
cial reinforcers and correspondingly increase 
the sensitivity to differing degrees of social 
reinforcement (e.g., approval versus attention
alone). This assumption is supported by the 
finding that the subjects spent more time on 
the game during retesting than during the 
original testing. 

A further examination of this increase re- 
veals what is perhaps the most interesting 
finding in the present study; namely, that 
children with the better preinstitutional his- 
tories show a much greater increase between 
the two testings than do children with the 
poorer histories. This finding points up the 
error of conceptualizing institutional living as 
if it affected all children in the same manner. 
The present study suggests that for children 
who have suffered the greater amount of so-
cial deprivation prior to being institutional-
ized, institutional living adds relatively little
to the already high motivation for social re- 
inforcers. Indeed, in light of the extreme so- 
cial deprivation experienced by many institu- 
tionalized retardates (Clarke & Clarke, 1954; 
Zigler, 1958, 1961b), it would be surprising 
if institutional living were not less socially 
depriving for these subjects than their original 
homes, thus resulting in a lessened effective- 
ness of social reinforcers in the experimental 
situation. Children who have had compara- 
tively good homes, on the other hand, have 
less motivation for social reinforcers early in 
their institutionalization. However, the insti- 
tution is a relatively depriving environment 
for these children, and thus longer institu- 
tionalization produces a greater increase in 
the effectiveness of social reinforcers. 

Whereas during the original testing a nega- 
tive relationship was found between MA and 
total time spent on the game, this relationship 
was no longer significant during retesting. 
While the decrease in the correlation was not 
significant in the present study, such a drop 
would be consistent with the view that length- 
ened institutionalization enhances the effec- 
tiveness of social reinforcers even when they 
are dispensed in conjunction with a game that 
does not challenge the intellectual ability of 
the child. Since the relationship between MA 
and performance was negative on both origi- 
nal testing and retesting, and since the MAs 
have increased between testings, the reported 
differences in the effectiveness of social rein- 
forcers between the two testings would appear 
to be a conservative estimate of the actual 
differences. 

An unexpected finding of the present study 
was the general decrease in IQs between the 
two testings. The magnitude of this decrease 
was not significantly related to preinstitu- 
tional deprivation. While these data are con- 
trary to those of Clarke and Clarke (1954), 
who found a greater increase in the IQs of 
high as compared to low deprived retardates, 
it should be noted that the only subjects in 
the present study who increased in IQ (N
= 4) were in the HD group.

The failure to replicate the findings of 
Clarke and Clarke may be due to two fac- 
tors: their subjects were older and had been 
institutionalized at a later age than the re-
tardates used in this study; and the IQ
changes reported by Clarke and Clarke took 
place during 2 years of institutionalization, 
while the IQ changes in the present study 
were based on 5 years of institutionalization. 
This latter factor becomes increasingly im- 
portant in view of Jones and Carr-Saunders’ 
(1927) finding that normal institutionalized 
children show an increase in IQ early in in- 
stitutionalization and then a decrease in IQ 
with longer institutionalization. 

The work of Clarke and Clarke, Jones and 
Carr-Saunders, and other work (e.g., Guertin, 
1949) dealing with changes in IQ following 
institutionalization have given central impor- 
tance to the intellectual stimulation provided 
by the institution contrasted with that pro- 
vided by the original home. This argument 
suggests that the change is one in the actual 
intellectual potential of the person. The pres- 
ent study suggests that the change in IQ re- 
flects a change in the child's motivation for 
social interaction rather than an actual change 
in his intellectual potential. That is, as social 
deprivation goes up, the desire to interact 
with the adult experimenter increases. Thus 
for the deprived child the desire to be correct 
must compete in the testing situation with 
the desire to increase the amount of the so- 
cial interaction. That this conflict would be 
resolved in favor of the latter motivation 
is suggested by the recent finding (Zigler, 
1962) that “being right” is relatively low in 
the reinforcer hierarchy of institutionalized 
retarded children. This argument would ap- 
pear to provide the conceptual framework for 
Clarke and Clarke's finding that HD subjects 
evidence an increase in IQ with relatively 
short institutionalization, with the LD sub- 
jects demonstrating no greater increase than 
a test-retest control group. One would fur- 
ther expect that with increasing institution- 
alization all children would exhibit a decrease 
in IQ, the phenomenon found by Jones and 
Carr-Saunders (1927), and one that appears 
in the present study. Direct support for this 
view comes from the finding of the present 
study that children who evidence the large 
decrease in IQ play the game longer during 
retesting than do children with smaller de- 
creases in IQ. 

The tendency (p < .20) for the increase in
performance from the first to the second test-
ing to be positively related to the decrease in 
IQ lends further support to this position. It 
should be noted that the original IQ measure 
was obtained at the time of institutionaliza- 
tion, rather than at the time of the original 
testing. This factor may thus conceal the 
actual relationship between increase in time 
spent on the game and decrease in IQ. 

The performance of these children during 
original and retesting and the difference be- 
tween the two testings provide further evi- 
dence that the perseveration manifested by 
familial retardates is related to a variety of 
motivational factors rather than to an inher- 
ent rigidity. These data also suggest that 
many of the reported differences in perform- 
ance between normal and familial retarded 
children may be more parsimoniously related 
to the factors of early social deprivation and 
institutionalization, than to the numerous 
postulated quasi-organismic variables which 
emphasize immutable differences between 
these two types of children. 

VERBAL SATIATION AND THE ROLE OF
CONCURRENT ACTIVITY ¹


ARNOLD MILLER ²
Clark University


2 studies, including 16 Ss in each, were performed to investigate the effect of 
concurrent activity on the maintenance of word meanings (verbal satiation). 
It was found, in both studies, that words repeated with activity maintained 
their meanings significantly longer than words repeated without activity. 
Word meaning maintenance was also found to relate to the kind of concurrent 
activity: words were more facilitated by actions customarily paired with 
them (e.g., push with pushing activity) than by actions not customarily
paired with them. In addition, results indicated that a word could be maintained
longer when the concurrent activity included (Experiment II) the added
sensory input of vision.


A number of investigators have cor- 
roborated the finding that words become 
inadequate symbols, losing their meaning, 
when rapidly repeated or fixated visually 
(Bassett & Warne, 1919; Mason, 1941; 
Smith & Raygor, 1956; Wertheimer, 1958; 
Wertheimer & Gillis, 1958). Recently, this 
effect (verbal satiation) was used by Lambert 
and Jakobovits (1960) to test whether verbal 
symbols achieved their meanings via central 
(neural) or peripheral (muscular) means. In 
this test the authors apparently assumed that 
speech movements represented the only pe-
ripheral mediation available to the organism.
Then, finding that speech movements, per se, 
had no effect on the satiation of word mean- 
ings, they concluded that symbolic meaning 
was essentially due to central mediation. 

That speech movements do not pre-empt 
the possibility of peripheral mediation, is 
suggested by investigations of the relationship 
between symbolic activity and muscle action 
potentials (Jacobson, 1930a, 1930b, 1931, 
1932; Max, 1934, 1937; Shaw, 1940). 
Jacobson (1931), for example, found that 
thinking about bending an arm was, if
visualized, accompanied by muscle action
potentials in the ocular region, while if
thought of as being performed, by muscle
action potentials in the arm. Such findings
suggest that the meaning of word symbols
may be dependent on covert muscle activity
entirely unrelated to speech movements.
Whether speech movements or body move-
ments, the present view is that the meaning-
ful use of spoken or printed words requires
that they be specifically related to some
sensorimotor referent. However, adequately
testing for the existence of such relationships
is difficult because the absence of muscle
action potentials in a given locus may mean
only that muscles in a different site are
covertly mediating symbolic meaning.


¹ Based on a dissertation, submitted November
1958, to the Department of Psychology, Clark Uni-
versity, in partial fulfillment of the requirements for
the PhD degree. The guidance of Bernard Kaplan
and Seymour Wapner is gratefully acknowledged.

The work reported was supported in part by
Grant M 348 and reformulated for publication under
Grant M 4915 from the National Institute of Mental
Health, United States Public Health Service.

² Now with the Department of Psychology,
Montana State University.

This difficulty may be overcome by in- 
vestigating the effects of more easily observed 
overt activity on word symbols. A model 
for such an investigation is suggested by 
clinical observations of aphasic patients 
(Gelb & Goldstein, 1948; Head, 1926; 
Weisenburg & McBride, 1935). Weisenburg 
and McBride, for example, report one pa- 
tient’s ability to read EARTH only after circu- 
lar movements with one hand, and another's 
ability to say scissors only after he had 
wriggled his fingers back and forth as in 
cutting. 

It is argued that the described initial in- 
ability to name or speak words is due to the 
nonspecific, diffuse manner in which they 
were related to body referents. In order to 
regain the ability to read or speak, these
patients are thought to have initiated overt
actions which, because of their salience,³ they
could interpret as meaning-laden word
referents (circular motions for EARTH, etc.).
Only when the relationship between a word 
and its body referent became, in this manner, 
sufficiently specific, could it be read or 
spoken. 

This formulation is consistent with Wer-
ner's (1957) broader statement that a
“creative reorganization” based on an “acti- 
vation of primitive levels of behavior . . .” 
may occur following a dissolution of existing 
patterns. In aphasia the dissolution (diffu- 
sion) of former patterns was presumably 
brought about by cerebral injury. However, 
a similar diffusion of symbolic capacity may 
also be induced among normal individuals 
by means of verbal satiation. Typically, when 
a word is articulated, the meaning of the 
vehicle (spoken word) is dominant and thus 
easily grasped. However, as the same word is
rapidly repeated the sound and speech activ-
ity required to say the word become
dominant. Thus, when repeating the word
CHAIR the movements and sound related to
the CH part of the word become increasingly
pre-eminent. This progressive emphasis on
the nonsense syllable CH results, within
awareness, in the meaning CHAIR being super-
seded by the meaningless sound CH. Thus,
in verbal satiation as in aphasia a situation
is present in which the organism has re-
duced capacity to relate the sound-form of
a word to its meaning.

Verbal satiation, as an analogue for 
aphasia, permits a test of the effect of overt 
action on word symbols. The general pro- 
cedure in the present investigation requires 
that subjects pair a word with an action and 
repeat both in rapid coordination. Then, if 
the satiating word takes longer to lose its 
meaning accompanied than unaccompanied by 
action, this is support for the view that nor- 
mal adults may use their body activity to 
convey the meaning of a word to themselves. 

Two separate studies, following the same 
general procedure, were performed. The first
compared the effect on word meaning main-
tenance of four conditions (three Action and
one No Action); the second investigated the
effect on word meaning maintenance of vision
and touch within the context of two kinds
of action.


³ The salience of overt actions is thought due to
kinesthetic “feed-back” (Freeman, 1948) to the
cortex.


EXPERIMENT I 

Method 


Subjects. The subjects were 16 men and women 
volunteers from the university population. Their 
ages ranged from 19 to 27. Each subject, prior to 
testing, went through a pretraining session. 

Pretraining. The purpose of pretraining was to 
enable the subjects to establish a criterion for 
verbal satiation as well as to more strongly relate 
certain words to the overt actions they connoted. 
It also prepared the subjects for the experimental 
task of simultaneously repeating both word and 
action with a metronome beating between 2 and 3 
times per second. 

Satiation criterion. Satiation was defined for 
the subjects as “the point at which the repeated 
word becomes jargon or meaningless sound.” The 
subjects repeated the practice words PEACE and 
LUCK until certain they could determine, and
report by stopping, the moment this transforma- 
tion from coherent word to jargon had occurred. 
In the course of establishing this discrimination 
subjects repeated each practice word in tempo 
with the metronome. 

Word-action relationships. The word-action
relationships were emphasized for the words 
PUSH, PULL, LIFT, and LOWER. This emphasis was 
brought about by the experimenter, on three sepa- 
rate occasions (while the subject performed an 
action), informally introducing the word that 
characterized that action. The experimenter would 
say the following to each subject: 


See this drawer. Place your hands on it as 
if you were going to push it shut. Now, when 
you try to close the drawer ... go ahead... 
you'll notice that the drawer keeps coming back 
making it possible to push it again. [Experimenter 
turns on the metronome.] Now, I want you to 
push the drawer in tempo with the metronome.


A similar procedure was followed to stress word- 
action relationships while the subject performed 
pulling, lifting, and lowering actions. During each 
action the experimenter would introduce (three times 
per action) the words (PULL, LIFT, and LOWER) which 
described that action. The subjects were trained 
to perform their actions in a pulsing manner in 
tempo with the metronome. 

Apparatus. Pushing and pulling actions were 
performed on a spring-loaded drawer while lower-
ing and lifting were performed on a pulley-weighted
window. The drawer was constructed to protrude
2 inches from the table and to permit a rapid pulsing 
action, in the horizontal plane. When pushing the 
subject placed both palms on the flat surface of
the drawer and when pulling hooked fingers of
both hands under the drawer’s overhang. With
regard to the window the pulsing actions were
performed in the vertical plane. When lifting the
subject placed fingers of both hands under the
window’s frame and when lowering placed fingers
of both hands above the frame. It was determined,
following the experiment, that in order to perform 
actions on either drawer or window at a rate 
of 2-3 pulses per second, subjects had to limit the 
force of their pulse to 3-8 pounds. 

Testing. The experiment began with the follow-
ing instructions:


Now that you have had practice in repeating 
words and then actions in tempo with the 
metronome, I am going to ask you to repeat 
both a word and an action together in tempo 
with the metronome. The word you will repeat is 
“push.” Each time you repeat the word you
repeat the action unless I tell you to repeat the
word by itself. When the word becomes meaning-
less as it did when we were practicing, stop, as
you did earlier.


After the word PUSH, the subject was required
to repeat the word LIFT. In all, the subject went
through four experimental treatments during which 
the experimenter was outside the subject’s line of 
vision recording latency for each satiation on a 
stop watch. 

Experimental treatments. The four treatments 
employed were designated Concordant, Discordant, 
Tangential, and No Action. 

Concordant. An overt action considered to cor-
respond to the meaning of the word being repeated:
action against the drawer (line of force away
from the subject) repeated with the word PUSH
was considered concordant. Similarly, lifting or
raising the window (line of force upward) re-
peated with LIFT was considered concordant.

Discordant. An action presumed to oppose the
word meaning: when the word PUSH was repeated,
pulling the drawer (line of force toward the sub-
ject) was considered discordant. Similarly, repeat-
ing LIFT while lowering the window (line of force
downward) was considered discordant.

Tangential. An action with a vector presumed
tangent to the word’s meaning. The subject re-
peated the word PUSH while lowering the window
and then LIFT while pulling the drawer.

No Action. The subject repeated the word PUSH
and then LIFT without action accompaniment.

Design. A Latin square design with four repeated 
measurements, replicated four times with two males 
and two females for each of the four sequences, 
was used. Under each treatment two scores were 
obtained, one for PUSH and one for LIFT, each 
score based on seconds elapsed before loss of mean-
ing was reported. Two distributions were established:
The first from PUSH scores and the second from 
LIFT scores. Prior to performing analyses of
variance, latency scores were transformed into
common logarithms.


Fig. 1. Mean log₁₀ (seconds) of maintained word
meanings under four treatments.
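
The Latin square arrangement and the logarithmic transformation described under Design can be sketched as follows. The report does not list the four treatment sequences actually used, so the cyclic square below is only one possible arrangement, and the latencies are hypothetical values chosen for illustration.

```python
import math

TREATMENTS = ["Concordant", "Discordant", "Tangential", "No Action"]


def cyclic_latin_square(items):
    """Each row is the previous row rotated one position to the left."""
    n = len(items)
    return [[items[(row + col) % n] for col in range(n)] for row in range(n)]


def log_latency(seconds: float) -> float:
    """Common (base-10) logarithm of a latency in seconds."""
    return math.log10(seconds)


# ASSUMPTION: one possible set of four sequences, not those used in the study.
sequences = cyclic_latin_square(TREATMENTS)
for number, sequence in enumerate(sequences, start=1):
    print(f"Sequence {number}: {', '.join(sequence)}")

# Hypothetical latencies in seconds, for illustration only.
example_latencies = [4.0, 10.0, 25.0]
print([round(log_latency(s), 3) for s in example_latencies])
```

In such a square every treatment appears once in each sequence and once in each ordinal position, which is what allows ordinality and treatment effects to be separated in the analysis.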


Results 


The effects on maintenance of both word 
meanings of concurrent activity are shown in 
Figure 1. In general, this figure suggests that 
words accompanied last longer than words 
unaccompanied by action. However, Table 1 
indicates that the main effect for treatments 
was significant for PUSH (F = 6.30, df =
3/36, p < .01) although not for LIFT (F =
2.26, df = 3/36, p > .05). Since the main


TABLE 1
ANALYSES OF VARIANCE FOR PUSH AND LIFT

                              PUSH               LIFT
Source               df     MS       F         MS       F

Between subjects     15   .0739   <1.00      .1276   <1.00
  Sequence            3   .0433   <1.00      .1197   <1.00
  Error (b)          12   .0816              .1296
Ordinality            3   .0423    6.13*     .0069   <1.00
Treatments            3   .0435    6.30*     .0237    2.26
Square uniqueness     6   .0200              .0030
Error (w)            36   .0069              .0105
Total                63

Note. Logarithmic transformation.
* p < .01.


TABLE 2
TABLE OF t VALUES BETWEEN TREATMENTS FOR PUSH

Treatment      Discordant   Tangential   No Action

Concordant       2.20*        2.50*        4.35**
Discordant                     .30         2.14*
Tangential                                 1.83
No Action

* p < .05, two-tailed.
** p < .01, two-tailed.


effect for treatments was significant in the 
case of PUSH, t tests were performed, using
transformed scores, to examine the differences 
between the four treatments. Table 2 in- 
dicates that for PUSH concordant activity 
accompanied significantly greater mainte- 
nance of meaning than all other treatments. 
PUSH repeated with discordant activity main- 
tained its meaning significantly longer than 
with the No Action treatment (p < .05). 
When PUSH was repeated with Tangential
action it lasted longer than with the No 
Action treatment but this difference was not 
significant. 

Although the meaning of PUSH appeared
influenced by concurrent activity, information 
about the effect on word meaning of subjects’ 
looking at their actions and touching drawer 
or window was not obtained. It is suggested 
that the meaning of words may be main- 
tained longer when the added sensory stim- 
ulation of looking and touching is present 
within the context of a concurrent activity. 
To test this hypothesis as well as to replicate 
the present findings another experiment was 
performed. 


EXPERIMENT II 


The second experiment tested the effect on 
the maintenance of word meanings of vision 
and touch within the context of two kinds 
of concurrent activity. 


Method 

Subjects. Another group of 16 subjects, evenly 
divided between men and women, was obtained 
from the university population. Ages ranged from 
18 to 23. 

Pretraining and apparatus. The same as Experi- 
ment I. 

Testing. The three variables in this experiment 
were designated Acting, Looking, and Touching.
Each was varied in two ways: Acting into Con- 
cordant and Discordant action, Looking (vision) 


into Looking and Not Looking (eyes closed), and 
Touching into Touching and Not Touching (drawer 
or window). 

Concordant and Discordant action. As in the
first experiment actions commonly termed pushing
with PUSH or lifting with LIFT were considered
Concordant; those commonly termed pulling with
PUSH or lowering with LIFT were considered
Discordant.

Looking and Not Looking. This variable was 
defined by the subject’s Looking or Not Looking 
(eyes closed) at the action he was performing. 

Touching or Not Touching. This variable was 
defined by the subject’s Touching or Not Touching 
the object (drawer or window) in the action he 
was performing. 

Design. Each subject went through eight experi-
mental treatments set up as a 2 × 2 × 2 factorial
within an 8 × 8 Latin square replicated twice (one
male and one female for each sequence). The treat-
ments for each subject were distributed as follows:


Condition    Treatment

A            Concordant action, Not Touching, Looking
B            Concordant action, Not Touching, Not Looking
C            Concordant action, Touching, Looking
D            Concordant action, Touching, Not Looking
E            Discordant action, Not Touching, Looking
F            Discordant action, Not Touching, Not Looking
G            Discordant action, Touching, Looking
H            Discordant action, Touching, Not Looking


Included within each condition were four scores.
Two of these scores were derived from PUSH and
LIFT repeated with either Concordant or Discordant
action; the other two scores were derived from
PUSH and LIFT repeated without action. Words with-
out action paralleled in all other respects the
treatment of words repeated with action. The two
2 × 2 × 2 analyses of variance for LIFT and PUSH
were based on difference scores obtained by sub-
tracting the scores of words repeated without
action from scores of words repeated with action.
The effect of Action versus No Action was as-
sessed by pooling a word’s Action scores and com-
paring them with the word’s No Action scores by
means of two t tests for correlated measures, one
for each word. Prior to performing analyses of
variance and t tests, all scores were transformed into
common logarithms.
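
The eight treatments are the full crossing of the three two-level variables. As an illustration, the sketch below enumerates that crossing and assigns the letters A through H that the Results use to identify conditions. The variable names are illustrative; the ordering of levels was chosen so that the lettering agrees with the groupings cited in the Results (touching: C, D, G, H; looking: A, C, E, G).

```python
from itertools import product
from string import ascii_uppercase

ACTING = ("Concordant", "Discordant")
TOUCHING = ("Not Touching", "Touching")
LOOKING = ("Looking", "Not Looking")

# product() varies the last factor fastest; zip() stops after the eight
# combinations, so only the letters A through H are used.
conditions = dict(zip(ascii_uppercase, product(ACTING, TOUCHING, LOOKING)))

for letter, levels in conditions.items():
    print(letter, ", ".join(levels))

# Letters of the conditions that include object contact or vision.
touching = sorted(k for k, v in conditions.items() if v[1] == "Touching")
looking = sorted(k for k, v in conditions.items() if v[2] == "Looking")
print("Touching:", touching)
print("Looking:", looking)
```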


Results 


Present findings partially support the hy-
pothesis that word meanings last longer when
visual and tactual stimulation are included
within the context of concurrent activity.
They also provide a replication of Experi-
ment I. Considered first are findings con-
cerned with the effect on word meanings of
added sensory stimulation and second, those
concerned with replication.


Fig. 2. Mean log₁₀ based on difference scores
(seconds coded +5) for maintained word meaning
under eight treatments.

Added sensory stimulation. Tactile. Fig-
ure 2 shows that in three of the four condi-
tions (C,D,G,H) in which PUSH and LIFT
were repeated with tactual-object-contact, they
maintained their meanings longer than words
without such contact (A,B,E,F). However,
Table 3 indicates that for neither PUSH nor
LIFT was this difference significant (for PUSH,
F = 3.26, df = 1/98, p > .05; and for LIFT,
F < 1.00, df = 1/98, p > .05). An F =
3.94, df = 1/98 is required for the .05 level.

Visual. Figure 2 shows that PUSH main-
tained its meaning longer when repeated as
subjects looked at their actions (C,A,G,E)
than when repeated with subjects’ eyes closed
(D,B,H,F). Table 3 indicates that this dif-
ference was significant for PUSH (F = 5.07,
df = 1/98, p < .05). This level was not
reached for LIFT (F < 1.00, df = 1/98,
p > .05).

Replicated findings. Concordant versus
Discordant effects. Figure 2 shows the rela- 




TABLE 3
ANALYSES OF VARIANCE FOR PUSH AND LIFT

                                        PUSH               LIFT
Source                         df     MS       F         MS       F

Between subjects               15   .0388    1.32      .0460    1.00
  Sequence                      7   .0495    1.68      .0461    1.01
  Error (b)                     8   .0294              .0458
Ordinality                      7   .0379    1.97      .0132   <1.00
Conditions                      7   .0528    2.75*     .0390    1.97
  Concordant-Discordant         1   .1649    8.59**    .1586    8.01**
  Touching-Not Touching         1   .0625    3.26      .0163   <1.00
  Looking-Not Looking           1   .0974    5.07*     .0064   <1.00
  Action × Touching             1   .0307    1.60      .0155   <1.00
  Action × Looking              1   .0007   <1.00      .0000   <1.00
  Touching × Looking            1   .0009   <1.00      .0197   <1.00
  Action × Touching × Looking   1   .0128   <1.00      .0573    2.89
Square uniqueness              42   .0210              .0157
Error (w)                      56   .0178              .0229
Pooled Error                   98   .0192              .0198
Total                         127

Note. Logarithmic transformation.
* p < .05.
** p < .01.


tive effects on word meanings of Concordant 
and Discordant action. As in Experiment I 
(for Pus) word meanings last longer with 
Concordant than Discordant action. Table 3 
reveals that these differences were significant 
for both words (for pusm, F = 8.59, df= 
1/98, p < .01; and for LFT, F = 8.01, df 
= 1/98, p < .01). 
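
The F ratios in Table 3 can be checked as the ratio of an effect's mean square to an error mean square. The following is a minimal sketch in Python, assuming (as the tabled values suggest, though the text does not say so explicitly) that the condition effects were tested against the pooled error term. The function name `f_ratio` and the dictionary names are illustrative only.

```python
def f_ratio(ms_effect: float, ms_error: float) -> float:
    """Return the F ratio: effect mean square divided by error mean square."""
    return ms_effect / ms_error


# Mean squares read from Table 3 (log-transformed scores).
POOLED_ERROR = {"PUSH": 0.0192, "LIFT": 0.0198}
CONCORDANT_DISCORDANT = {"PUSH": 0.1649, "LIFT": 0.1586}
LOOKING = {"PUSH": 0.0974, "LIFT": 0.0064}

for word in ("PUSH", "LIFT"):
    f_cd = f_ratio(CONCORDANT_DISCORDANT[word], POOLED_ERROR[word])
    f_look = f_ratio(LOOKING[word], POOLED_ERROR[word])
    print(f"{word}: Concordant-Discordant F = {f_cd:.2f}, Looking F = {f_look:.2f}")
```

Each ratio is then compared with the tabled critical value for df = 1/98 (3.94 at the .05 level, as stated in the text).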

Action versus No Action. Table 4 indicates a 
successful replication of the Experiment I finding 
that word meanings last longer accompanied than 
unaccompanied by action. The means in this table, 
based on the logarithms of seconds, were obtained 
by averaging the subject's eight action scores 
(four Concordant and four Discordant) and 
contrasting them with his eight averaged No Action 
scores. The direct difference method for related 
measures was used to compute t tests (for PUSH, 
t = 2.61, df = 15, p < .05; and for LIFT, t = 2.96, 
df = 15, p < .01). 


TABLE 4 
MEANS FOR WORDS REPEATED WITH ACTION (Averaged 
across Concordant and Discordant) VERSUS WORDS 
REPEATED WITHOUT ACTION 

Word        Action    No Action      t 
PUSH 
  M          .7728      .7268      2.61* 
  SD         .1710      .1491 
LIFT 
  M          .7720      .7132      2.96** 
  SD         .1884      .1674 

Note. Logarithmic transformation. 
* p < .05, two-tailed. 
** p < .01, two-tailed. 
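
The direct difference method for related measures can be sketched as follows. This is an illustration in Python, not the original computation: the function name is an assumption, and the scores are hypothetical, since the individual subjects' values are not reported. The scores are assumed to be already log-transformed.

```python
import math
from statistics import mean, stdev


def direct_difference_t(action, no_action):
    """t for related measures by the direct difference method.

    Returns (t, df), where df = n - 1 and n is the number of subjects.
    """
    if len(action) != len(no_action) or len(action) < 2:
        raise ValueError("need two equally long lists with at least two subjects")
    diffs = [a - b for a, b in zip(action, no_action)]
    n = len(diffs)
    standard_error = stdev(diffs) / math.sqrt(n)  # SE of the mean difference
    return mean(diffs) / standard_error, n - 1


# Hypothetical log10 scores for four subjects (NOT data from the study).
action = [0.80, 0.75, 0.90, 0.70]
no_action = [0.70, 0.72, 0.80, 0.69]
t, df = direct_difference_t(action, no_action)
print(f"t = {t:.2f}, df = {df}")
```

With the 16 subjects of Experiment II the same computation would carry df = 15, matching the degrees of freedom reported above.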


Discussion 


The findings of both experiments show 
that word meanings last longer accompanied 
by action than unaccompanied by action. 
These results support the view that normal 
as well as aphasic people may use their body 
activity to facilitate the meaning of word 
symbols. However, a critic of this formula- 
tion might argue that the effect of action on 
word meanings may more appropriately be 
attributed to distraction than facilitation. 

The inadequacy of a distraction explanation becomes 
apparent when the effects that such an explanation 
predicts for Discordant and Concordant action are 
compared with actual findings. If activity distracts 
the subject from awareness of a word’s loss of 
meaning, and one activity distracts more than 
another, then Discordant activity (e.g., pulling 
with PUSH) should be more distracting than 
Concordant activity (e.g., pushing with PUSH). 
Since more distraction would presumably re- 
sult in a word appearing to last longer, it 
would follow that a word accompanied by 
Discordant activity should last longer than 
a word with Concordant activity. This ex- 
pectation is, in both experiments (see Fig- 
ures 1 and 2), precisely opposite the observed 
effects of these actions. 

It is contended that the effect of Concordant and 
Discordant activity on word meaning is a function 
of the organism’s capacity to interpret activity as 
well as the accessibility of this activity. When 
the organism is able to perceive a relationship 
between its activity and the word with which it is 
engaged, that activity becomes meaningfully related 
to the word. Potentially, any* action may, through 
such perception, be used to facilitate a word’s 
meaning. Thus, just as aphasic patients (Weisenburg 
& McBride, 1935) facilitated their use of EARTH and 
SCISSORS by an interpreted body activity, so a 
normal subject reported that when she reappraised 
the lowering action being repeated with PUSH as a 
pushing down, the satiating word PUSH suddenly 
became more meaningful. In similar fashion the 
facilitative effect on PUSH of a pulling activity 
can be explained: Pulling, although commonly 
thought of as a line of force toward the puller, 
may be interpreted instead as a pushing toward and 
thus facilitate the word PUSH. 

* Wertheimer (1958) has suggested that any factor 
which increased the strength of the connection 
between a word sound and its meaning would result 
in that word taking longer to lose its meaning. 

Although, theoretically, any body activity 
may be used to facilitate word meanings, 
present findings suggest that certain actions 
are more accessible than others. For example, 
if an action has customarily been paired 
with a particular word, and the word begins 
to lose meaning, the organism would more 
readily use this action than another for 
facilitation of the word’s meaning. The 
customary relationship between a word and 
its accompanying action is thought to ex- 
plain the finding, in both experiments, that 
PUSH maintained its meaning longest when 
accompanied by an activity generally thought 
of as pushing, less with an activity thought 
of as pulling, and least when not accompanied 
by an activity. 

Another factor contributing to the greater 
accessibility of an action is the salience of 
certain actions. If one ongoing action is more 
impressive or salient than another, then this 
action may be more easily used to facilitate 
word meanings. This explanation is consistent 
with the finding that PUSH (in Experiment 
II) was more facilitated by a condition in 
which the subjects looked at the action they 
were performing. Presumably, the increased 
sensory input of looking at an action can 
make it more dominant and thus available 
for symbol facilitation. 
INSTRUCTIONS AS DISCRIMINATIVE STIMULI IN 
VERBAL CONDITIONING AND AWARENESS* 


LAWRENCE SIMKINS 
Florida State University 


This investigation studied the effects of preconditioning instructions and 
type of reinforcer on level of awareness and extent of conditioning. Prior to a 
conditioning task, Ss in different experimental groups were given varying 
degrees of information concerning the nature of the reinforcement and the 
reinforcement contingency. In some groups of Ss the reinforcement consisted 
of E saying “good” while in other experimental groups the Ss 
received points as reinforcement. The results indicated that the degree of 
conditioning and level of awareness were partially dependent upon the amount 
of information in the preconditioning instruction. However, in experimental 
groups where only partial information was given, Ss who received points 
conditioned better than Ss who received “good” even though there were no 
differences in the level of awareness. The nature of the response class was also 
found to be an important variable that affected the level of conditioning. 


The recent upsurge of interest in verbal 
conditioning has been accompanied by an increased 
concern with the role of awareness in 
learning. Previous investigations dealing with 
the role of awareness in verbal conditioning 
have certainly not produced unequivocal re- 
sults. For instance, Greenspoon (1955) and 
Wilson and Verplanck (1956) indicated that 
the majority of the subjects in their studies 
conditioned without awareness. In a summary 
article of the verbal conditioning studies, 
Krasner (1958) states, “Over half of the 
studies reported that none of the Ss evi- 
denced awareness. In all [studies] roughly 
5% of all patients in the 31 studies combined 
became ‘aware’ by the definition of each E” 
(p. 159). However, recent studies by Kanfer 
and McBrearty (1961), Matarazzo, Saslow, 
and Pareis (1960), and Mandler and Kaplan 
(1956) have in general shown that “aware” 
subjects produced significantly more condi- 
tioning than nonaware subjects. 

The conflicting results of past investigations 
may be partially a function of the instructions 
(or lack of them) given to the 
subjects prior to the conditioning task. In this 
respect the instructions themselves can be 
viewed as discriminative stimuli in the pres- 
ence of which certain contingent stimuli 
may have differential reinforcing effects. The 
amount of information contained in the ex- 
perimental instructions could very easily af- 
fect the awareness level of the subjects. One 
of the purposes of the present investigation 
was to study the effects of preconditioning 
instructions on the degree of verbal condition- 
ing and the reported level of awareness. 

A previous investigation by Simkins (1961) 
indicated significant differences in condition- 
ing as a function of the type of reinforcer that 
was employed. The results of that study indi- 
cated that the verbal reinforcers, “good” and 
“that’s fine,” were highly variable in their ef- 
fects and dependent on the attitude displayed 
by the experimenter prior to the conditioning 
task. On the other hand, when “points and 
pennies” were employed as reinforcers, the 
performance of all groups was similar and 
relatively independent of the differing experi- 
menter attitudes. Examiner attitudes and/or 
test instructions could both be looked upon as 
types of discriminative stimuli which in part 
determine the effectiveness of different rein- 
forcing stimuli. Another purpose of the pres- 
ent study was to investigate the effects of 
type of reinforcer on conditioning and level 
of awareness for different preconditioning 
instructions. 
METHOD 


The general procedure employed in this study was 
similar to that developed by Taffel (1955). A series 
of 3-inch X 5-inch index cards was prepared. On 
each card were three words which varied in length. 
One word had three letters and was defined as a 
short word. Another had four to five letters and was 
defined as a medium word, and the third word had 
six or more letters and was defined as a long word. 
All words on a given card were matched in their 
frequency of usage from the Thorndike-Lorge word 
count. The order of the word lengths was randomized 
in order to prevent a position effect. There was 
an attempt to have all three words on a given card 
as unrelated in content as possible, but, of course, 
it was realized that differences in degree of 
association value were not completely controlled. A total 
of 110 cards was prepared in this manner. Each card 
was presented twice to each subject for a total of 
220 trials. The order of presentation of the cards was 
randomized for each subject. 

An analysis of some control data had indicated 
that long words were selected significantly more 
often than the short and medium words. The re- 
sponse class of word length was therefore considered 
as an important variable for investigation. 


Subjects 


A total of 90 subjects participated in this experi- 
ment. These subjects were recruited from elementary 
psychology classes at Florida State University. The 
subjects were randomly assigned to one of six treat- 
ment groups. Each group had 15 subjects. The sub- 
jects in each treatment group were divided so that 
there were five subjects for each of the three types 
of response classes. 


Procedure 


The experiment was arbitrarily divided into three 
stages. The first three blocks of 20 trials constituted 
the operant period. The purpose of this stage of the 
experiment was to determine the frequency of usage 
of the three response classes of words. The “correct” 
response class was defined as that class having the 
lowest frequency provided it had at least a mean of 
four responses per 20-trial block. If the lowest fre- 
quency response class had a mean of less than four, 
the next highest response frequency was considered 
the correct response class.² 
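
The selection rule for the “correct” response class can be sketched in Python as follows. The function name, the data layout, and the example counts are hypothetical. The sketch also assumes that when the lowest-frequency class falls below the criterion the rule simply moves up to the next class that meets it; the text states only the one-step case.

```python
from statistics import mean

MIN_MEAN_PER_BLOCK = 4  # criterion stated in the procedure


def select_correct_class(operant_counts):
    """Pick the response class to be reinforced.

    operant_counts maps a class name to its frequency in each of the
    20-trial operant blocks. Returns the lowest-frequency class whose mean
    per block is at least MIN_MEAN_PER_BLOCK, or None if no class qualifies.
    """
    means = {name: mean(counts) for name, counts in operant_counts.items()}
    for name in sorted(means, key=means.get):  # ascending mean frequency
        if means[name] >= MIN_MEAN_PER_BLOCK:
            return name
    return None


# Hypothetical operant data for one subject (three blocks of 20 trials).
subject = {"short": [3, 2, 4], "medium": [6, 7, 5], "long": [11, 11, 11]}
print(select_correct_class(subject))  # short averages 3, so medium is selected
```

In this example the short class averages fewer than four responses per block, so the next most frequent class becomes the correct class.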

The next five blocks of 20 trials constituted the 
conditioning period during which time the experimenter 
administered reinforcement every time that 
the subject employed a word that was in the reinforced 
class. An extinction period of three blocks 
of trials, during which time the experimenter no 
longer administered reinforcement, followed the 
conditioning stage. 

² The design called for five subjects in each response 
class subcell. Actually a total of 121 subjects 
participated in this experiment, but 31 subjects had 
to be eliminated because their operant response class 
frequencies did not meet the design requirements. 
There was considerable difficulty in obtaining subjects 
for the long word response class. 

Prior to the beginning of the experiment all 
subjects were given the following instructions: 


We are investigating the verbal behavior of 
college students and their ability to construct ade- 
quate sentences. I am going to present to you a 
series of 3 X 5 index cards. Each card will have 
three words on it. Choose one of these words and 
make a sentence with it. Tell me the word you 
are going to use before making the sentence. 


Control group. The subjects in this group were 
not given any further instructions and simply pro- 
ceeded to make up sentences for 220 trials. There was 
no reinforcement administered to these subjects. 

Experimental Group I. The subjects in this group 
were treated identically to the subjects in the Control 
group with the exception that during the reinforce- 
ment period, the experimenter said “good” each time 
the subject employed a word that was in the re- 
inforced class in constructing his sentence. During 
the extinction period, reinforcement was no longer 
administered. 

Experimental Group II. At the end of the operant 
period, these subjects were given the following 
instructions: 


From now on you will continue to make sen- 
tences in the same manner, but from now on I will 
say "good" each time you make up an adequate 
sentence. 


Experimental Group IIa. At the end of the operant 
period, these subjects were given the following 
instructions: 


From now on you will be earning points. Each 
time you construct an adequate sentence, I will 
say “your point" and you keep a tally of your 
points on this paper. Do not tally a point unless 
I say “your point.” 


Experimental Group III. At the end of the operant 
period, these subjects were given the following 
instructions: 


You will notice that each card has three words 
on it. All three words differ in their length. There 
are short words, that is, words having three 
letters, medium words, that is, words having four 
or five letters, and long words, that is, words 
that have at least six or more letters. From now on 
I will say “good” every time you make up an ade- 
quate sentence with a [reinforced class] word in it. 


Experimental Group IIIa. The instructions given 
to this group were the same as Group III, with the 
exception that the subjects were instructed to tally 
points in the same manner as the subjects in Group 
IIa. 
TABLE 1 


POSTTEST QUESTIONNAIRE AND CRITERIA FOR SCORING 
CONDITIONING AND EXTINCTION AWARENESS 
LEVELS 


Questionnaire 


1. What do you think is the purpose of this experi- 
ment? 


2. Do you believe that some of your sentences were 
better than others? Which ones? 


3. Did the experimenter say anything that might 
have led you to believe that some of your sentences 
were better than others? What did this mean to you? 


4. For what sentences or types of sentences did you 
think that the experimenter was saying this? 


5. Did this influence you to give more of these type 
sentences? 


6. Did you notice any particular period or periods 
of time when the experimenter was not saying “good”? 


7. What did you interpret this to mean? 


8. Did this in any way cause you to modify the types 
of sentences that you gave? In what manner? 


Conditioning Awareness Level Criteria 


Level 1: subject does not verbalize either the presence 
of the reinforcing stimulus or the correct reinforcement 
category. 

Level 2: subject verbalizes that he is aware of the 
reinforcing stimulus, but verbalizes an incorrect 
contingency. 


Level 3: subject verbalizes awareness of the rein- 
forcing stimulus and correct response reinforcement 
contingency. 
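
The three scoring criteria can be read as a simple decision rule. The sketch below, in Python, is one possible coding of them; the function and argument names are hypothetical. The case of a subject who states the correct contingency without mentioning the reinforcing stimulus is not covered by the criteria, so the sketch rejects it rather than guessing.

```python
def awareness_level(verbalizes_reinforcer: bool, verbalizes_correct_contingency: bool) -> int:
    """Score a questionnaire protocol as conditioning awareness Level 1, 2, or 3."""
    if verbalizes_reinforcer and verbalizes_correct_contingency:
        return 3  # aware of the stimulus and of the correct contingency
    if verbalizes_reinforcer:
        return 2  # aware of the stimulus, but contingency incorrect
    if not verbalizes_correct_contingency:
        return 1  # verbalizes neither
    # Assumption: this combination is not defined by the published criteria.
    raise ValueError("contingency verbalized without the reinforcing stimulus")


print(awareness_level(True, False))
```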


Posttest questionnaire. At the end of the experi- 
ment a posttest questionnaire was administered to 
determine the level of awareness at the termination 
of the experiment, and also as a validating check on 
our differing sets of instructions. The questionnaire 
further served the purpose of determining the aware- 
ness of our Group I subjects, since they had not 
been given specific instructions prior to the condi- 
tioning period. The questions asked were aimed at 
determining the extent to which the subject could 
verbalize the correct reinforcement contingency, whether he 
was aware of the reinforcement during the extinction 
period, and the extent to which he was aware of 
any modification of his verbal behavior as a func- 
tion of the experimental procedures. The posttest 
questionnaire and the scoring criteria for the dif- 
ferent levels of awareness are presented in Table 1. 


RESULTS 


Behavioral Data 


An analysis of variance of the frequency of 
correct responses given during the operant 
stage of the experiment indicated that the 
various treatment groups did not differ 
significantly (F = 1.72, df = 12/77). 

A Type III repeated measures analysis of 
variance (Lindquist, 1956) was performed on 
the conditioning and extinction data. The 
conditioning analysis indicated significant dif- 
ferences between response classes and trials 
(see Table 2). The first-order interactions 
were not significant, but the second-order 
interaction was significant at the .05 level. 
This second-order interaction is difficult to 
interpret. About the only safe statement that 
can be made is that there are differential ef- 
fects of both treatment and response class 
occurring over trials. In view of this significant 
second-order interaction, our conclusions 
concerning the main effects are limited 
accordingly. 

A Duncan analysis comparing treatment 
means over response class means appears in 
the lower portion of Table 2. In general, the 
level of conditioning appears to be partially a 
function of the amount of information given 
to the subjects preceding the conditioning 
task. There also appears to be an interaction 
between the type of reinforcement and the 
amount of information in preconditioning 
instructions. Thus it does not matter what type 
of reinforcement is given if the subjects are 
informed of the complete response reinforcement 
contingency (Groups III and IIIa). On 
the other hand, points seem to serve as a 
better reinforcer than does the verbal response 
good when subjects are only informed of the 
reinforcing stimulus. The significant difference 
in response class indicates that long words 
resulted in better conditioning than either the 
short or medium words. 


TABLE 2 
ANALYSIS OF VARIANCE OF CONDITIONING SCORES 

Source                  df      MS        F 
Treatments (A)           5    1797.2    37.28** 
Response class (B)       2     189.5     3.93* 
A X B                   10      41.3     0.85 
Error (b)               72      48.2 
Total                   89 
Trials (C)               4      39.2     4.45** 
A X C                   20      11.5     1.91 
B X C                    8       9.6     1.09 
A X B X C               40      13.3     1.51* 
Error (w)              288       8.8 
Total                  360 

Duncan Analysis of Treatments and Response 
Class Main Effects (a) 

Treatment 
Group    Control     I       II      IIa     [III or IIIa]   [III or IIIa] 
M         30.40    33.80   37.66   47.26       80.90           86.06 

Response class 
Group    Medium    Short    Long 
M         48.66    50.70    59.30 

(a) Any two means not underscored are significant at the 
level indicated in the analysis of variance. 
* p < .05. 
** p < .01. 


Fig. 1. Mean frequency of correct responses of experimental and control groups per block of 20 trials 
(N = 15 each group). 

An analysis of the extinction results is 
presented in Table 3. Again there is a significant 
second-order interaction and a significant 
treatment effect. A Duncan analysis of the 
treatment effect revealed that Groups III 
and IIIa did not differ from each other, but 
were significantly higher than all other 
groups. There were no differences between the 
remaining groups. The response class data 
were pooled for each of the treatment groups 
and the graph presented in Figure 1 shows 
the mean number of correct responses per 
block of 20 trials. 


Awareness Data 


The posttest questionnaire results were 
analyzed according to the criteria outlined in 
Table 1. The Level 1 awareness category was 
of no use in this study because all experimental 
subjects were able to at least verbalize 
the presence of the reinforcing stimulus. 
There were, however, differences between 
experimental groups in Awareness Levels 2 
and 3. 


TABLE 3 
ANALYSIS OF VARIANCE OF EXTINCTION SCORES 

Source                  df      MS       F 
Treatments (A)           5    365.8    9.12** 
Response class (B)       2     35.5    0.89 
A X B                   10     10.5    0.26 
Error (b)               72     39.9 
Total                   89 
Trials (C)               2      3.5    1.03 
A X C                   10      6.4    1.88 
B X C                    4      2.0    0.58 
A X B X C               20      8.3    2.44 
Error (w)              144      3.4 
Total                  180 

** p < .01. 

The majority of subjects in Experimental 
Groups I, II, and IIa were classified in Awareness 
Level 2, while subjects from Groups III 
and IIIa were predominantly in Awareness 
Level 3. 

When the awareness data and behavioral 
data are jointly considered, a paradoxical 
result is evident. As would be anticipated, 
Groups III and IIIa manifested both a higher 
degree of awareness and higher degree of 
conditioning than all other groups. However, 
despite the fact that the awareness levels for 
Groups I, II, and IIa were the same, there 
were differential degrees of learning between 
the three groups. It was proposed that the 
preconditioning instructions (or lack of them) 
may have been responsible for the differential 
conditioning effects. The awareness data were 
then rescrutinized in the hope of finding some 
clues that might account for this behavioral 
discrepancy. 

Although all subjects were able to specify 
the presence of the reinforcing stimulus, when 
they were questioned as to what the rein- 
forcing stimulus meant to them, their answers 
fell into two crude categories. One type of 
response, which will be referred to as the 
“don’t know” category, included such answers 
as “I don’t know,” “I didn’t pay too much 
attention to it,” and “Probably nothing at 
all.” The second type of response was classi- 
fied as the “correct” category and included 
such answers as “I made up an adequate 
sentence," or “That was the type of sentence 
that you were looking for.” 

A chi square analysis of the frequency of 
subjects who were classified into each of these 
categories from Groups I, II, and IIa was 
performed.³ The chi square was significant 
(χ² = 6.34, df = 2) beyond the .05 level. A 
similar analysis comparing Groups II and IIa 
also resulted in significance (χ² = 6.08, df = 1) 
but the differences between Groups I and II 
were not significant (see Table 4). These 
results are more consistent with the behavioral 
results which showed that Groups I and II 
were not different (although Group II was 
different from the Control group) and Group 
IIa was significantly different from Groups 
I and II. 

³ Subjects who had an awareness level of 3 were 
not considered in this analysis. 


TABLE 4 
FREQUENCY OF SUBJECTS CLASSIFIED ACCORDING TO 
MEANING ATTACHED TO REINFORCING STIMULUS 

              Response category 
Group      Don't know    Correct 
I               7            6 
II              9            5 
IIa             2           10 

Note. χ² = 6.34, p = .05; χ² = 6.08, p = .02 (Groups II 
versus IIa); χ² = 0.28, ns (Groups I versus II). 
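
The chi square tests on the Table 4 frequencies can be sketched in Python as below. The function name is illustrative, and the sketch assumes the ordinary Pearson statistic without a continuity correction, since the computing formula actually used is not stated. Under that assumption the values come out close to, but not identical with, the reported ones (about 6.36, 6.00, and 0.30 against the reported 6.34, 6.08, and 0.28).

```python
def chi_square(table):
    """Pearson chi square for a contingency table given as a list of rows.

    Returns (statistic, df). No continuity correction is applied.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return statistic, df


# Frequencies from Table 4: [Don't know, Correct] for each group.
group_i, group_ii, group_iia = [7, 6], [9, 5], [2, 10]

for label, table in [
    ("Groups I, II, IIa", [group_i, group_ii, group_iia]),
    ("Groups II vs IIa", [group_ii, group_iia]),
    ("Groups I vs II", [group_i, group_ii]),
]:
    statistic, df = chi_square(table)
    print(f"{label}: chi square = {statistic:.2f}, df = {df}")
```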

DISCUSSION 


In general, the results of this study indicated 
that the amount of information supplied 
to the subject prior to the learning task 
greatly facilitates the level of conditioning. 
An initial interpretation of these results might 
be that the higher the level of awareness the 
better the learning. However, awareness per 
se cannot completely account for the results 
of this study. Groups I, II, and IIa subjects 
were all classified at the same level of awareness 
(as defined in this study), yet manifested 
differential degrees of learning. In the present 
research one variable that could account for 
the superior learning of Group II and IIa 
subjects relative to Group I subjects is the 
discriminative value of the instructions. In 
the absence of specific instructions (i.e., 
Group I) the verbal response good is less 
likely to serve as a reinforcer. When instructions 
are given prior to the conditioning task 
regarding the nature of the reinforcement (II 
and IIa), there is a higher probability that 
good or points will be effective reinforcing 
stimuli. Even under these conditions there 
appears to be a differential effect between the 
type of reinforcing stimulus employed. In 
general, subjects who received points learned 
better than subjects who received the verbal 
reinforcement good. The analysis of the 
questionnaire results indicated that under these 
conditions the subjects are more likely to 
interpret points as being “correct,” whereas 
the verbal response good is somewhat more 
ambiguous. Finally, under conditions in which 
the complete reinforcement contingency is 
presented, there appear to be no differential 
effects between points and good reinforcement. 

Actually, the entire experiment could prob- 
ably be viewed more appropriately as a 
problem solving task rather than in terms of 
the usual conditioning paradigm. The subjects 
in Group I did not know that there was a 
problem to be solved. The subjects in Group 
II knew vaguely that they were receiving 
some information concerning the “adequacy” 
of their sentences, but the verbal response 
good may represent ambiguous information. 
The majority of subjects who received points 
(Group IIa) interpreted the points as reinforcement 
for having made a correct response. 
The differential effects of the reinforcement 
are not too surprising when it is 
taken into consideration that the subjects 
were college sophomores who have had an 
extensive history of point reinforcement on 
examinations, grades, etc. 

Another important finding to be taken into 
consideration is that the differential learning 
effects are partially dependent upon the type 
of the response class used in the investigation. 
In this study the long word response class 
resulted in better conditioning effects than 
either short or medium response classes. 
There are two possible reasons for this finding. 
It could be again that college subjects 
have had a history of differential reinforce- 
ment for using long words. Or, again, although 
certainly not specifically intended, the instruc- 
tions to construct an adequate sentence may 
have produced a certain degree of "ego 
involvement" such that the subject perceived 
the situation as some type of intelligence test 
and thus, in order to appear more sophisti- 
cated, would tend to select the longer words. 
That the subjects in the control pilot study 
showed a significant preference for long words 
would tend to confirm such a notion. 

The second reason for the relatively supe- 
rior conditioning of long words may be a 
function of the subject’s ability to discrimi- 
nate this response class from the other re- 
sponse classes. All short words had three 
letters, the majority of medium length words 
had four letters, but the long words had a 
minimum of six letters. It would therefore 
probably be somewhat more difficult to dis- 
criminate between the short and medium re- 
sponse class than either the short and long or 
the medium and long response class. These 
results are consistent with the findings of 
Kanfer and McBrearty (1961) which indi- 
cated that awareness was dependent upon the 
degree of dissimilarity between the response 
alternatives. 

The analysis of the extinction results indi- 
cated that all groups showed the expected 
decline during this phase of the experiment. 
However, Groups III and IIIa continued to 
give a significantly higher number of correct 
responses than any of the other groups. The 
failure to produce extinction in Groups III 
and IIIa could be an artifact of their relatively 
higher level of conditioning and it 
might be expected that there would be a 
continual decline if the extinction trials were 
prolonged. However, there is some speculative 
evidence to suggest that when one takes into 
consideration the instructions given to Groups 
III and IIIa, there would be greater variability 
in performance during the extinction 
phase. The extinction phase under these 
conditions is ambiguous and in effect represents a 
loss of behavioral control by the experimenter. 
An analysis of the questionnaire results of the 
Group III and IIIa subjects indicated a gross 
correlation between their behavior and the 
manner in which they interpreted extinction. 
All subjects but one verbalized that they 
noticed the withdrawal of the reinforcing 
stimulus during the extinction stage. However, 
four subjects who showed only a small decline 
from their conditioning level stated that they 
noticed the experimenter had stopped saying 
good, but they thought he “just wasn't saying 
it anymore." Two subjects responded they 
noticed that the experimenter stopped saying 
good and they tried other responses, but since 
that did not work either they returned to 
giving the same response. Behaviorally these 
two subjects showed a significant decline on 
the first block of extinction trials, which was 
followed by a recovery on the next block of 
trials. Other subjects were more variable in 
their performance and the general tenor of 
their statements was they just tried to make 
up different types of sentences, but nothing 
seemed to work. 

Perhaps the major importance of this ex- 
periment is that it emphasizes the futility of 
attempting to account for learning in terms 
of awareness. Awareness is an explanatory 
construct which in itself does not enable us 
to make very accurate predictions. If the 
assumption is made that “awareness” facili- 
tates learning, a more profitable approach 
would be to directly isolate and investigate 
the variables that determine awareness and 
study their effects on learning. This study has 
shown that instructions, response class, and 
the type of reinforcement are important 
variables which affect both awareness and 
learning. 
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In a laboratory experiment, Ss performed 2 tasks, one unpleasant, the other 


neutral They were then led to expect tl 


of the tasks again. When subsequently presented with a choice, there was an 
increase in the probability of their choosing the expected task, even when it 


was objectively less pleasant. The resul 


extension of the theory of cognitive dissonance. 


In two recent studies, an attempt was made 
to link the theory of cognitive dissonance 
(Festinger, 1957) to other expectancy and 
consistency theories (e.g., Lecky, 1945; Tol- 
man, 1959). In one study (Aronson & Carl- 
smith, 1962), it was shown that if a person’s 
performance was at variance with a strong 
expectancy, he would modify this perform- 
ance to bring it closer to his expected level. 
This occurred whether the crucial perform- 
ance was better or worse than the expected 
performance. In a subsequent experiment 
(Carlsmith & Aronson, 1963), it was demon- 
strated that events which disconfirmed a 
strong expectancy were judged to be less 
pleasant than events which occurred when an 
individual had no expectancy. 

These results suggest some important con- 
sequences of expectancy for subsequent be- 
havior. Specifically, if a disconfirmed expect- 
ancy results in negative affect, then it follows 
that most individuals will attempt to avoid 
disconfirmation. Thus, if one can lead a 
person to expect to perform a particular be- 
havior on a given occasion, he will perform 
that behavior in preference to others, all other 
things being equal. Moreover, if the negative
affect which follows disconfirmation is of considerable
magnitude, then the person will perform
the expected behavior even when all
other things are not equal, even if he has a
choice between performing the expected behavior
and performing a behavior which is
objectively more pleasant. In general, the
stronger the expectancy, the greater the negative
affect following its disconfirmation, and
therefore, the greater the likelihood that the
expected alternative will be chosen.

1 This research was supported by grants from the
National Science Foundation (NSF-G-16838 and
NSF-G-22316) to Elliot Aronson. The experiment
was conducted while Merrill Carlsmith was on the
tenure of an NSF fellowship. The authors are indebted
to Janet Darley for her help in obtaining
subjects.

2 Now at Yale University.

Consider a person faced with a choice between
two alternative actions, one of which
he expects to perform. In this model, there
are two major sources of affect determining
the choice. One is simply the difference between
the desirability of the two alternatives.
The second is the presumed negative affect
that would result from the disconfirmation of
an expectancy if the expected alternative were
not chosen. If the person expects to perform
the less desirable of the two alternatives, and
if his anticipation of the negative affect following
disconfirmation is greater than the
difference between the desirability of the expected
action and its alternative, then the
person will perform the act he expects to
perform.
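
The decision rule in this paragraph can be written as a single inequality. The symbols are introduced here only for illustration and are not part of the model as stated: $D_e$ and $D_a$ stand for the desirability of the expected action and of its alternative, and $A$ for the anticipated negative affect of disconfirming the expectancy. A minimal sketch, assuming these quantities can be placed on a common scale:

```latex
\[
  \text{the expected action is chosen if} \quad A > D_a - D_e ,
  \qquad \text{where } D_a > D_e .
\]
```

On this reading, a very weak expectancy corresponds to $A$ near zero, and the choice is then governed by the difference in desirability alone.
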
Suppose a person arrives at a meeting and
is told that he will have to make an after-
dinner speech. Suppose further that he detests
the idea of making speeches, but perceives no
choice in the matter. All during dinner, he
envisions himself going through this unpleasant
ordeal. If, just after dinner, he were
offered his choice between speaking and not
speaking, we would predict that he would be
more likely to choose to speak than if he had
been offered his choice at the outset. In an
almost literal sense, he is all dressed up with
no place to go.

On the other hand, it seems reasonable that 
the negative affect resulting from disconfirma- 
tion will rarely be of such magnitude as to 
override the extreme negative aspects of some 
undesirable expected alternatives. For ex- 
ample, if a person firmly expects to die a 
painful death, and at the last moment is 
offered a choice between this alternative and 
a long and happy life, we have few doubts as 
to the nature of his choice. 

Our model suggests an effective technique 
for the control of human behavior. Stated in 
bold terms: if you want a person to perform 
an unpleasant task, lead him to form a strong 
expectancy of performing it. Subsequently, if
faced with a choice between performing this 
task and a more desirable one, the prob- 
ability of his choosing the expected task will 
increase. 

PROCEDURE


In outline, the steps of the procedure were: (a) 
The subjects performed two tasks, one unpleasant, 
the other neutral; (b) some subjects expected to 
repeat the unpleasant task, others expected to repeat 
the neutral task; (c) some subjects held the expect- 
ancy for 20 minutes, others for only a few seconds; 
(d) all the subjects were asked which task they 
wished to repeat. 

The subjects were 59 female college students who 
were paid volunteers for an experiment in psychophysics.
It was presented to the subjects as an experiment
in “cross-modal psychophysics.” After a general
explanation of psychophysical procedures, two psy-
chophysical tasks of the same duration were ad-
ministered to each subject; the order of administra-
tion of the tasks was randomized. One task involved
making successive comparative judgments about the
relative heaviness of small weights; the other involved
making successive comparative judgments of the rela-
tive bitterness of solutions of quinine sulfate.

The weights were red plastic collapsible drinking 
cups, weighted with lead shot. Each subject was 
given two weights, lifted both, and reported which 
was heavier. This procedure was continued for
approximately 8 minutes.

Three solutions were used; one was ordinary tap 
water, the other two were diluted solutions of 
quinine sulfate. The quinine sulfate solutions con- 
tained, respectively, .14 and .23 grams per liter of 
water, both of which have a very unpleasant taste. 
Each subject was given two cups, each containing 
approximately 4 cubic centimeters of solution, and
instructed to drink first one, then the other, and
report which tasted more bitter. This procedure was
continued for approximately 8 minutes. 

The comparative judgments of the weights and the 
solutions were used as a pretext for having the sub- 
jects perform both tasks; no further use will be 
made of these judgments in this report. 

Following the completion of both tasks, the con- 
cept of test reliability was explained to the subjects, 
and the experimenter announced that the subjects 
would have to repeat one, but not both, of the psy- 
chophysical tasks “exactly as before.” The subjects 
were then told which task they were to repeat, and 
were told that the assignment was random. 

All of the subjects were instructed to wait for an 
unspecified interval in an adjoining room. Some were
allowed to wait for 20 minutes (Strong Expectancy 
condition), others waited for approximately 7 seconds 
(Weak Expectancy condition). After the specified 
time had elapsed, each subject was given the follow- 
ing explanation by a flustered experimenter: 


I’m sorry, I forgot that we have changed the
experimental procedure somewhat. A preliminary
analysis of the data shows that it makes absolutely 
no difference in the results whether or not we 
randomly assign the task you have to repeat. 
Therefore, I think it’s only fair to give you the 
choice of what task to repeat. It’s entirely up to 
you; do you want to lift the weights again, or 
taste the quinine solution again? It really makes 
no difference to us. 


When presented with this choice, the subjects 
usually decided on one or the other of the alterna- 
tives. A few subjects expressed a willingness to do 
whichever task the experimenter preferred. In these 
cases the experimenter repeated that he did not care, 
that he wanted the subject to do whichever she preferred,
and waited for the subject to make a choice.
As soon as the subject made her choice, the experi- 
ment was concluded. At the end of the session, the 
purpose of the experiment and the theory under- 
lying it were explained to each subject, and she was 
told exactly where she had been deceived, and why. 

There were four possible experimental conditions: 
expectation of taste for 20 minutes; expectation of 
taste for 7 seconds; expectation of weights for 20 
minutes; expectation of weights for 7 seconds. For 
reasons which will become apparent during the dis- 
cussion of the results, unequal numbers of subjects 
were run in these conditions. Twenty-one subjects 
expected taste for 20 minutes, 21 subjects expected 
taste for 7 seconds, 12 subjects expected weights for 
20 minutes, and 5 subjects expected weights for 7 
seconds. The subjects were assigned to conditions 
randomly, in accordance with the above distribu- 
tion; the conditions were all run concurrently. 


RESULTS 


We predicted that if a person expected to
perform some behavior in the future, she would
be more likely to choose to perform that behavior
when presented with a choice, even if
the behavior was objectively less desirable
than the alternative presented. The first result
relevant to this prediction is presented in
Table 1. This table shows the relation between
the behavior the subject expected to
perform and the behavior she subsequently
chose to perform. It presents results only for
those subjects in the Strong Expectancy conditions,
i.e., for those subjects who waited
20 minutes before being presented with a
choice. Several things may be noted about
this table. First, there is a strong bias toward
choice of weights, irrespective of expectancy.
This reflects the fact that there was a large
difference between the relative attractiveness
of the two tasks. No subject who expected to
lift weights chose to taste the solutions. In
fact, several of these subjects expressed some
doubts about the sanity of the experimenter
in presenting such a choice. Of the subjects
who expected to taste the solutions, however,
43% chose to taste the solutions. This difference
is significant at the .02 level (Fisher
exact test, two-tailed).³

TABLE 1

INDIVIDUAL CHOICE AS A FUNCTION OF EXPECTANCY:
STRONG EXPECTANCY ONLY

                 Choice
Expected     Taste    Weight
Taste          9        12
Weight         0        12

Although the experimenter was careful to
emphasize that it made absolutely no difference
to him which task the subject chose, it
is possible to argue that the subject nevertheless
had some residual feeling that, deep
down, the experimenter would have preferred
that she perform the task he had originally
assigned. To check this possibility, we may
look at the comparison between subjects who
waited 7 seconds and subjects who waited 20
minutes. Uncertainty about the “real” desires
of the experimenter should be the same in
both of these groups, since the original assignment
to task and the subsequent choice were
presented identically in both groups. There is
no theoretical reason to expect changes over
time in the subject’s uncertainty about the
real desires of the experimenter. However, the
effect of expectancy on the choice of behavior
should, theoretically, vary over time.
It seems reasonable to assume that the longer
a person holds an expectancy, the stronger the
expectancy. In fact, an expectancy that is
held for only 7 seconds is probably so weak
that it would be of little consequence. Therefore,
we would predict little or no effect of
expectancy on the choice behavior of subjects
in this condition; their choice should be dictated
primarily by the attractiveness of the
two alternatives. In the group which waited
20 minutes, on the other hand, there was adequate
time for a firm expectancy to develop,
and consequently we would predict that subjects
in this condition would more frequently
choose the expected alternative.

3 Note that there were 12 subjects in the condition
who expected weights, whereas there were 21 subjects
who expected taste. This disparity is due to
the fact that in the condition expecting weights,
no subject chose taste. This proportion (and the
incredulous reactions of these subjects when presented
with the option of choosing taste after expecting
weights) convinced the experimenters that little
more information would be gained by running more
subjects in this condition.

To test this, we compared the choice be- 
havior of subjects who spent 7 seconds expect- 
ing to taste the bitter solutions with that of 
the subjects who spent 20 minutes expecting 
to taste the solutions. The results are shown 
in Table 2. It is clear that more people chose 
to taste the bitter solutions when they waited 
20 minutes before being offered their choice. 
The difference is significant at the .04 level 
(Fisher exact test, two-tailed). 
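
The two Fisher exact tests reported for Tables 1 and 2 can be reproduced from the cell counts alone. The following is a minimal sketch using only the Python standard library. The function name is illustrative, and the two-tailed convention used (summing every table that is no more probable than the observed one) is an assumption about how the two-tailed values were obtained, not a description of the original computation.

```python
from math import comb


def fisher_exact_two_tailed(a, b, c, d):
    """Two-tailed Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2

    # Hypergeometric weight of the table whose top-left cell equals k,
    # with all margins held fixed (kept as an integer to avoid rounding).
    def weight(k):
        return comb(row1, k) * comb(row2, col1 - k)

    k_min = max(0, col1 - row2)
    k_max = min(row1, col1)
    observed = weight(a)

    # Sum every table that is no more probable than the observed one.
    extreme = sum(
        weight(k) for k in range(k_min, k_max + 1) if weight(k) <= observed
    )
    return extreme / comb(n, col1)


# Table 1 (Strong Expectancy only): rows = expected taste, expected weight;
# columns = chose taste, chose weight.
p_table1 = fisher_exact_two_tailed(9, 12, 0, 12)

# Table 2 (expectancy of taste only): rows = Weak, Strong;
# columns = chose taste, chose weight.
p_table2 = fisher_exact_two_tailed(2, 19, 9, 12)

print(round(p_table1, 3), round(p_table2, 3))
```

Under these assumptions the two-tailed values come out at roughly .012 for Table 1 and .032 for Table 2, in line with the reported .02 and .04 levels.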

Table 2 includes only subjects who expected
to taste solutions. To complete a
balanced design, it would have been necessary 
to compare subjects in the Weak Expectancy 
condition who expected to lift weights with 
subjects in the Strong Expectancy condition 
who expected to lift weights. However, such a 
comparison is uninteresting; there is no reason 
to expect differences since subjects who did 
not expect taste showed a clear and un- 
ambiguous preference for weights. To check 
this reasoning, five subjects were run in 
the Weak Expectancy condition expecting 
weights. As anticipated, all five quickly chose 
weights. 

TABLE 2

CHOICE BEHAVIOR AS A FUNCTION OF STRENGTH
OF EXPECTANCY:
EXPECTANCY OF TASTE ONLY

                 Strength of expectancy
Choice       Weak            Strong
             (7 seconds)     (20 minutes)
Taste            2               9
Weight          19              12


These results might be interpreted in terms 
of some memory decay function. If the mem- 
ory for the unpleasant aspects of the bitter 
taste were to fade during the 20-minute wait, 
then the subjects in the 20-minute condition 
would be more likely to choose taste than the 
subjects in the 7-second condition. However, 
since the order of presentation of tasks was
varied, it is possible to make an analysis 
which seriously weakens this interpretation. 
Consider the 7-second condition. For subjects 
who were given the taste test first, there was 
an interval of approximately 9.5 minutes (8 
minutes lifting weights, 1.5 minutes for in- 
struction) between the last taste experience 
and the choice, but only 7 seconds between 
the onset of the expectancy and the choice. 
For subjects who were given the weight task 
first, there was an interval of 1.5 minutes 
between the last taste experience and the 
choice. Unless we postulate a decay function 
which is flat for 9.5 minutes and then falls
abruptly, or some extraordinary nonmono- 
tonic function, it is clear that this alternative 
interpretation would lead to the prediction 
that there would be systematic differences 
between the subgroups that lifted weights first 
and the subgroups that tasted solutions first. 
No such differences appeared. 

The results show clearly that the existence 
of an expectancy about the inevitability of the 
performance of some act increases the likeli- 
hood of a subsequent choice of that act. 
Further, this effect is greater if the expect- 
ancy is held for 20 minutes than if it is 
held for 7 seconds. However, this experiment
sheds little or no light on the processes involved.
It might be of some value to speculate
briefly as to the possible nature of these processes.
There are at least four possibilities.




1. Set. If a person has a strong expectancy 
to perform a particular task he is cognitively 
(and perhaps physiologically) set for it. It is 
well known that in certain types of problem 
solving experiments or in simple reaction time 
experiments, the effect of set is to facilitate 
the response for which the subject is pre- 
pared. Our experiment is not clearly analogous 
to these, differing most sharply in two re- 
spects. First, the subject has a reasonable 
length of time to make a careful decision. 
Second, one of the alternatives is much more 
attractive than the other. Nevertheless, it may 
be that a similar process is involved. This 
process is typically described as a muscular 
or cognitive preparation for a response which 
increases the likelihood of the occurrence of 
that response to a variety of stimuli (e.g., see 
Holt, 1931). 

2. Anticipation of dissonance. As previously 
discussed, choice of an alternative different 
from the one expected should lead to dissonance.
The cognition “I am prepared to
taste a bitter solution” is dissonant with the
cognition “I will not taste a bitter solution.”
If the subject chooses to lift weights, he will 
experience dissonance. If he can anticipate 
this dissonance, he will choose to taste the 
bitter solution. 

3. Active preparation during expectancy. 
If, during the period of expectancy, the person
actively prepares himself for the expected
task, then his cognition that he worked hard 
to prepare for a given task may enhance the 
attractiveness of the task. Previous experi- 
ments by Aronson and Mills (1959) and 
Aronson (1961) demonstrated that the larger 
the expenditure of effort, the greater the 
increase in the attractiveness of the goal 
object. Thus, if a person spent 20 minutes 
actively preparing to taste the bitter solution, 
he might find the task more attractive (e.g., 
convince himself that his task had more 
scientific value, that the solutions were good 
for him, etc.). If this were the case, the choice 
of the expected task would reflect the choice 
of the now more attractive alternative. 

4. The perceived attractiveness of the bitter
solution may have increased for still another
reason. In general, a person’s cognition that
he is about to drink a bitter solution is dissonant
with his cognition that it will taste
unpleasant. Of course, in this experiment, the
subjects’ cognition that the experimenter 
asked them to drink the solution is highly 
consonant with drinking it—thus sharply 
reducing the overall dissonance. Nonetheless, 
if any dissonance remained it might lead the 
subjects to convince themselves that the task 
was less unpleasant than they had previously 
thought. 

No attempt was made in this experiment to 
distinguish among these processes. Our pur- 
pose was to provide a clear demonstration of 
the existence of the phenomenon. Further 
experimentation should make it possible to 
distinguish among these alternative processes. 

2 experiments were conducted in which passive immunization was achieved by
providing Ss with counterarguments against persuasive messages to which they
were later exposed. In both studies, this procedure produced a significant tendency
to displace the perceived content of the “change” message toward the stand
advocated in the immunization message. These results were tentatively attributed
to the Ss’ attempts to minimize the discrepancy between the opposing
statements, both of which appeared to emanate from prestigeful sources. Judgments
were not affected by active immunization, a procedure in which Ss attempted
to refute (in writing) weakened versions of the change message to
which they were later exposed.


In a recent series of papers, Manis (1961a, 
1961b) has suggested that distorted inter- 
pretations of persuasive messages may often 
result from the recipient’s attempts to main- 
tain his existing beliefs in relatively un- 
changed form. Thus, given the fact that most 
messages can be reasonably interpreted in a 
variety of ways, it is assumed that the re- 
cipient will select that interpretation which is 
least challenging to his convictions. 

One implication of this viewpoint is the 
prediction that there should be little distortion 
when the veridical interpretation of an in- 
coming message presents virtually no threat 
to the recipient’s beliefs. In a previous experi- 
ment (Manis, 1961b), this prediction was 
tested by presenting subjects with a series of 
persuasive messages, some of which appeared 
to come from low prestige communicators, 
thus presumably generating little pressure to 
change; under these conditions, as predicted, 
there was virtually no distortion, although 
these same messages were consistently assimi- 
lated towards the recipient’s own position 
when they were presented as coming from 
high prestige sources. 

The present experiments were designed to
provide an additional test of the relationship
between pressure to change and message distortion;
in this case, immunization procedures
were employed (McGuire, 1961) in an attempt
to temporarily alter the recipient’s
influenceability. In essence, after an immunization
session in which they were given
“defenses” against the persuasive communications
which followed, subjects were asked to
indicate the positions advocated in a series
of messages which opposed their views. It
was anticipated that immunization would attenuate
the effective pressure to change and
hence would reduce the subjects’ motivation
to distort, as compared to subjects in a control,
“no immunization” condition.

1 All statements are those of the authors and do
not necessarily represent the opinions or policy of
the Veterans Administration. The authors wish to
thank E. L. Kelly for editorial suggestions in the
preparation of this report.


EXPERIMENT I 


Experiment I was designed to assess the ef- 
fects of passive immunization on the inter- 
pretations elicited by persuasive messages; 
immunization was achieved by presenting sub- 
jects with fairly detailed counterarguments 
before exposing them to the change messages. 
There were three main experimental condi- 
tions: Specific Immunization (subjects had 
been prepared for the specific arguments 
which were later presented), General Immu- 
nization (the specific arguments that the sub- 
ject had been prepared for did not appear in 
the change message), and No Immunization. 


Method²


Subjects. This study was conducted with 30 college
students serving as subjects; data were collected during
regular class hours.


2 The methods and experimental materials which
were employed in this study were adapted, with
minor changes, from the procedures developed by
McGuire, in his studies of immunization (McGuire,
1961; McGuire & Papageorgis, 1961). The authors
are grateful to W. J. McGuire for his help in graciously
providing these materials.

Procedure. In order to disguise the purpose of the
experiment and heighten motivation, subjects were
told that the experimenters were attempting to de-
velop a test for identifying and selecting people who
were gifted in scientific areas; it was further ex-
plained that all the test materials would be con-
cerned with a relatively familiar subarea, namely,
that of health and medicine.

The experiment was conducted in two sessions. In
the first session, subjects were immunized; in the
second session, they were presented with three per-
suasive messages which were designed to discredit
the widespread use of penicillin, daily tooth brush-
ing, and annual physical checkups. Immunization was
achieved by having subjects read mimeographed mes-
sages which contained counterarguments against some
“misleading” stories that had supposedly been circu-
lating, in criticism of the abovementioned, widely ac-
cepted health practices. For example, one immuni-
zation message defended penicillin against criticisms
that it was a superficial treatment which had pro-
duced a slowdown in research on other antibiotics.

All immunization messages were presented as hav-
ing been written by physicians affiliated with major
medical centers. Upon receiving the immunization
passages, subjects were asked to underline in each
paragraph the one clause or sentence which most
thoroughly and briefly summarized the “general point
being made in that paragraph,” as a means of as-
sessing their “reading skills and thinking ability.”

In the second session, which was conducted 5 days
after the first, subjects were presented with messages
which attacked their views concerning each of the
experimental issues. As was true of the immunization
materials, each change message was purportedly writ-
ten by a high prestige person, a physician affiliated
with a major medical center. Since previous research
(Manis, 1961b) has shown that subjects who have
not been immunized will tend to assimilate prestige-
ful communicators towards their own position, re-
gardless of the communicator-recipient discrepancy,
the use of the high prestige sources justified the as-
sumption that the communicator’s “true” position on
the attitude continuum was farther from the re-
cipient’s stand than he (the unimmunized recipient)
had inferred.

Upon receiving the change messages, subjects un-
derlined the “most important section” of each para-
graph; when this had been completed, for each mes-
sage, subjects were presented with four statements
of opinion and were asked to rate the communica-
tor’s probable degree of agreement with each, using
a 15-point scale which ranged from “definitely agree”
to “definitely disagree.” For each message topic, the
average of the ratings obtained from the four opin-
ion items was used as a measure of the communica-
tor’s perceived scale position; these average ratings
constituted the major dependent variable of the
study.

3 Previous research by McGuire and Papageorgis
(1961) indicates that among college students, there
is virtual unanimity of opinion in support of these
practices.

As a means of assessing the effectiveness of the 
immunization procedures, subjects were next asked
to indicate their own beliefs regarding the three 
health issues, using the same rating scales. 

When the data collection had been concluded, sub- 
jects were informed as to the true purpose of the 
experiment. 

Table 1 presents the experimental design in sum- 
mary form. Each row represents (in order) the treat- 
ments administered to a single experimental group 
(n = 5); the columns represent the three topic areas.
The “Is” indicate the topic areas in which a given
group of subjects was immunized; the “Cs,” which
appear within every cell, indicate that all groups re-
ceived change messages dealing with each of the ex- 
perimental topics. When the subscripts within a cell 
are identical, this is an indication that the immuni- 
zation message was specifically concerned with the 
arguments that were later raised in the change mes- 
sage; if the subscripts do not match, this is a sign 
that the subjects had been immunized, but not 
against the arguments which were later raised in the 
change message. 

The design shown in Table 1 controls for the fol- 
lowing factors: 

1. Each of the immunization treatments (Ia, Ib, Ic,
etc.) was followed by specific attack for some sub- 
jects, and by generalized attack for others. 

2. Each change message (Ca, Cb, Cc, etc.) was preceded
by Specific Immunization for one group, Gen-
eral Immunization for another, and No Immuniza- 
tion for a third. 

3. In responding to the change messages, there are 
six possible orders in which the three experimental 
conditions might occur (for example, Specific Immu- 
nization, Generalized Immunization, No Immuniza- 
tion; Generalized Immunization, No Immunization, 
Specific Immunization, etc.); each of these possible 
orders was administered to one of the six experi- 
mental groups. 
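
The counterbalancing described in point 3 amounts to enumerating the permutations of the three conditions. A minimal sketch in Python follows; the group labels are placeholders used only to index the six orderings and are not meant to correspond to the row labels of Table 1.

```python
from itertools import permutations

CONDITIONS = (
    "Specific Immunization",
    "General Immunization",
    "No Immunization",
)

# Every ordering of the three conditions; 3! = 6 orderings in total.
orders = list(permutations(CONDITIONS))

# Hypothetical group labels, one per ordering.
assignment = {f"Group {i + 1}": order for i, order in enumerate(orders)}

for group, order in assignment.items():
    print(group, "->", ", ".join(order))
```

Because all six orderings are used once, each condition falls in each serial position for exactly two of the six groups.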


TABLE 1
EXPERIMENTAL DESIGN

[Table: six experimental groups (rows) by three message topics (columns: penicillin, tooth brushing, annual checkups). Each cell pairs an immunization treatment (I) with a change message (C).]

Note. See text.

Results


To eliminate those effects attributable to 
differences between the various sets of opinion 
items, and differences in the “extremeness” of 
the various change messages, constants were 
added to the judgment data, so that the cor- 
rected mean for each message (pooling the 
three experimental conditions) would be equal 
to the common grand mean. The own opinion 
data were corrected in a similar fashion. 
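
The correction described here is an additive shift applied separately to each message. A minimal sketch in Python, assuming the grand mean is taken over all pooled ratings; the function name and the sample ratings are invented for illustration and are not data from the experiment.

```python
from statistics import mean


def equate_message_means(ratings_by_message):
    """Shift each message's ratings so every message mean equals the grand mean."""
    all_ratings = [r for ratings in ratings_by_message.values() for r in ratings]
    grand_mean = mean(all_ratings)
    corrected = {}
    for message, ratings in ratings_by_message.items():
        # Additive constant that moves this message's mean onto the grand mean.
        constant = grand_mean - mean(ratings)
        corrected[message] = [r + constant for r in ratings]
    return corrected


# Made-up ratings, for illustration only.
example = {
    "message_a": [4.0, 6.0, 5.0],
    "message_b": [9.0, 11.0, 10.0],
}
corrected = equate_message_means(example)
```

Adding a constant leaves the differences among ratings of the same message untouched, so comparisons among the experimental conditions within a message are preserved while differences between messages are removed.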

Figure 1 presents the mean judgments 
elicited by the change messages under the 
three experimental conditions; judgments that 
are low on the ordinate reflect a belief that 
the communicators held unfavorable opinions 
regarding the various experimental issues. 
The subjects’ own opinions, as assessed at
the end of the experiment, are also shown.
Each data point represents the average response
to six different messages (n = 5 for
each message).


FIG. 1. The effects of passive immunization on the
perceived stand of an opposing communicator, and
the recipients’ final opinions. (Ordinate: scale position.
Abscissa: immunization condition: none, general, specific.
Curves: recipient’s opinion; communicator’s perceived stand.)


The two sets of data, representing the sub- 
jects’ judgments and their opinions, were ana- 
lyzed separately, using Lindquist’s (1953) 
Type I design; the immunization variable 
constituted the effect within subjects, while 
the effect between subjects was represented 
by the six different experimental groups (see 
Table 1). 

Analysis of the judgment data indicated 
that the three immunization conditions had 
produced reliably different results (p < .05), 
while the Group variable and the Group ×
Immunization interaction failed to achieve
significance. Immunization also had a signifi- 
cant effect (p < .05) on the subjects’ own 
opinions; as depicted in Figure 1, subjects 
who received immunization were more resistant
to the change messages (i.e., were better
able to maintain their initially favorable atti- 
tudes toward the experimental issues) than 
were subjects who were not immunized. 


Discussion 


As noted above, previous research con- 
ducted with nonimmunized subjects (Manis, 
1961b) has shown that in responding to mes- 
sages from prestigeful sources, subjects typi- 
cally displace the communicator toward their 
own favored position; it is thus reasonable to 
assume that in the present experiment, sub- 
jects in the No Immunization condition per- 
ceived the average communicator as occupy- 
ing a position which was more in keeping with 
their own views (i.e., higher on the Y axis),
than would be justified by an “objective”
reading of the messages. Since immunized 
subjects perceived the average communicator 
as being even higher on the Y axis than did 
the nonimmunized subjects, the above rea- 
soning leads to the conclusion that, contrary 
to prediction, immunized subjects showed a 
greater tendency to displace the communica- 
tors away from their “true” positions at the 
low end of the opinion scale, than did sub- 
jects who were not immunized. 


EXPERIMENT II 


Experiment II was designed to investigate 
the replicability and generality of the results 
reported above. While Experiment I was
solely concerned with the effects of passive
immunization, Experiment II employed both
passive and active techniques; in active immunization,
subjects were called upon to re-
fute the arguments that were later presented 
(in a stronger form) in the change messages. 
Experiment II was also designed to test the 
hypothesis that subjects whose views were 
salient while they were judging the change 
messages, would show a greater tendency to 
displace the communicators toward their own 
positions, than would subjects whose views 
were not salient. In this experiment it was 
assumed that saliency was achieved by hav- 
ing the subjects write out their own views on 
the various experimental issues, just before 
exposing them to the change messages. 


Method 


Subjects. Experiment II was also conducted using 
college student subjects (N = 72); data were again
collected in regular class hours. 

Procedure. In the main, Experiment II followed
the procedures of Experiment I; thus, the experi- 
ment was again introduced as an attempt to de- 
velop a test for selecting people who were gifted in 
scientific areas. The penicillin and tooth brushing 
topics were retained for Experiment II; however,
the annual medical checkup issue, which had been 
relatively insensitive to the experimental manipula- 
tions of Experiment I, was replaced by “the desira- 
bility of X-ray examinations as a means of detect- 
ing TB.” 

In the first experimental session, subjects were 
randomly assigned to passive and active immuniza- 
tion conditions. The passive immunizations were 
conducted exactly as in Experiment I; active immu- 
nization consisted of having the subjects attempt to 
refute, in writing, weakened versions of the points 
that were later raised in the change messages. 

In the second experimental session, which was con- 
ducted 2 days after the first, in an attempt to 
heighten the saliency of their views, half the sub- 
jects (randomly selected) were asked to write state- 
ments concerning their own opinions in each of the 
experimental areas; as a control condition, the re- 
maining subjects wrote statements concerning their 
views in other medical areas, irrelevant to the main 
experimental issues (the merits of aspirin, the im- 
portance of daily exercise, and the importance of a 
balanced diet). 

After completing their statements, subjects were 
presented with three change messages; they were 
given 3-3.25 minutes in which to read each mes- 
sage and to rate the communicator’s probable re- 
sponse to the associated set of four opinion items. 
As in Experiment I, the subjects’ responses to each 
set of items were averaged, to yield a measure of 
the communicator's perceived scale position. At the
conclusion of the second session, all subjects indi- 
cated their own opinions concerning each of the 
health issues, using these same items. When these 
procedures had been concluded, subjects were informed
as to the true purpose of the study.

Experimental design. As in Experiment I, each sub- 
ject served under three experimental conditions: No 
Immunization, General Immunization, and Specific 
Immunization. The design which is summarized in 
Table 1 was employed for each of the four experi- 
mental conditions which resulted, when all com- 
binations of active versus passive immunization, and 
salient versus nonsalient opinion arousal are consid- 
ered. In essence, it is as though Table 1 were made 
four layers “thick,” with each layer corresponding 
to one combination of the activity versus passivity, 
and saliency versus nonsaliency variables (for ex- 
ample, active-salient, passive-salient, etc.). There 
were 18 subjects randomly assigned to each layer, 
3 subjects to each of the six subgroups that are rep- 
resented in Table 1. 


Results 


Before analyzing the data of Experiment 
II, it was again necessary to eliminate the 
differences which existed between the various
change messages and their associated opinion
items; as in Experiment I, this was accomplished
by adding constants to the obtained
ratings, so as to set the mean judgment
elicited by each message equal to the grand 
mean for all messages. A similar corrective 
procedure was applied to the own opinion 
data. 
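The additive correction described above can be illustrated with a minimal sketch. The function name and the ratings below are hypothetical and are not the study's data; the sketch also assumes equal numbers of ratings per message, in which case the grand mean of all ratings and the mean of the message means coincide.

```python
from statistics import mean


def equalize_message_means(ratings_by_message):
    """Shift each message's ratings by a constant so that every message
    mean equals the grand mean (hypothetical helper, for illustration)."""
    all_ratings = [r for ratings in ratings_by_message.values() for r in ratings]
    grand_mean = mean(all_ratings)
    corrected = {}
    for message, ratings in ratings_by_message.items():
        shift = grand_mean - mean(ratings)  # additive constant for this message
        corrected[message] = [r + shift for r in ratings]
    return corrected


# Made-up ratings; the three issue names follow the text of Experiment II.
raw = {
    "penicillin": [2.0, 3.0, 4.0],
    "tooth brushing": [4.0, 5.0, 6.0],
    "X-ray": [3.0, 4.0, 5.0],
}
corrected = equalize_message_means(raw)
corrected_means = {message: mean(values) for message, values in corrected.items()}
```

Because only a constant is added within each message, differences among subjects who judged the same message are left unchanged; only the between-message differences are removed.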

When the judgment data were subjected to 
an analysis of variance, none of the main ef- 
fects approached significance; the interaction 
between type of immunization and degree of 
immunization was, however, significant at the 
.10 level. That is, the difference between the 
judgments of subjects who had been actively 
and passively immunized, tended to vary, de- 
pending upon whether they had been spe- 
cifically immunized, generally immunized, or 
not immunized; these data are shown in 
Figure 2. The subjects’ own views, as as- 
sessed at the end of the experiment, are also 
shown; each of the data points in Figure 2 
represents the pooled results of six different 
messages (n = 6 for each message). 

Inspection of Figure 2 suggests that pas- 
sive immunization led subjects to displace 
their judgments toward the positive end of 
the scale, with this effect being most pro- 
nounced following Specific Immunization; in 
essence, these results constitute a replication
of the effects obtained in Experiment I (see 
Figure 1). In order to obtain a more precise 


statistical evaluation of this trend, for each of 


Fig. 2. The effects of active and passive immunization
on the perceived stand of an opposing communicator
and the recipients’ final opinions.


the passively immunized subjects, a correla- 
tion coefficient (tau) was computed between 
degree of immunization and the communica- 
tor’s perceived scale position. In computing 
tau, the No Immunization condition was as- 
signed a rank of 1, Generalized Immuniza- 
tion a rank of 2, and Specific Immunization 
a rank of 3, while the subjects’ corrected 
judgments (see above) were ranked from low 
to high. For the passively immunized subjects,
the results of this analysis confirmed the
monotonic trend that is suggested in Figure 2;
i.e., the average tau was significantly greater
than zero (mean tau = .18, t = 1.85, df = 35,
p < .05, one-tailed test). For subjects who
had been actively immunized, the mean tau
was negative, but not significantly different
from zero. 
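The per-subject tau analysis may be easier to follow with a small sketch. It assumes the simplest form of Kendall's coefficient (tau-a, in which tied pairs count as zero) followed by an ordinary one-sample t test on the subjects' taus; the text does not say which variant of tau was used, and the judgments below are invented for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant pairs) / total pairs."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            product = (x[i] - x[j]) * (y[i] - y[j])
            if product > 0:
                score += 1      # pair ordered the same way on both variables
            elif product < 0:
                score -= 1      # pair ordered in opposite ways
    return score / (n * (n - 1) / 2)


def one_sample_t(values):
    """t statistic for the hypothesis that the mean is zero, with df = n - 1."""
    n = len(values)
    t = mean(values) / (stdev(values) / sqrt(n))
    return t, n - 1


# Ranks 1, 2, 3 stand for No, General, and Specific Immunization.
CONDITION_RANKS = [1, 2, 3]

# Hypothetical corrected judgments, one row per subject (not the study's data).
judgments_by_subject = [
    [3.1, 3.4, 3.9],
    [4.0, 3.5, 3.8],
    [2.5, 3.0, 2.8],
    [3.0, 3.2, 3.6],
]

taus = [kendall_tau_a(CONDITION_RANKS, j) for j in judgments_by_subject]
t_value, df = one_sample_t(taus)
```

With three conditions per subject, each tau can take only a few values (−1, −1/3, 1/3, 1 when there are no ties), so the test rests on the distribution of these values across subjects; with 36 passively immunized subjects the degrees of freedom are 35, as reported.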

An analysis of variance for the own opin- 
ion data indicated that the effects attributable 
to degree of immunization (specific, general, 
or none) approached significance (p < .07);
as shown in Figure 2, subjects who had been
specifically immunized tended to be more
successful in retaining their initially favorable
attitudes than did those who had been
generally immunized, while subjects in the
No Immunization condition were most influenced
by the change messages. While there
was virtually no difference between the results
obtained following active, as opposed to
passive, immunization, there was a slight tendency
(p < .12) for subjects in the “salient”
groups (i.e., subjects who had written messages
concerning the experimental issues at
the start of the second session) to be relatively
resistant to the change messages.


Discussion 


Experiments I and II revealed similar 
trends, in that passive immunization led sub- 
jects to view the communicators as being more 
favorable toward the experimental issues, with 
this effect being most marked when subjects 
were prepared for the specific arguments that 
appeared in the change messages. As noted 
above, previous research (Manis, 1961b) sug- 
gests that the No Immunization condition 
probably elicited judgments which were more 
favorable than would be warranted by an 
"objective" reading of the messages. Con- 
trary to prediction, passive immunization 
seems to have facilitated this type of distor- 
tion, since it typically resulted in predictions 
which were displaced even farther toward the 
positive end of the scale. Active immunization,
on the other hand, seems to have had no
consistent effect on the subjects’ judgments. 

In attempting to arrive at a possible ex- 
planation for these results, it was noted that 
in both experiments, there was a rough paral- 
lelism between the curves depicting the opin- 
ions and the judgments of the passively im- 
munized subjects (see Figures 1 and 2). This 
suggested that the relationship between de- 
gree of passive immunization, and subsequent 
message interpretation, may have been mainly 
attributable to the effects of immunization on 
attitude. 

To evaluate this hypothesis, for all subjects
who had been passively immunized (total
N = 66), a partial tau was computed between
degree of immunization and the communicators’
perceived scale positions, with
the subjects’ own attitudes toward the three
experimental issues held constant. The results
indicated that the immunization procedure
had not affected the subjects’ judgments
through the mediating influence of their
attitudes, for the mean partial tau was significantly
greater than zero (mean partial tau
= [value illegible], t = 2.03, df = 65, p < .05).
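For readers unfamiliar with partial rank correlation, the sketch below uses Kendall's standard partial tau formula, which adjusts the tau between two variables for their separate taus with a third. The text does not state the exact computation that was used, so this is an assumption, and the input values are invented.

```python
from math import sqrt


def partial_tau(tau_xy, tau_xz, tau_yz):
    """Kendall's partial tau between x and y with z held constant.

    x: degree of immunization, y: perceived scale position,
    z: the subject's own attitude (labels chosen to match the text).
    """
    denominator = sqrt((1 - tau_xz ** 2) * (1 - tau_yz ** 2))
    if denominator == 0:
        raise ValueError("partial tau is undefined when a tau with z is +/-1")
    return (tau_xy - tau_xz * tau_yz) / denominator


# Hypothetical taus for one subject, for illustration only.
example = partial_tau(tau_xy=0.5, tau_xz=0.3, tau_yz=0.4)

# If own attitude is unrelated to both variables, nothing is removed.
unchanged = partial_tau(tau_xy=0.5, tau_xz=0.0, tau_yz=0.0)
```

A partial tau that remains reliably above zero, as reported here, is what one would expect if the immunization effect on judgments were not carried by the subjects' own attitudes.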

Another possible explanation for these 
effects is that the distortions which were 
obtained resulted from the recipients’ at- 
tempts to minimize the discrepancies be- 
tween the arguments and counterarguments of 
the two experimental sessions. Subjects who 
had received passive immunization might sub- 
sequently have read an antipenicillin state- 
ment, for example, and assumed that the 
“doctor” who had written the message would 
be unlikely to completely oppose the pro- 
penicillin arguments that the “first doctor” 
had introduced; immunized subjects might 
therefore conclude that the second author was 
probably not as “anti” as his statement might 
suggest. This explanation is based on the gen- 
eral assumption that the average recipient will 
typically minimize conflict between prestige- 
ful figures, as a means of reducing cognitive 
dissonance (Festinger, 1957). Since subjects 
who were actively immunized were exposed to 
only one message concerning each of the ex- 
perimental issues, this hypothesized tendency 
to minimize conflict between others would 
presumably be inoperative. 

It is interesting to compare the assimilation
trend which was obtained following passive
immunization (i.e., displacement of the
change messages toward the immunization 
passages), with the contrast effects reported 
by Janis and Feierabend (1957). These in- 
vestigators had subjects rate the contents of 
each paragraph in an authoritative communi- 
cation which urged them to volunteer for civil 
defense work; half of the paragraphs stressed 
the desirability of civil defense, and half 
pointed to the inconveniences and hardships 
that the work entailed. When the con argu- 
ments were presented after the pro arguments, 
they were seen as being more unfavorable 
than when they were presented first; con- 
versely, the pro arguments were perceived as 
being more favorable when presented follow- 
ing the con arguments. It should be noted 
that while Janis and Feierabend’s subjects 
responded to the two sets of arguments with 
virtually no delay, in the present experiments
there was a 2-5 day interval between immu- 
nization, and the presentation of the change 
messages. A recent unpublished study by 
Manis offers some support for the hypothesis 
that the time interval between immunization 
and attack may be a crucial variable; de- 
spite some aberrant data (possibly attribut- 
able to a relatively inattentive reading of the 
experimental messages), the results suggested 
that contrast effects may be obtained if two 
opposing messages are presented in succession, 
with no intervening delay, while assimilation 
is produced if the second message is presented 
2 days after the first. 

While the results of the present experi- 
ments failed to confirm the prediction that 
led to their design, the apparent consistency
of the obtained effects warrants serious consideration.
The tentative position taken above
suggests that in some instances, magnitude of 
distortion may be relatively unaffected by 
differences in pressure to change, and may 
instead reflect the recipient's attempts to
minimize discrepancies between the message- 
to-be-judged, and other messages that he has 
previously received. 


Male students estimated their chances of success before performing an anagrams
task. 3 reported probabilities of success (.20, .50, and .80) were combined with
3 lengths of the puzzle (8, 11, and 14 anagrams), and 14 Ss were randomly
assigned to each of the 9 experimental conditions. Probability estimates were
found to increase as reported probabilities increased (p < .001) and as the list
of anagrams became shorter (p < .025). Ss tended to overestimate chances of
success for the .20 reported probability and to underestimate chances for the
.80 reported probability (p < .001). Probability estimates were negatively related
to debilitating anxiety (p < .05) and positively related to facilitating
anxiety (p < .025), but correlations were very low. Results are taken to indicate
the dominant role of past experience in shaping present expectations.


In recent years a number of theoretical 
models have appeared in the psychological 
literature which employ a concept of expect- 
ancy or subjective probability in association 
with a concept of value or utility (Feather, 
1959a). Atkinson (1960) has referred to these
as "expectancy-value" models and suggests 
that they stand as useful alternative concep- 
tualizations to firmly entrenched theories such 
as those of Hull (1952) and Spence (1956) 
which involve concepts of drive, habit, and 
incentive. One important problem associated 
with expectancy-value models is the specifi- 
cation of the different factors which influence 
level of expectation and value. In previous re- 
search (Feather, 1959b), the writer examined 
the way in which the strength of a person's 
expectation of success (success probability) 
affects the value which he places on success- 
ful attainment of the goal (attainment attrac- 
tiveness) and his tendency to make a com- 
mitted choice (choice potential). The present 
study focuses upon the concept of expectation.
In keeping with the previous theoretical
analysis (Feather, 1959a), the relationship of 
expectation of success to each of the three 
factors discussed in the present paper, is as- 
sumed to be mediated by past learning ex- 
periences. 

The first aim of the present investigation is 
to examine the way in which variation in reported
probability of success and the structure
of a task affect a person’s expectation
of success. A number of studies (e.g., Crandall,
Solomon, & Kellaway, 1955; Irwin,
1953; Marks, 1951) indicate that expecta- 
tion of an outcome varies directly with the 
objective probability of the outcome. The ef- 
fect of direct reporting of probability of suc- 
cess on a person’s expectation of success has 
not, to the writer’s knowledge, been explicitly 
investigated although this kind of technique 
is commonly used in studies in personality 
and social psychology involving reported 
norms (cf. Feather, 1961). We also expect 
a person’s expectation of success at a task 
to be related to the way in which the task is 
structured. For example, tasks may differ in 
their apparent length and complexity. These 
structural characteristics of the task provide 
influences on expectations which are addi- 
tional to external sources of information such 
as reported probabilities. Both reported prob- 
ability and task structure are assumed to be- 
come effective as influences on expectation of 
success by virtue of past learning. For ex- 
ample, a person would be more likely to com- 
plete a task successfully in a given time when 
it is short rather than long. Past successes 
and failures with tasks of differing length 
would influence his expectation of success 
when a task of given length is presented to 
him. In a corresponding way, the effect of 
reported probabilities on present expectations 
is assumed to depend upon past experience 
with tasks for which the objective probabilities
of success were known. In line with these
assumptions it is predicted that expectation 
of success should vary positively with the re- 
ported probability of success for the task and 
negatively with a particular structural char- 
acteristic of the task, namely, its length. 

It is further predicted that subjects should 
tend to overestimate their chances of success 
when the reported probability is low and to 
underestimate their chances of success when 
the reported probability is high. This predic- 
tion is consistent with the results of several 
investigations which, using widely different 
procedures (Attneave, 1953; Cohen, 1960; 
Griffith, 1949; Preston & Baratta, 1948; 
Sprowls, 1953), indicate a tendency for sub- 
jects to overestimate low objective probabili- 
ties and to underestimate high objective prob- 
abilities. 

The second aim of the present study is to 
investigate the relationship of expectation of 
success to motivation elicited in an achieve- 
ment context. Studies by Marks (1951), 
Irwin (1953), and Crandall, Solomon, and 
Kellaway (1955) using a card guessing tech- 
nique, show that stated expectations about 
the occurrence of an event are influenced by 
the desirability of that event. For a given ob- 
jective probability, desirable events are an- 
ticipated more frequently than undesirable 
events. These studies indicate the effect of 
the value of an event on a person's expecta- 
tions about its occurrence. In another context, 
Atkinson (1957) argues that subjects high in 
the motivation to achieve success may bias 
their subjective probabilities of success upwards
(i.e., are optimistic in their expectations),
while subjects high in anxiety about
failure may bias their subjective probabilities
downwards (i.e., are pessimistic in their expectations).
He cites two experiments to support
this assumption. In a level of aspiration
situation where subjects were required to
state expectations and where cues from past 
performance were ambiguous and conflicting, 
subjects high in n Achievement set higher 
levels of aspiration than subjects low in n 
Achievement (McClelland, Atkinson, Clark, & 
Lowell, 1953). In a second experiment, where 
subjects were asked to state the score they 
expected to get just before they performed a 
novel task, scores were positively related to 
n Achievement (Pottharst, 1955). These two
studies show the effect of differences in 
achievement motivation on level of aspira- 
tion defined as an expected score. In a more 
recent study (Atkinson, Bastian, Earl, & Lit- 
win, 1960) subjects high in n Achievement 
reported that they expected to beat more per- 
sons in a shuffleboard game than subjects low 
in n Achievement. The authors regard this 
difference as consistent with the assumption 
of motivational biasing of subjective prob- 
abilities. 

The present investigation studies the rela- 
tionship of expectation of success at an 
achievement task to n Achievement as as- 
sessed from TAT protocols, and to debili- 
tating and facilitating anxiety assessed by the 
Achievement Anxiety Test (AAT) developed 
by Alpert and Haber (1960). These authors 
distinguish between debilitating anxiety de- 
fined as anxiety which interferes with efficient 
task performance, and facilitating anxiety de- 
fined as anxiety which increases the efficiency 
of task performance. Questions concerning 
debilitating anxiety (AAT−) are oriented
towards failure at a task; questions concerning
facilitating anxiety (AAT+) are oriented
towards success at a task. It is predicted that 
expectation of success should relate positively 
to both n Achievement and facilitating anx- 
iety, i.e., subjects concerned with succeeding 
should tend to bias their subjective probabili- 
ties of success upwards. In contrast, it is 
predicted that expectation of success should 
relate negatively to debilitating anxiety, 
i.e., subjects worried about possible failure
should tend to bias their subjective probabili- 
ties of success downwards. 

These two predictions are also based on the 
assumption that past experience is a basic 
determinant of present expectations. Alpert 
and Haber (1960) present data showing that 
debilitating anxiety and academic perform- 
ance are negatively correlated, and many 
studies reviewed by Child (1954) and Atkin- 
son (1960) indicate that high anxiety inter- 
feres with efficient performance of a complex 
task. Arguing from this evidence, we would 
expect persons high in debilitating anxiety to 
have experienced more failures in the past 
than those low in debilitating anxiety. This 
greater incidence of failure should determine 
relatively lower expectations of success. In
the same way, persons high in facilitating 
anxiety would be more likely to have experi- 
enced success in past achievement situations 
than those low in facilitating anxiety, and 
should therefore tend to have relatively higher 
expectations of success. This explanation 
therefore considers the predicted relationships 
between expectation of success and debili- 
tating and facilitating anxiety to be mediated 
by differential success and failure in past 
achievement situations. Previous successes 
and failures associated with level of debili- 
tating and facilitating anxiety provide a frame 
of reference in terms of which the individual 
estimates his own chances of success when
he comes to a task. A corresponding
argument applies to the predicted positive 
relationship between n Achievement and 
expectation of success. 


PROCEDURE 


Subjects were male, first year, external students at 
the University of New England. They were presented 
with an anagrams test labeled “Word Puzzle” with 
the following instructions:


The object of this puzzle is to rearrange the 
letters to form the name of a fruit or vegetable. 
For example: APE is pea. You will be allowed 
two minutes to work at the puzzle. You succeed 
at the puzzle only if you are able to unscramble 
all the words correctly. When the signal is given 
to turn the page you will be allowed 10 seconds 
to look over the words. During this time try to 
estimate as accurately as possible what you expect 
your chances are of unscrambling all the words 
correctly. Write your estimate as so many chances 
in 100 in the space provided at the top of the 
page. For your guidance you are told the average
probability of success for first year, external students
who have attempted the puzzle in the past,
i.e., the chances of success computed on the basis
of their performance. Remember that you are 
allowed two minutes to work at the puzzle. Each 
word when unscrambled forms the name of a fruit 
or vegetable. This is a test of verbal recognition
so try to do your best. 


Three forms of the puzzle were employed and they 
were combined with three reported probabilities in
a factorial design. In Form A (the short form) 8 
anagrams had to be unscrambled. In Form B (the 
medium form) 11 anagrams had to be unscrambled 
consisting of the 8 anagrams in Form A with 3 new 
anagrams added. In Form C (the long form) 14 
anagrams had to be unscrambled consisting of the 
11 anagrams in Form B with 3 further anagrams.
The structure of the task was thus varied by presenting
lists differing in length. The three reported
probabilities were .20 (difficult), .50 (average), and 
.80 (easy). These probabilities were fictitious. The 
three reported probabilities in combination with the 
three forms of the puzzle yield nine different experi- 
mental conditions. Fourteen subjects were randomly 
assigned to each of these nine conditions to give a 
total N of 126. 
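The 3 x 3 factorial layout and the random assignment of 14 subjects per cell can be sketched as follows. The function name, the subject numbering, and the seed are hypothetical; the text does not describe the mechanics of the randomization, so this shows only one way of meeting the stated constraints.

```python
import random
from collections import Counter
from itertools import product

FORMS = {"A": 8, "B": 11, "C": 14}        # form label -> number of anagrams
REPORTED_PROBABILITIES = (0.20, 0.50, 0.80)
SUBJECTS_PER_CELL = 14


def assign_subjects(subject_ids, seed=0):
    """Randomly assign subjects to the nine form-by-probability cells,
    with exactly SUBJECTS_PER_CELL subjects in each cell (illustrative)."""
    cells = list(product(FORMS, REPORTED_PROBABILITIES))
    slots = [cell for cell in cells for _ in range(SUBJECTS_PER_CELL)]
    subject_ids = list(subject_ids)
    if len(subject_ids) != len(slots):
        raise ValueError("expected exactly %d subjects" % len(slots))
    rng = random.Random(seed)               # seeded so the sketch is repeatable
    rng.shuffle(subject_ids)
    return dict(zip(subject_ids, slots))


assignment = assign_subjects(range(1, 127))   # hypothetical subject numbers 1-126
cell_sizes = Counter(assignment.values())
```

Shuffling the subjects against a fixed list of cell slots guarantees equal cell sizes, which simple independent random draws would not.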

Immediately before he performed the task each 
subject completed the following statement which 
appeared at the top of the list of anagrams: 


Analysis of past results shows that first year, 
external students have (20, 50, 80) chances in 100 
of unscrambling the words below in 2 minutes. 
I estimate that I have about —— chances in 100 
of unscrambling all the words correctly in the time 
limit allowed. (Try to estimate your chances as 
accurately as possible.) 


Following the anagrams puzzle the Achievement 
Anxiety Test (AAT) was administered to all subjects.¹
Four days later subjects were tested for n
Achievement under neutral conditions according to 
the standard procedure (McClelland et al., 1953).
The following four pictures were presented: 2, 48, 
1, and 7 (using numbers assigned by Atkinson, 
1958a).

Interscorer reliability² for scoring n Achievement
is r = .90. The means and standard deviations for
the three tests are as follows (N = 126): for the
AAT+, M = 27.31, SD = 4.46; for the AAT−,
M = 24.60, SD = 6.13; for the TAT n Achievement,
M = [illegible].28, SD = 4.80. The mean AAT− score obtained
in the present study is significantly lower
(p < .001) than the corresponding mean for the
sample of American students tested by Alpert and
Haber (M = 26.33, SD = 5.33, N = 379).


RESULTS 


The mean probability estimates for each of 
the nine conditions of the experiment are 
presented in Table 1 with an analysis of 
variance. 

The results in Table 1 are consistent with 
predictions. The main effect of reported prob- 
abilities on probability estimates is highly sig- 
nificant (p< .001). Mean probability esti- 
mates increase as the reported probability of 
success varies from .20 to .80, implying a 
strong positive relationship between expectation
of success at the task and reported probability.
In addition, Table 1 shows that there
is a statistically significant linear regression
(p < .025) of probability estimates on length
of list. Probability estimates tend to be
greater the shorter the list, implying that
expectation of success is negatively related to
length of the list. Finally, it may be seen that
there is a marked tendency to overestimate
chances of success when reported probability
is low (.20) and to underestimate chances of
success when reported probability is high
(.80). Overestimation is greatest for the short
list when the reported probability is .20 and
underestimation is greatest for the long list
when the reported probability is .80. An
analysis of variance of the differences between
probability estimates and reported probabilities
shows that mean differences vary significantly
over the reported probability classification
(F = 25.37, df = 1/117, p < .001).


1 Analyses of variance indicate that there are no
significant differences between the nine experimental
groups in either mean facilitating anxiety scores or
mean debilitating anxiety scores (F < 1 in each
analysis). Thus there is no evidence to suggest that
the different experimental treatments influenced
anxiety scores on the AAT.

2 The writer wishes to thank Graeme Halford for
his assistance in the reliability check.

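The direction and size of the departures can be read from the column means of Table 1. The short sketch below simply subtracts each reported probability from the corresponding mean estimate; the variable names are illustrative, and the means are the column means as printed in Table 1.

```python
# Reported probabilities and the column means of Table 1 (mean estimates
# pooled over the three forms of the puzzle).
reported = [0.20, 0.50, 0.80]
mean_estimates = [0.40, 0.59, 0.69]

# Positive values indicate overestimation, negative values underestimation.
departures = [round(est - rep, 2) for est, rep in zip(mean_estimates, reported)]

for rep, dep in zip(reported, departures):
    label = "overestimate" if dep > 0 else "underestimate"
    print(f"reported {rep:.2f}: mean departure {dep:+.2f} ({label})")
```

On these means the departure is positive at .20 and .50 and negative at .80, which is the pattern of regression toward an intermediate value that the text describes.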
Intercorrelations between probability
estimates, debilitating anxiety (AAT—), 
facilitating anxiety (AAT+), and TAT n 
Achievement are presented in Table 2. 
Table 2 shows that subjects high in debili- 
tating anxiety tend to estimate lower proba- 
bilities of success at the task than do subjects 
low in debilitating anxiety. In contrast, sub- 


TABLE 1

MEAN PROBABILITY ESTIMATES FOR NINE
EXPERIMENTAL CONDITIONS

                              Reported probability
                        .20          .50          .80          Row Ms
Form A (short)          .49          .69          .68          .62
Form B (medium)         .35          .56          [illegible]  .55
Form C (long)           .37          [illegible]  .66          [illegible]
Column Ms               .40          .59          .69

Analysis of Variance

Source                          df        MS           F
Reported probabilities           2     8,803.50     21.06***
Forms
  Linear regression              1     2,211.44      5.29**
  Departures from
    linear regression            1        92.89      <1
Interaction                      4       498.63      1.19
Residual                       117       417.97

** p < .025.
*** p < .001.
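As a check on the analysis of variance in Table 1, each F ratio can be recomputed as the mean square for the source divided by the residual mean square. The sketch below does only that arithmetic, using the mean squares printed in the table; the dictionary keys are shorthand labels, not terms from the original.

```python
RESIDUAL_MS = 417.97  # residual mean square, df = 117

mean_squares = {
    "reported_probabilities": 8803.50,
    "forms_linear_regression": 2211.44,
    "forms_departures_from_linear": 92.89,
    "interaction": 498.63,
}

# F = MS(source) / MS(residual)
f_ratios = {source: ms / RESIDUAL_MS for source, ms in mean_squares.items()}

for source, f in f_ratios.items():
    print(f"{source}: F = {f:.2f}")
```

The recomputed ratios agree with the tabled values of 21.06, 5.29, and 1.19, and the departure from linear regression gives a ratio below 1, as the table indicates.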
TABLE 2

INTERCORRELATION OF PROBABILITY ESTIMATES
AND MOTIVATION MEASURES

                            Debilitating    Facilitating    n
                            anxiety         anxiety         Achievement
Probability estimates a       −.16*           .18**           −.13
Debilitating anxiety                         −.53***           .12
Facilitating anxiety                                          −.03

a The r's of −.16, .18, and −.13 are within-groups correlations
obtained by analysis of covariance procedure with
df = 116.
* p < .05, one-tailed test.
** p < .025, one-tailed test.
*** p < .001, one-tailed test.


jects high in facilitating anxiety tend to esti- 
mate higher probabilities of success at the 
task than do subjects low in facilitating 
anxiety. Both of these relationships are sta- 
tistically significant using a one-tailed test 
but the correlations are obviously very low. 
They are, however, consistent with prediction. 
It is interesting to note that debilitating 
anxiety and facilitating anxiety are not independent
but intercorrelate r = −.53. This
negative correlation is consistent with pre- 
vious results presented by Alpert and Haber 
(1960).³

There is no statistically significant relation- 
ship between probability estimates and TAT 
n Achievement. Thus the prediction that 
probability estimates and n Achievement 
should intercorrelate positively is not sup- 
ported by the data. 


Discussion 


The tendency for estimates of probability 
of success to follow reported probabilities and 
to be influenced by the length of the list of 
anagrams is not surprising. We would expect 
the structure of a task and information about 
the performance of others to provide impor- 
tant guides to a person when he is asked to 
judge his own chances of success prior to 
performance. Studies in level of aspiration 
(Lewin, Dembo, Festinger, & Sears, 1944) 


3 Since facilitating anxiety (AAT+) also correlates 
positively with task performance (Alpert & Haber, 
1960) one can ask if subjects high in facilitating 
anxiety are in fact anxious. Items in the AAT+ scale 
seem to relate to task involvement and very few of 
the questions refer to anxiety or nervousness during 
an examination as a factor in increased efficiency. 


EXPECTATIONS AND NEED ACHIEVEMENT 


attest to the importance of such frames of 
reference. 

What is more interesting in the present 
study is the systematic tendency of subjects 
to overestimate chances of success when re- 
ported probability is low and to underestimate 
their chances when reported probability is 
high. As we have noted, this tendency to 
overestimate chances when the odds are long 
and to underestimate chances when the odds 
are short has now appeared in a variety of 
studies employing different procedures. It 
thus appears to have considerable generality. 
Preston and Baratta (1948) suggest that it 
may be accounted for in terms of Helson’s 
(1947) theory of the adaptation level. 

This tendency for judgments to move 
towards an intermediate point is not peculiar 
to studies of subjective probability. The 
“error of central tendency” is a common phe- 
nomenon in ratings (Guilford, 1954). In fact, 
it is possible that individual differences in the 
degree to which low objective probabilities are 
overestimated and high objective probabilities 
are underestimated may be related to a more 
general response set involving a tendency to 
avoid extreme judgments. 

The systematic departures of estimated 
probabilities from reported probabilities ob- 
tained in the present study could also be 
interpreted as reflecting the effect of values 
on expectations. When reported probability of 
success is low, the positive incentive value of 
success should be high and the negative in- 
centive value of failure should be low (cf. 
Atkinson, 1957; Feather, 1959a). It may be 
that the high positive value of achievement 
when the odds are long biases subjective 
probability of success upwards. Correspond- 
ingly, when reported probability of success is 
high, the positive incentive value of success 
should be low and the negative incentive value 
of failure should be high. In this instance the 
high negative value of failure when the odds 
are short may bias subjective probability of 
success downwards. If this interpretation is
valid we would expect the systematic departures
of estimated probabilities from
reported probabilities to become even more
pronounced as a situation becomes more
achievement oriented and success and failure 
become more important to the person. 
The present investigation shows that the
magnitude of the departure of estimated 
probabilities from a particular reported prob- 
ability is influenced by the structure of the 
task. The obtained results indicate that when 
reported probability is low (.20) overestima- 
tion of chances of success is greatest for the 
short list of anagrams. When reported prob- 
ability is high (.80) underestimation of 
chances of success is greatest for the long 
list of anagrams. One can conceive of a task 
so structured that overestimation of chances 
might not occur when reported probability is 
low, e.g., a very long list of anagrams. Simi- 
larly, it is possible that one could present a 
subject with a task which does not lead to 
underestimation of chances of success for a 
high reported probability, e.g., a very short 
list of anagrams. These considerations suggest 
a need to investigate the limits of the tend- 
ency for estimated probabilities to depart 
systematically from reported probabilities by 
varying the task from one extreme of a 
particular structural dimension to the other 
extreme. 

The main contribution of the present in- 
vestigation is the finding that expectation of 
success at a task is related in predictable 
ways to motivation elicited in an achievement 
context. The results of the present study are 
consistent with the predicted relationships be- 
tween expectation of success at the task and 
strength of debilitating and facilitating anx- 
iety. These relationships are assumed to be 
outcomes of differential amounts of success 
and failure in past achievement situations 
where level of task performance is associated 
with differences in degree of anxiety. In short, 
the relationships are assumed to be mediated 
by past experience. There are, however, at 
least three alternative interpretations of the 
obtained relationships. 

In the first place one might appeal to a 
principle such as autism (Murphy, 1947) 
which implies direct motivational effects on 
cognitive processes in the direction of need 
satisfaction. Lewin, Dembo, Festinger, and 
Sears (1944) in discussion of research on the 
level of aspiration also argue that wishes and 
expectations interact. A similar principle 
seems to be implicit in discussions of the 
effects of values on expectations, which appear 
to become more “wishful” when high 
rewards are involved. These types of inter- 
pretation are important in recognizing the 
interaction of motivational processes and 
cognition but it is not always clear why inter- 
dependence of wish and expectation should 
take a particular form. One would like to 
know more about the mediating mechanism. 
For example, if a principle of autism is in- 
volved in a prediction one would like to know, 
in advance, not only what changes in cogni- 
tion are expected to occur but also which 
needs are assumed to underlie the changes, 
and the degree to which the needs can be 
effective within the situational context. 

The second alternative interpretation ap- 
plies more to the negative relationship ob- 
tained between estimated chances of success 
and debilitating anxiety. Ruebush (1960) 
argues that persons high in test anxiety may 
have learned certain defenses against anxiety 
in evaluative situations. He assumes that 
“cautiousness” is one such defense, and shows 
that high test anxiety has a facilitating effect 
on performance when the task is one where a 
cautious strategy is permitted. Since Alpert 
and Haber (1960) report a correlation of 
.64 between their debilitating anxiety scale 
(AAT−) and the Mandler-Sarason Test 
Anxiety Questionnaire, it might be argued that 
subjects with high AAT− scores should tend 
to be defensively cautious in a test situation 
and should therefore estimate lower chances 
of success. It seems to the writer, however, 
that there is an important difference in the 
ways in which expectations of success and 
cautiousness are assumed to be learned. In 
the present paper it is argued that a person’s 
expectation of success is determined primarily 
by his own experience of success or failure at 
the task or at related activities. In contrast, 
Ruebush assumes that cautiousness is learned 
as a defense against anxiety elicited in a test 
situation. Thus both expectations of success 
and cautiousness are acquired but the basis 
of the learning differs. It should be possible 
to integrate these two concepts in experi- 
mental research. If subjects high in test anx- 
iety have in the past been more successful at 
performing tasks where a cautious strategy 
is permitted, then their expectations of success 
should tend to be higher for such tasks than 
for tasks not permitting a cautious strategy. 
Thus a highly cautious person could have a 
high expectation of success for a particular 
kind of task. Cautiousness does not neces- 
sarily imply a low expectation of success. 

The third alternative interpretation also 
seems more applicable to the negative rela- 
tionship obtained between estimated chances 
of success and debilitating anxiety. A small 
number of items in the AAT− seem rather 
indirectly to imply a low expectation of 
success. For example, Items 2 and 8 concern 
“fear of a bad grade” and “doing worse than 
the rest of the group,” respectively. Hence, 
the negative relationship obtained may reflect 
overlap between responses to such items and 
the probability estimates. This type of inter- 
pretation appears less plausible when applied 
to the positive relationship obtained between 
probability estimates and facilitating anxiety. 
Items relating to expectation of success seem 
to the writer to be less evident in the AAT+. 
Future studies of the relationship between 
expectation of success and achievement re- 
lated motivation should attempt to eliminate 
this possible interpretation by using measures 
of motivation which do not involve items 
either directly or indirectly related to expectation 
of success. The issue raised here is not 
restricted to the present line of research but 
is a common problem when a questionnaire 
instrument is used to assess level of motiva- 
tion. Farber (1954), for example, in discussion 
of research involving the Taylor (1953) 
Manifest Anxiety scale, notes that responses 
to the scale may reflect particular habits as 
well as differences in drive level. 

The present emphasis on the role of past 
experience as an important mediator of the 
relationship between expectation and motiva- 
tion has implications about the conditions 
under which we would expect to find relation- 
ships between estimates of probability of 
success and achievement related motivation. 
These relationships should tend to appear 
under conditions where past performance can 
serve as an important frame of reference in 
shaping present expectations, e.g., just before 
a person begins to work at a task (cf. Pottharst, 
1955). Prior to performance, a person's 
expectation of success at a task is likely to 
be in terms of a general expectation based on 
the relative incidence of past successes and 
failures in related activities. This general 
expectation becomes modified, however, once 
a person is immersed in a task, and his present 
performance is then likely to become the 
dominant factor influencing his expectations. 
Under these conditions of task performance, 
relationships between estimated chances of 
success and achievement related motivation 
may not be so evident. However, the rela- 
tionships might even appear in this latter 
condition if a person’s performance at the 
task were so ambiguous and confused that he 
is forced to fall back upon past performance 
in related activities as a guide to present 
expectations. Sutcliffe’s (1955) finding of 
greater generality in level of aspiration be- 
havior for tasks of high performance vari- 
ability might reflect the operation of this 
latter process. 

The results of the present study do not 
support the prediction of a positive relation- 
ship between n Achievement and expectation 
of success. It is likely that this relationship 
failed to appear because of the reality con- 
straints in the situation. The procedure al- 
lowed each subject a brief examination (10 
seconds) of the list of anagrams and pro- 
vided him with a probability norm. Further- 
more, the subject was requested to be as 
accurate as possible in his estimates. The 
predicted relationship might appear in an 
achievement oriented situation which provides 
less information and is more ambiguous. 

Finally, future investigations of expecta- 
tion of success might seek to discover other 
properties of the task situation, apart from 
task length and reported norms, which determine 
differences in subjective probability. The 
search for these properties will not be easy 
particularly in view of the fact that different 
measures of subjective probability have been 
presented in the literature. Edwards (1962) 
discusses subjective probabilities which are 
inferred from decisions by way of a utility 
maximization model. He conceives of subjective 
probability as a function of objective 
probability and of “identifying characteristics” 
of an event which are not yet specified. 
Becker (1962) describes methods for measuring 
subjective probabilities which depend on 
the subjectively expected utility (SEU) decision 
model. These approaches are much less 
direct than the procedure used in the present 
investigation. Adams and Adams (1961) de- 
scribe a procedure for investigating the real- 
ism of confidence judgments which makes 
use of direct estimates. Still other methods 
of varying expectation of success are used in 
investigations by Rotter, Fitzgerald, and 
Joyce (1954) and by Atkinson (1958b). One 
wonders about the degree to which these dif- 
ferent procedures yield comparable measures. 
For any particular measurement procedure, 
however, the search for characteristics of a 
task situation which determine differences in 
expectation might begin by isolating those 
aspects of the situation which are obviously 
correlated with success and failure, e.g., ap- 
parent complexity and time allowed for com- 
pletion. A person's actual experience at a task 
and at related tasks, the frequency and pat- 
terning of his successes and failures, appear 
to be the dominant influences which shape his 
present expectations. 
In a longitudinal study of antisocial aggressiveness in males, reports on direct 
observation of behavior in childhood and early adolescence were used to 
rate general level of aggressiveness. Criminal records were used to ascertain 
antisocial behavior during adolescence and adulthood. To distinguish between 
conditions which contribute to socialized aggressiveness and those which direct 
aggression into antisocial channels, the family backgrounds of men who had 
been equally aggressive in childhood were compared. The results suggest that 
extreme neglect and punitiveness, coupled with a deviant-aggressive paternal 
model, produce antisocial aggressiveness. In contrast (though not contradiction), 
moderate neglect, moderate punitiveness, and ineffective controls produce 
socialized aggressiveness. 


The development of aggressiveness has 
been the focus of much recent research in 
social science. Attacks on the problem have 
been from two directions: studies of anti- 
social behavior (Bandura & Walters, 1959; 
Glueck & Glueck, 1950; McCord & McCord, 
1959) concerned with delinquents, and stud- 
ies of general aggressiveness (McCord, Mc- 
Cord, & Howard, 1961; Sears, Maccoby, & 
Levin, 1957; Sears, Whiting, Nowlis, & Sears, 
1953) based on samples of nondelinquents or 
predelinquents. The two approaches have 
yielded generally similar findings—of rejec- 
tion, punitiveness, inconsistency—in the back- 
grounds of delinquents and of aggressive boys. 
This apparent similarity in environment and 
the fact that the majority of delinquents par- 
ticipate in aggressive behavior has created 
a tendency to translate the results of 
delinquency studies into conclusions about 
aggression and vice versa.² Yet, one of the 
few attempts to measure the relation between 
socialized and antisocial aggression, reported 
by Robert Sears,³ gives no support for this 


¹ The authors wish to express their appreciation 
to the Ella Lyman Cabot Foundation, the Harvard 

² One notes, for example, that Adolescent Aggression 
(Bandura & Walters, 1959) is a study of 
delinquent adolescents, although the focus of the 
book is on the development of aggression. 

³ Unpublished mimeographed report, “Relation of 
Early Socialization Experiences to Aggression in 
Middle Childhood” by Robert Sears. 


tendency. Sears, in a study of 76 boys and 
34 girls, found no positive relationship be- 
tween antisocial aggression and other forms 
of aggressiveness (prosocial, aggression 
anxiety, projected aggression, or self-aggres- 
sion), as measured by self-report scales of 
aggressiveness. Despite the difficulties in- 
herent in use of self-report techniques to 
measure aggressiveness, it seems reasonable 
to reassess the assumption that socialized 
and antisocial aggression have similar roots 
within the family circle. 

Since most previous work on the etiology of 
delinquency has utilized control groups of 
nonaggressive boys or boys who, at the 
minimum, were presumably less aggressive 
than the delinquents, and many studies of 
nondelinquent aggressiveness have been car- 
ried out without distinguishing predelin- 
quents from nondelinquents, the possibility 
arises that the two types of studies may have 
been confusing the origins of aggression with 
the origins of criminality. Both theoretically 
and practically—for some parents would like 
to see their children become “aggressive,” 
though few would wish them to be criminals 
—the distinction seems to have importance. 
Thus, the question remains: In what ways 
do the family backgrounds of aggressive 
delinquent boys differ from those of aggressive 
nondelinquent boys? Or, somewhat differently 
stated, “What family environments tend to 
produce antisocial as opposed to socialized 
aggressiveness?” 
There are, of course, many ways to define 
“antisocial” and “aggressive.” Some of these 
overlap in such a way that, by definition, 
the aggressive child is antisocial and vice 
versa. In order to inquire into the difference 
between forms of aggression which are con- 
sidered acceptable by society and those 
which are condemned as criminal, definitions 
must be used which will make the two forms 
of behavior distinguishable. For the present 
study, an outgrowth of the Cambridge-Somer- 
ville Youth Study (Powers & Witmer, 1950), 
it was possible to use a longitudinal ap- 
proach employing behavioral measures of 
both aggressiveness (in childhood and early 
adolescence) and antisocial behavior (in late 
adolescence and adulthood). 


METHOD 


The Cambridge-Somerville Youth Study took 
place between 1939 and 1945, during which time 
255 boys living in deteriorated areas surrounding 
Boston, Massachusetts, between (on the average) 
ages 10 and 15, were observed at home, at school, 
and at play. Teachers and social workers who 
visited the families approximately every other week 
recorded their observations of the behavior of 
parents as well as children after each visit. The 
social workers appeared unannounced and with a 
frequency which enabled a variety of types of 
observations—at meals, in the midst of crises, 
and performing daily routines. In 1956 and 1957, 
trained researchers, reading these running records, 
classified each boy and his parents on variables 
ranging from occupation and religion to affectional 
interaction.* Interrater agreement, tested on a random 
sample, was high and the categorized ratings yielded 
strong relationships to completely independent meas- 
ures of social deviance among the subjects when they 
had become adults (McCord & McCord, 1960). 

To measure aggressiveness, the raters utilized 
reports made by teachers, camp counselors, and 
psychologists in addition to those of the home 
visitors. Those boys who reacted aggressively to 
most forms of frustration (they were involved in 
fist fights, bullying, or destructive behavior) were 
designated as “highly aggressive.” The boys who, 
although they occasionally reacted aggressively, 
were generally realistic in response to frustration, 
were classified as “assertive.” And those who 
rarely, if ever, exhibited aggression were con- 
sidered “nonaggressive.” Interrater reliability on 
this measure was .867. 

To ascertain antisocial behavior, in 1955 the 
names of the 255 boys in the study were checked 
for court records. Those who had been convicted 


* See McCord and McCord (1960) for a complete 
description of the ratings. 
for larceny, breaking and entering, assault, or 
sex crimes were classified as “antisocial.”⁵ 

Using these measures of aggressiveness and 
antisocial behavior, we found that adolescent ag- 
gressiveness was strongly related to antisocial be- 
havior. Yet half of the aggressive adolescents had 
no records of antisocial behavior, and approximately 
two-thirds of the criminals had not been highly 
aggressive in adolescence. In order to concentrate 
upon the differentiating factors among aggressive 
boys, three groups were selected for comparison: 
the 26 men who, during early adolescence, had 
manifested extreme aggressiveness and had criminal 
records as adults (aggressive-antisocial men); the 
25 men who, despite showing extreme aggressiveness 
during the earlier period of study, had not de- 
veloped criminal records (aggressive-socialized 
men); and the 52 men who were neither aggres- 
sive nor criminal (nonaggressive men). Use of these 
three groups permitted differentiation among pre- 
sumed causal conditions which contributed to 
antisocial aggressiveness and to socialized aggres- 
siveness—for the two aggressive groups had ex- 
hibited similar behavior in childhood. Those con- 
ditions which differentiated the aggressive-antisocial 
from the aggressive-socialized men may be presumed 
to promote antisocial aggression; those which 
distinguished the aggressive from the nonaggressive 
—but failed to differentiate between the socialized 
and antisocial—may be presumed relevant only to 
a general syndrome of aggressiveness. 


RESULTS 


Even more than their socialized counter- 
parts, the aggressive-antisocial men had ex- 
perienced family discord, neglect, and pa- 
rental attacks. Slightly more than half the 
aggressive-antisocial men had been reared 
by parents who were in almost constant 
conflict. Seventy-seven percent had had no 
adult supervision during childhood. The 
parents of 87% of the aggressive-antisocial 
men had frequently used extreme threats 
(e.g., of castration or of turning out of the 
house) in their child rearing. Seventy-seven 
percent of their mothers (compared to 56% 
of the aggressive-socialized and 42% of 
the nonaggressive) rarely expressed, verbally 
or nonverbally, approval of or pleasure in 
their children. The fathers of 61% of the 
aggressive-antisocial men (compared to 45% 
of the aggressive-socialized and 23% of the 

⁵ This definition of “antisociality” is admittedly 
one which accepts society's definition of what is 
opposed to its interest. We recognize, of course, 
the numerous alternative definitions which might 
be employed with different results. 
TABLE 1 
FAMILY BACKGROUND AND LEVEL OF AGGRESSION 

                                  Percentage of    Percentage of            Percentage of 
The parents:                      nonaggressive    aggressive-socialized    aggressive-antisocial 
Were in conflict                  (N = 43) 12      (N = 21) 38              (N = 21) 57 
Provided little supervision       (N = 52) 25      (N = 25) 48              (N = 26) 77 
Used extreme threats              (N = 41) 32      (N = 22) 64              (N = 23) 87 
Rejected the boy                  (N = 42) 10      (N = 19) 21              (N = 23) 87 
Used inconsistent discipline      (N = 52) 48      (N = 25) 72              (N = 26) 81 
Held low expectations for boy     (N = 51) 55      (N = 25) 84              (N = 26) 81 
Provided no religious training    (N = 50) 52      (N = 24) 79              (N = 26) 62 

Note.—The N reflects the number of subjects for whom there was sufficient information for a rating to be made for the 
category. 


nonaggressive) openly displayed dislike of 
their offspring. 

Other influences which are related to the 
level of aggression among socialized men 
(cf. McCord et al., 1961)—inconsistency in 
mother’s discipline, low expectations, and 
absence of religious training⁶—were not 
found with greater frequency in the back- 
grounds of aggressive-antisocial than of 
aggressive-socialized men. 

In addition to those conditions which 
also produced a relatively high proportion 
of aggressive-socialized men, the aggressive- 
antisocial men had been subjected to greater 
parental punitiveness. Parental discipline 
was judged on the basis of direct observation. 
Physical punishment (ranging from 
spanking to brutal beatings) was considered 
"punitive" and was opposed, for purposes of 
categorization, to such techniques as with- 
drawal of privileges, scoldings, or isolation. 
A significantly higher proportion of the ag- 
gressive-antisocial men than of the aggressive- 
socialized men had received punitive discipline 
from both their parents (p < .01).⁷ 
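
The comparison just described can be checked roughly against the percentages reported in Table 2 (70% of 23 aggressive-antisocial men and 22% of 18 aggressive-socialized men punished by both parents). The sketch below is an illustration only: the cell counts are reconstructed by rounding those percentages, which is an assumption, and the function names are hypothetical. It applies an uncorrected Pearson chi square with 1 degree of freedom; the article does not say whether a continuity correction was used.

```python
import math


def counts_from_percent(percent, n):
    """Approximate a cell count from a reported percentage and group size."""
    return round(percent / 100 * n)


def chi_square_2x2(a, b, c, d):
    """Pearson chi square (1 df, no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 df, the two-tailed probability equals erfc(sqrt(chi2 / 2)).
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p


# Assumed counts: punished by both parents vs. not, in each aggressive group.
anti_both = counts_from_percent(70, 23)
soc_both = counts_from_percent(22, 18)

chi2, p = chi_square_2x2(anti_both, 23 - anti_both, soc_both, 18 - soc_both)
print(f"chi square = {chi2:.2f}, two-tailed p = {p:.4f}")
```

Under these assumed counts the statistic exceeds 6.63, the tabled .01 critical value for 1 degree of freedom, which is in line with the reported significance level.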


Among single factors, the greatest direct 
influence on antisocial aggression seems to 
come from the nature of the paternal model. 
Fathers were considered deviant if they were 
criminals or alcoholics; their aggressiveness 
was rated on the same scale as was that of 
their sons. A significantly higher proportion 
of the aggressive-antisocial men than of the 


⁶ A mother who attended church or Mass once a 
week was assumed to provide religious training. 

⁷ The chi square test, two-tailed, was used to 
test for the significance of obtained differences. 


aggressive-socialized men had been reared by 
deviant and aggressive fathers (p < .05). 
The fathers of 38% of the aggressive-anti- 
social men, 12% of the aggressive-socialized 
men, and 8% of the nonaggressive men had 
been deviant and aggressive. 

The measures of family interaction can- 
not be presumed to be independent, for a 
rejecting family was more likely to be 
punitive, a deviant father was more likely 
to be aggressive and rejecting. When, as 
frequently occurred, these characteristics 
appeared in combination, the result was most 
likely to be an aggressive-antisocial son. 

Other variables—intelligence, religious af- 
filiation, neighborhood, father’s birthplace— 
were tested and found to have no significant 
relationship either to antisocial aggressive- 
ness or to socialized aggressiveness within 
our sample. 

By combining various background condi- 
tions, it is possible to illustrate the different 
influences which relate to socialized and to 
antisocial aggressiveness. Rejection, punitive- 
ness, and use of threats, we presumed, would 


TABLE 2 
PUNITIVENESS AND DIRECTION OF AGGRESSION 

                    Percentage of    Percentage of            Percentage of 
Punitiveness by:    nonaggressive    aggressive-socialized    aggressive-antisocial 
Both parents              20               22                       70 
One parent                35               61                       26 
Neither parent            45               17                        4 
Total               (N = 40) 100     (N = 18) 100             (N = 23) 100 

Note.—The N reflects the number of subjects for whom there 
was sufficient information for a rating to be made for both 
parents. 
TABLE 3 
SUMMARY: DIFFERENTIATING BACKGROUNDS 

                                         Percentage of    Percentage of            Percentage of 
                                         nonaggressive    aggressive-socialized    aggressive-antisocial 
High drive production and: 
  Deviant model                                4                0                       42 
  Nondeviant model                             2                8                        8 
Moderate drive production and: 
  Deviant model and high controls              6                8                       35 
  Nondeviant model and high controls          19               12                        0 
  Low controls                                 4               52                        8 
Low drive production and: 
  Low controls                                17               12                        7 
  High controls                               48                8                        0 
                                         (N = 52) 100     (N = 25) 100             (N = 26) 100 

Note.—The absence of “perfect” relationships may be attributed to failures in measurement, incompleteness of the theory, 
or to potential freedom in behavior. See McCord and McCord (1960) for a discussion of this issue. 


tend to increase aggressive drive. By counting 
the mother and father separately for re- 
jection and punitiveness, we obtained a drive 
producing scale ranging from zero to five— 
with scores of zero and one considered “low,” 
two and three as “moderate,” four and five as 
“high.” We assumed that supervision, pa- 
rental agreement, consistent discipline, high 
expectations, and religious training would 
tend to produce a controlled environment. 
A family providing two to five of these 
conditions was considered to have relatively 
high controls; families providing none or 
only one were considered to have low con- 
trols. The combinations of high drive pro- 
duction with deviant model (regardless of 
controls) and moderate drive production 
combined with deviant model and high controls 
tended to produce aggressive-antisocial men 
(p < .001). Moderate drive 
production with low controls (regardless of 
the model) tended to result in aggressive- 
socialized men (p < .001). Low drive pro- 
duction and high controls (regardless of 


model) tended to produce nonaggressive men 
(p < .001). 
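
The scoring rules described above can be sketched in a few lines of code. This is a minimal illustration, not the authors' procedure: the class and function names are hypothetical, and the reading that threats contribute a single point (so that mother and father rejection, mother and father punitiveness, and threats give the zero-to-five range) is an inference from the text.

```python
from dataclasses import dataclass


@dataclass
class FamilyRatings:
    # Drive-producing conditions: each True adds one point (range 0 to 5).
    mother_rejecting: bool
    father_rejecting: bool
    mother_punitive: bool
    father_punitive: bool
    extreme_threats: bool  # assumed to count once for the family
    # Control conditions: each True counts as one control (range 0 to 5).
    supervision: bool
    parental_agreement: bool
    consistent_discipline: bool
    high_expectations: bool
    religious_training: bool


def drive_level(f):
    """Scores of 0-1 are low, 2-3 moderate, 4-5 high."""
    score = sum([f.mother_rejecting, f.father_rejecting,
                 f.mother_punitive, f.father_punitive, f.extreme_threats])
    if score <= 1:
        return "low"
    return "moderate" if score <= 3 else "high"


def control_level(f):
    """Two to five control conditions are high; none or one is low."""
    count = sum([f.supervision, f.parental_agreement, f.consistent_discipline,
                 f.high_expectations, f.religious_training])
    return "high" if count >= 2 else "low"


def expected_outcome(f, deviant_model):
    """Return the tendency the text reports for this combination, if any."""
    drive, controls = drive_level(f), control_level(f)
    if drive == "high" and deviant_model:
        return "aggressive-antisocial"
    if drive == "moderate" and deviant_model and controls == "high":
        return "aggressive-antisocial"
    if drive == "moderate" and controls == "low":
        return "aggressive-socialized"
    if drive == "low" and controls == "high":
        return "nonaggressive"
    return "no stated tendency"


example = FamilyRatings(
    mother_rejecting=True, father_rejecting=False,
    mother_punitive=False, father_punitive=True, extreme_threats=True,
    supervision=False, parental_agreement=False, consistent_discipline=False,
    high_expectations=False, religious_training=False,
)
print(drive_level(example), control_level(example),
      expected_outcome(example, deviant_model=False))
```

Combinations for which the text reports no tendency (for example, low drive production with low controls) are returned as "no stated tendency" rather than guessed.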
REFERENCES 


Bandura, A., & Walters, R. Adolescent aggression. 
New York: Ronald, 1959. 

Glueck, S., & Glueck, Eleanor. Unraveling 
juvenile delinquency. New York: Commonwealth 
Fund, 1950. 

McCord, W., & McCord, Joan. Origins of crime. 
New York: Columbia Univer. Press, 1959. 

McCord, W., & McCord, Joan. Origins of alcoholism. 
Stanford: Stanford Univer. Press, 1960. 

McCord, W., McCord, Joan, & Howard, A. Familial 
correlates of aggression in nondelinquent male 
children. J. abnorm. soc. Psychol., 1961, 62, 79-93. 

Powers, E., & Witmer, Helen. An experiment in 
the prevention of delinquency. New York: 
Columbia Univer. Press, 1950. 

Sears, R., Maccoby, Eleanor E., & Levin, H. 
Patterns of child rearing. Evanston, Ill.: Row, 
Peterson, 1957. 

Sears, R., Whiting, J. W. M., Nowlis, V., & 
Sears, Pauline S. Some child-rearing antecedents 
of aggression and dependency in young children. 
Genet. Psychol. Monogr., 1953, 47, 135-234. 


(Received December 4, 1961) 


Journal of Abnormal and Social Psychology 
1963, Vol. 66, No. 3, 243-248 


THE RISK HYPOTHESIS IN PREDICTIVE JUDGMENTS 
OF UNETHICAL BEHAVIOR ¹ 

SALOMON RETTIG AND HARVE E. RAWSON 

Department of Psychiatry, Ohio State University 


The study further explores the “ethical risk” hypothesis, which postulates that 
unethical behavior varies as a function of perceived risk. 6 determinants are 
tested: expectancy of gain (Egn) and reinforcement value of gain (RVgn), 
expectancy of censure (Ecens) and reinforcement value of censure (RVcens); 
severity of offense, and reference group. It is hypothesized that Ecens and 
RVcens are significant determinants which account for more variance than the 
remaining ones. 64 items, each portraying a student in conflict about taking 
money illegally, were judged by 136 Ss as to the probability of taking the 
money. Except for the reference group, all main effects were significant. RVcens 
explained more variance than any other source. Hence, ethical risk should be 
interpreted as RVcens rather than Ecens. 


Ethical propositions are considered by 
social scientists and educators to be among 
the most fundamental determinants of human 
conduct. However, most of these propositions 
have been confined to speculative and intuitive 
analyses. Empirical studies of ethical 
conduct and the processes by which this conduct 
is affected are relatively rare. One recent 
avenue of empirical investigation has been 
offered by use of the “ethical risk” hypothesis. 
This hypothesis postulates that unethical behavior 
varies as a function of the perceived 
risk incurred by such conduct. 

Two experimental studies (Rawson, 1961; 
Rawson & Rettig, 1961) reported on the relationship 
between ethical judgments and the behavior 
of the judges under conditions of low 
and high ethical risk. Low and high ethical 
risk conditions, respectively, specified low 
and high probability of censure for engagement 
in morally disputable behavior. The 
first study related severity of judgment (on a 
factorially derived orthogonal dimension of 
exploitative-manipulative morality) to simulated 
deception, i.e., the overestimation of 
knowledge of examination questions. Ethical 
risk was defined in terms of the probability 
that deliberate overestimation may be instrumental 
in obtaining higher academic grades, 
but at the risk of public exposure and severe 
censure. As predicted, it was shown that 


¹ This study was supported by a Research Grant 
M-5042 from the National Institute of Mental 
Health, United States Public Health Service. 


severity of moral judgment related to over- 
estimation of knowledge only when the sub- 
jects could assume that their responses affect 
their academic grades (r = −.35, p = .05, 
one-tailed). In a control group in which the 
subjects were assured that their estimation of 
knowledge of examination questions would 
bear no relationship to their academic grades, 
the correlation between severity of judgment 
and overestimation of knowledge was .16, 
which differed significantly from the first cor- 
relation (z = 1.59, p = .055, one-tailed). The 
experiment was replicated on a second larger 
sample with similar results (high risk r = 
−.19, p = .06, one-tailed; low risk r = .15, 
z = 1.91, p = .03, one-tailed). It appeared, 
therefore, that the above relationship is obtained 
only under “high risk” conditions, i.e., 
when the situation makes it possible for the 
subjects to engage in unethical behavior for 
the purpose of personal gain, with the subse- 
quent risk of exposure and negative sanctions. 
In the “low risk” (control) situation the 
ethical commitments of the subjects are 
apparently not put to a concrete test. 
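
The z values quoted above compare two independent correlations. A common way to make such a comparison is the Fisher r-to-z transformation, sketched below; the passage does not state which test was used, so this is an assumption. The group sizes are not reported here either, so the value of 21 subjects per group is a hypothetical placeholder chosen only so that the sketch runs, and the function name is illustrative.

```python
import math


def compare_correlations(r1, n1, r2, n2):
    """Test whether r2 exceeds r1 for two independent samples (one-tailed)."""
    # Fisher transformation makes the sampling distribution roughly normal.
    z1, z2 = math.atanh(r1), math.atanh(r2)
    # Standard error of the difference between two transformed correlations.
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    z = (z2 - z1) / se
    # Upper-tail probability of the standard normal distribution.
    p_one_tailed = 0.5 * math.erfc(z / math.sqrt(2))
    return z, p_one_tailed


# Correlations from the first sample; the group sizes are hypothetical.
z, p = compare_correlations(r1=-0.35, n1=21, r2=0.16, n2=21)
print(f"z = {z:.2f}, one-tailed p = {p:.3f}")
```

With different group sizes the same two correlations would give a different z, so the output should be read as an illustration of the arithmetic rather than a reconstruction of the reported result.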

In the second study (Rawson & Rettig, 
1961) the relationship between severity of 
moral judgment and unethical behavior was 
tested using an actual rather than simulated 
criterion of unethical conduct. Subjects were 
requested to trace the borders of a five- 
pointed star with their eyes closed. The 
experiment was presented as a defense project 
concerned with the “night flying ability” of 
pilots. High monetary reinforcement was 
given for each point “successfully” traced. 
Since successful tracings could only be ac- 
complished through deception (with eyes 
open), a successful subject risked public 
exposure and censure. As in the first study, 
severity of moral judgments predicted un- 
ethical behavior (deception) under these high 
risk conditions (r = −.23, p = .05, one- 
tailed). Both of these experimental studies 
derived from prior factor analytic studies 
which attempted to delineate invariant di- 
mensions of moral judgments (Rettig & Pasa- 
manick, 1959, 1960, 1961, 1962). 

In view of the above findings questions 
arose regarding the further definition and sali- 
ence of the risk hypothesis in the prediction 
of unethical behavior. Since the previous 
studies did not separately vary the subject’s 
expectancy (Ecens) and reinforcement value 
(RVcens) of censure or the expectancy (Egn) 
and reinforcement value (RVgn) of the gain, 
it is not clear whether the effect was pro- 
duced by any one, or by the interaction of 
all of these determinants. Moreover, addi- 
tional information is needed regarding the 
magnitude of the effect produced by each 
source of variation relative to each other and 
to other important sources, so that further 
hypotheses regarding the relation of ethical 
risk to unethical behavior can be tested 
experimentally. Although specific hypotheses 
are tested in the present report, another im- 
portant purpose of this study is that of 
generating additional hypotheses for further 
research. 

Besides the four separate sources of variation 
already mentioned (Ecens, RVcens, Egn, 
RVgn), two additional sources of unethical 
behavior are postulated: the severity of the 
offense, and the reference group which would 
suffer the injurious consequence of the un- 
ethical behavior. Although these six sources 
of variation are not exhaustive, they are as- 
sumed to be mutually exclusive and are be- 
lieved to account for a substantial portion of 
the variation in the prediction of unethical 
behavior. The independence of the constructs 
of expectancy and reinforcement value is 
postulated by Social Learning Theory 
(Rotter, 1954). In accordance with the risk 
conjecture, it is hypothesized that the expectancy 
of censure and the reinforcement value 
of censure are both separate and significant 
sources of variation in predictive judgments 
of ethically unacceptable behavior. Furthermore, 
each of these two sources will account 
for more of the variance in these predictive 
judgments than any of the other sources. 


PROCEDURE 


A questionnaire was administered to 136 students 
of the Ohio State University attending different 
courses of instruction in the liberal arts curriculum 
during the Summer quarter. The questionnaire con- 
sisted of 64 items, each “portraying a student in 
conflict about taking money which does not belong 
to him.” Each item included all six sources of varia- 
tion presented in the same sequence, but varying the 
level of any one treatment (high and low) from item 
to item. This combination of levels required 64 (2^6) 
stimulus items. Subjects were asked to predict on a 
scale ranging from 0 (will definitely not) to 6 (will 
definitely) whether the student would take the 
money. The high and low levels were stated as 
follows: 


1. Reinforcement value of gain (RVgn): high: 
the money is needed for an operation; low: the 
money is needed for a debt. 

2. Expectancy of gain (Egn): high: the operation 
will definitely cure the illness, the money will pay 
for the entire debt; low: the operation may not cure 
the illness, the money will pay only a small part of 
the debt. 

3. Reinforcement value of censure (RVcens): 
high: the student will be publicly expelled from the 
university; low: the student will be able to settle 
the matter privately. 

4. Expectancy of censure (Ecens): high: the student 
expects to get caught; low: the student does 
not expect to get caught. 

5. Severity of offense: high: the student intends 
to steal the money; low: the student intends to 
borrow the money, but without authorization. 

6. Reference group: high: the student intends to 
take the money from a social fraternity of which 
he is a member; low: the student intends to take 
the money from the tolls of a parking lot which he 
attends. 
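
The 64-item design described above is a full factorial crossing of six two-level sources of variation. The following is a minimal sketch of how such an item set could be enumerated; the factor labels and the function name are illustrative assumptions, not the authors' own coding scheme.

```python
from itertools import product

# Short labels for the six sources of variation listed above.
# These names are hypothetical; the report does not give a coding scheme.
FACTORS = [
    "RV_gain",
    "E_gain",
    "RV_censure",
    "E_censure",
    "severity_of_offense",
    "reference_group",
]
LEVELS = ("high", "low")


def full_factorial(factors=FACTORS, levels=LEVELS):
    """Return one dict per stimulus item, covering every combination of levels."""
    return [
        dict(zip(factors, combo))
        for combo in product(levels, repeat=len(factors))
    ]


items = full_factorial()
print(len(items))  # 2 ** 6 = 64
print(items[0])    # the item with every source at its high level
```

Each dictionary corresponds to one item pattern, so every source of variation appears at its high level in exactly half of the items.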


Following are two examples of complete items: 


9. The student treasurer of a social fraternity, 
who urgently needed a substantial sum of money 
for a crucial operation, was considering stealing 
money from the fraternity since he did not have 
any money of his own. The surgeon could not at 
all guarantee the student that the operation would 
cure his illness, but the student knew that sooner 
or later his theft would be detected. However, he 
was certain that he would be able to settle 
the matter privately with the president of the 
fraternity. 


Risk HYPOTHESIS 


TABLE 1 
ANALYSIS OF VARIANCE OF PREDICTION SCORE 

Source                        SS      df        MS           F 
E gain (A)                  1676       1    1676.0     664.815** 
RV gain (B)                 1191       1    1191.0     472.431** 
E censure (C)               1111       1    1111.0     440.698** 
RV censure (D)              4401       1    4401.0    1745.735** 
Severity of offense (E)      189       1     189.0      74.970** 
Reference group (F)            0       1         -           - 
A X B                         71       1      71.0      28.163** 
C X D                          0       1         -           - 
B X D                          5       1       5.0       1.983 
A X C                          0       1         -           - 
F X D                          0       1         -           - 
F X E                          0       1         -           - 
B X E                          7       1       7.0       2.776 
E X C                          2       1       2.0        .793 
B X C                          2       1       2.0        .793 
A X E                          9       1       9.0       3.570 
C X F                          0       1         -           - 
A X D                         25       1      25.0       9.916* 
A X F                          1       1       1.0        .396 
B X F                          2       1       2.0        .793 
D X E                          3       1       3.0       1.190 
Additional interactions      187      42         -           - 
Residual                  21,787    8640     2.521           - 
Total                     30,669    8703 

*p <. 
** p < .001 


44. The student attendant of a parking lot, who 
urgently needed a substantial sum of money to 
pay a personal debt, was considering borrowing 
the money from the parking tolls without authori- 
zation since he did not have any money of his 
own. The money would pay his entire debt. The 
student was certain that his unauthorized loan 
would not be detected, but he also knew that if 
it were detected, he would be publicly expelled 
from the University. 


The order of the items was randomized with the 
aid of a table of random numbers. Additional in- 
structions to the subjects emphasized that they were 
only to predict whether or not the student would 
take the money, and not to judge how wrong it 
would be to take the money. The Kuder-Richardson 
coefficient of reliability of the 64 items was found to 
be .98, indicating a high degree of consistency in the 
responses.² 


RESULTS 


Table 1 presents an analysis of variance of 
the six sources of variation and their first- 
order interactions. With the exception of the 


2 Here, a slightly modified computer program of 
the Kuder-Richardson Formula Number 8 was used. 
The modification consisted of the use of an 
item-total-less-item rather than an item-total 
correlation analysis. This K-R formula is based on 
the assumption that the intercorrelations of the items 
are equal. However, this assumption was not fully 
justified in view of the results of the factor analysis 
conducted on the same items in the latter part of this 
report. Hence, the above coefficient may be an 
overestimation of the "true" reliability. 
reference group, each main source of variation 
shows a significant effect on the 
prediction of unethical behavior. Although two 
first-order interactions are also statistically 
significant, the main effects of the five sepa- 
rate sources of variation are much more pro- 
nounced than the interaction effects. This 
indicates that the separate main effects are 
approximately the same for the different 
levels of each of the other sources of varia- 
tion (Lindquist, 1953, p. 214). 

Further inspection of Table 1 also shows 
that the most important source of variation 
in predictive judgments is RVcens (public 
expulsion from the university versus settling 
the matter in private). This source alone ac- 
counts for approximately 50% of the explained 
variance.³ Ecens is not a greater source of 
variation than either Egn or RVgn. Of the five 
separate sources of variation which did pro- 
duce a significant effect, the severity of the 
offense (stealing versus borrowing without 
authorization) is the least important one. It 
accounts for only 2% of the explained 
variance. 
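
The percentages quoted above can be checked against Table 1. The sketch below, which makes no claim to reproduce the authors' exact computation, works from the sums of squares as read from the table and shows two calculations: each main effect's share of the explained sum of squares (total minus residual), and the quantity given by the footnoted formula, df x (MS effect - MS residual) / Total N. Taking "Total N" to be 136 subjects x 64 items is an assumption.

```python
# Sums of squares for the six main effects, as read from Table 1.
SS_MAIN = {
    "E gain": 1676,
    "RV gain": 1191,
    "E censure": 1111,
    "RV censure": 4401,
    "Severity of offense": 189,
    "Reference group": 0,
}
SS_TOTAL = 30669
SS_RESIDUAL = 21787
MS_RESIDUAL = 2.521
TOTAL_N = 136 * 64  # assumption: subjects x items


def share_of_explained(ss_effect):
    """Effect sum of squares as a fraction of the explained sum of squares."""
    return ss_effect / (SS_TOTAL - SS_RESIDUAL)


def variance_component(ss_effect, df=1):
    """Footnoted formula: df * (MS_effect - MS_residual) / Total N."""
    ms_effect = ss_effect / df
    return df * (ms_effect - MS_RESIDUAL) / TOTAL_N


for name, ss in SS_MAIN.items():
    print(f"{name:22s} share={share_of_explained(ss):.3f} "
          f"component={variance_component(ss):.3f}")
```

Under these assumptions the share for RV censure comes out close to one half and the share for severity of offense close to 2%, in line with the figures in the text.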

The results of the analysis of variance 
support the first hypothesis that Ecens and 
RVcens are separate and significant sources of 
variation in predictive judgments of ethically 
unacceptable behavior. The second hypothesis 
that each of these two sources accounts for 
more of the variance in judgment than other 
sources is supported only in part. While 
RVcens was found to be the most important 
source, Ecens was of no greater importance 
than either Egn or RVgn. 

The identification of RVcens as the most 
important source of variation in predictive 
judgments led to the further question whether 
this same source would also account for more 
covariations in judgments than the other 
sources. That is, do the subjects assign as 
much importance to RVcens in clustering the 
stimulus items as they do in predicting the 


3 The formula used for the computation of the 
relative magnitude of variance of a main effect in a 
“fixed effect” model of analysis of variance is: 

$\dfrac{df_{\text{main effect}}\,(MS_{\text{effect}} - MS_{\text{residual}})}{\text{Total } N}$, 

according to Duncan (1959, pp. 609-613). However, 
the above obtained percentages should only be 
considered estimates, since each main effect is represented 
by only two (possibly nonuniform) levels. 
TABLE 2 
PATTERN, FACTOR LOADING, AND MEAN JUDGMENT FOR EACH STIMULUS ITEM 

Item   |                     Item pattern                                   |   Varimax rotated factor loadings   |    | 
number | Egn | RVgn | Ecens | RVcens | Severity of offense | Reference group | I | II | III | IV | V | VI | VII | VIII | h² | Mean 
GM eae ee EU ay H 84| 14|—23|—11|—11| 06] 01] —06] 804 | 2.31 
EA DA | |G | L H 73| 32|—24| 15,—06| 05] 20|—12| 773 | 1.96 
ET I er Pop] TUER H 72| 20|—32| 01|—12| 23| 05|—20| 770 | 246 
£5. d a rid RTI DD SO [D S L 68| 07|—35| 04|—10| 39|—11| 00| 768 | 2.59 
o |EILIL|H| EH H 65| 27|—27|—26|—22| 17,—26|—10| 790 | 1.87 
Rom art] Pus iren Mb d E H 63| 44|—14|—27|-21| —11| —20| 06] 790 | 1.32 
HT S ED qs CELA E L 57| 32|—15|—27|—22| 25|—11| 04| 643 | 121 
emai |G Taal ote |) ade L 56| 05|—36|—12|—10| 44| 10] 20] 722 | 2.56 
Rone SES ZEE feb | «Hi H 54| 20|—38| 03|—03| 32] 27|—39| 799 | 1.86 
Waa Pc Do o Rz TE L 52| 47|—23|—25|—19| 09|—13| 20| 710 | 1.36 
iei rm Bs ed Mas ta OO H 32| 36| 00|—15|—25| 26] 24|—16| 469 | 1.08 
4 |L|L|H|H|L H 21| 84|—05| 01|—08|—02| 06] 03| 769 | .61 
Bele b | Ho] H-|. b Ẹ 12| 82| 02|—02|—16| 15|—06| 01] 742 | .70 
s) |L|IL|EBH|H|H H 18| 80|—01| 03|—12|—05|—02| —09| 697 | .56 
EAEE PA [ED CES | ER L 18| 77, 03| 05|—11| 11|—04|—03| 657 | -50 
45 |H|L|H|H| H H 17| 70|—12|—42|—13| 14] 09|—05| 755 | .89 
4o | m|L|H|H|lL L 16| 64|—13|—44|—01| 18|—O1| 16] 607 | .97 
18 | H|IL|H|H jn H 15| 60|—15|—49|—18| 00| 12|—07, 699 | 1.07 
sS |L|H|H|H| H H 01| 58|—11| 10|—-59|—11|—27| —15| 818 | -98 
SEUDUDLOGHHIHB|. E L |-05| 55|—12| 11|—60| 01|—21|—12| 750 | 1.00 
2" HILIHIH|H L 06} 54|—24|—40|—11| 32] 11|—18| 677 | .88 
54 |HH|H|H| H E |-i11| 45|—45|-19|—44| 11|—33|—15| 795 | 1.85 
1$9| BH| E| E | L| L H 13| 03|—81|—10|—21| 03] 06|—13| 751 | 3.85 
Cerne LI |b | L H 33 | —02 | —78| —16| —21 | —02| —06 | —05| 789 | 443 
We HH LIE L 27|—06|—76|—05|—19| 25|—17| 12| 798 | 4.39 
49 |H|H!H|L|L L 18| 08|—75|—03|—38| 21|—10| 04| 792 | 3.67 
8|H|HL|L|H H 36| 04|—74|—02|—13| 06|—14|—20| 762 | 414 
144 |H H|H|L| HE L 12| 11|—73|—07|—23| 33|—08|—O05| 740 | 3.49 
eds pe s I n H 29| 06|—71|—13| 01| 13] 16|—31| 753 | 3.08 
3 |H|H!H|L| HR H 20| 18|—66|—04|—34| 26] 01|—28| 775 | 3.37 
TEENE |. de L L 38| 09|—65|—07,—05| 40|—04| 02| 742 | 2.89 
IPE Mn es L L 47| 07|—64|—17| 07| 28|—05| 18] 777 | 3.59 
ERE. USA USD I ues | 2E H 59|—04|—64|—18| 06| 17] 08|—10,| 838 | 3.68 
AUS fea ol fats LBS obs | ed. T 22| 02|—63| 12|—13| 21| 00| 27| 597 | 3.57 
SEEDS ELA UT al, S L H 38|—03|—62| 05|—32|—15| 01|—08| 615 | 3.64 
2|H|IH|L|L|H L 31|—04|—60|—01|—26| 32|—40|—07| 788 | 3.78 
BOW ||) BSEC Bub H 18| 08|—57| 07|—50| 00| 20|—16| 674 | 2.57 
oue o Hes Dp ec H 14| 08|—55| 01|—30| 13|—11|—39| 605 | 2.37 
ves sed sci M Ts H 55! 03|—53|—10|—03| 24|—14|—24| 732 | 3.19 
61 |L|H|L|L| E L 40| 06|—51| 12|—47| 19|—41|—05| 861 | 2.88 
As IET. ES E qe. E H 54| 03|—47|—02|—30| —07| —24 | —24 | 723 | 2.79 
got rro L PP Oe | B Wi H L 34| 12|—05|—51|—34| 48] —12] —05| 757 | 1.61 
S5 gem ||| La) Ge | ee | H 47| 25|—45|—48|—23|—08|—06| 02| 782 | 245 
4 |H|L'L|H|L L 49| 24|—46|—45|—23| 10|—09| 14] 804 | 2.04 
eh INSU EF IER DUE s L ]|-06| 21|—24|—12|-77, 09| 11) 11| 738 | 134 
i LIH|IL|H|'H L 14] 09|—11|—14|—69| 38|—11|—18| 721 | 1.76 
SARVED fins [arog SEES e H 35| 02|—26|—17|-68| —10|—05|—07| 605 | 1.97 
DORSET LED LS EET | SUE H 12| 03|—12|—41|—66| 23|—16,—17| 745 | 164 
4 |L|H|L|H|L L 16| 13|—25|—01|—66| 09| 02| 17 584 | 1.9 
5 LIH|H|H!|L H 06| 48|—11| 07|—64|—06|—21|—05, 707 | 1.03 
POM ema PH | OH Am cb L 16| 10|—43| O01|—s8| 25| 24| 03) 686 | 226 
VHS CHO A | E] n L |-01| 32|—55|—29|-54| os| 12|—01, 705 | 2.46 
3 |H|H H|H.L H 08| 20|—48|—12|—52|—04| O07|—04| 569 | 2.26 
9 |L HIH|L|H L 23| 12|—41| 09|—52| 21|—24|—22| 679 | 2.24 
5S yl Beet) | Ep |o E H 23| 26|—49|—20|—82| 03|—35|—19| 836 | 2.47 
2 H|H|L|H|L H 11| 20|—58|—31|-50|—06|—09| 00| 752 | 299 
36 H|H|L|H|L L 10| 15|—62|—39|—44| O1|—21| 15| 820 | 2.86 
10 |H HIH|H| H H |—06| 32|—58|—22|—43| 08| o1|—05] 686 | 2.19 
22:9 EEOAE ETE SE L 29| 12|—49|—08|—09| 64| 01|—13| 781 | 2-64 
133 |L L|H|L H L 29| 24|—30|—03|—18| 57|—03|—07| 595 | 1.76 
n ,HLILIL,H L 44| 19|—52|—08|—07| 55|—11|—03| 330 | 3-46 
H |.) LD) HESL DL L 33| 24|—39| 00|—32| 38| 03|—11| 531 | 2-16 
64 |H|HIL|H|H L 15| 15|—53|—32|—44| 09|—47|—03| 852 | 2.68 
2|BH|L|H|L H | H '| 29) 22)1—34 -01|-18| 15|—06|—68| 66 | 2-14 
"ap |-130 | .111 | 208 | .044 | .126 | .056 | .029 | .029 | (.733)| (2-28) 




probability of the occurrence of unethical be- 
havior? To test the latter hypothesis, a 
principal factor analysis (Harman, 1960) was 
computed on the predictive judgments, and 
eight orthogonal factors were extracted which 
accounted for 73% of covariation in judg- 
ment. The criterion for the number of factors 
extracted was that the eigenvalue should 
exceed one (Kaiser, 1960). The extracted 
factors were rotated, using a normalized 
varimax (Kaiser, 1958).⁵ The rotated factor 
structure is shown in Table 2, together with 
the item pattern. 
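
The procedure described above (unities in the diagonal, factors retained when the eigenvalue exceeds one, normalized varimax rotation) can be illustrated with a short NumPy sketch. This is not the authors' program: the function names are hypothetical, the rotation uses the SVD-based varimax iteration in common use today, and the small correlation matrix is invented solely to make the example runnable.

```python
import numpy as np


def extract_factors(corr, min_eigenvalue=1.0):
    """Principal factors of a correlation matrix with unities in the diagonal.

    Keeps only factors whose eigenvalue exceeds `min_eigenvalue`.
    Returns (loadings, all eigenvalues in descending order).
    """
    eigvals, eigvecs = np.linalg.eigh(corr)
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    keep = eigvals > min_eigenvalue
    loadings = eigvecs[:, keep] * np.sqrt(eigvals[keep])
    return loadings, eigvals


def varimax(loadings, normalize=True, max_iter=100, tol=1e-8):
    """Orthogonal varimax rotation (SVD-based iteration)."""
    A = np.array(loadings, dtype=float)
    if normalize:
        # Normalized varimax: scale each row to unit length before rotating.
        h = np.sqrt((A ** 2).sum(axis=1, keepdims=True))
        A = A / h
    n_items, n_factors = A.shape
    R = np.eye(n_factors)
    previous = 0.0
    for _ in range(max_iter):
        L = A @ R
        target = L ** 3 - L @ np.diag((L ** 2).sum(axis=0)) / n_items
        U, s, Vt = np.linalg.svd(A.T @ target)
        R = U @ Vt
        current = s.sum()
        if previous != 0.0 and current / previous < 1.0 + tol:
            break
        previous = current
    rotated = A @ R
    return rotated * h if normalize else rotated


# Invented 6 x 6 correlation matrix with two clusters of items.
corr = np.full((6, 6), 0.1)
corr[:4, :4] = 0.6
corr[4:, 4:] = 0.6
np.fill_diagonal(corr, 1.0)

unrotated, eigvals = extract_factors(corr)
rotated = varimax(unrotated)
explained = eigvals[eigvals > 1.0].sum() / eigvals.sum()
print(unrotated.shape, round(float(explained), 3))
print(np.round(rotated, 2))
```

Because the rotation is orthogonal, each item's communality (the row sum of squared loadings) is unchanged by it; only the distribution of variance across factors shifts.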

Low RVcens appears as a separate dimension 
in Factor III. This factor alone accounts for 
nearly 30% of the explained covariations in 
judgments, which is more than the covariations 
explained by any other single factor. 
Covariations due to high RVcens are found in 
Factor IV. High Ecens and high RVcens combine 
to form Factor II. Thus, RVcens (Factors 
III, IV, and in part Factor II) accounts 
for nearly 50% of the total explained covariations. 
High RVgn (Factor V) and 
low RVgn in association with low Egn 
(Factor I) account for an additional 35%. 
From these findings it becomes apparent that 
RVcens is the most important source of covariation 
in the above judgments. 
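
The shares quoted in this paragraph follow from the bottom row of Table 2, which gives the proportion of variance for each rotated factor. A brief sketch of the arithmetic is given here; the entries for Factors I and III are partly garbled in the printed row and are assumed to be .130 and .208, values that make the row sum to the printed total of .733.

```python
# Proportion of total variance per rotated factor (bottom row of Table 2).
# Factors I and III are assumed readings of garbled entries.
PROPORTIONS = {
    "I": 0.130,
    "II": 0.111,
    "III": 0.208,
    "IV": 0.044,
    "V": 0.126,
    "VI": 0.056,
    "VII": 0.029,
    "VIII": 0.029,
}


def share_of_explained(factors):
    """Fraction of the explained covariation carried by the given factors."""
    total = sum(PROPORTIONS.values())
    return sum(PROPORTIONS[f] for f in factors) / total


print(round(sum(PROPORTIONS.values()), 3))               # total explained
print(round(share_of_explained(["III"]), 3))             # low RVcens
print(round(share_of_explained(["II", "III", "IV"]), 3)) # RVcens factors
print(round(share_of_explained(["I", "V"]), 3))          # RVgn factors
```

On these readings Factor III carries a little under 30% of the explained covariation, Factors II through IV together a little under 50%, and Factors I and V together about 35%.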


DISCUSSION AND CONCLUSION 


The above findings show that the severity 
of censure is the most important source of 
variation in predictive judgments of unethical 
behavior. Further analysis reveals that 
the severity of censure is also the most important 
source of covariation in such judgments. 
That is, subjects tend to cluster the 
stimulus items more on the basis of high 
and low censure than on any other basis. 
The expectancy of censure, although a sepa- 
rate and significant source of variation in 
predictive judgments, proved to be no more 
crucial than either the expectancy or the re- 
inforcement value of the gain. Furthermore, 


4 The computer program of this factor analysis 
uses unities in the main diagonals as communality 
estimates. These communality estimates are not 
iterated by the computer. 

5 The rotation was done on the UNIVAC Scientific 
1103 Computer in the Computing Laboratory of 
Southern Methodist University. 
the expectancy of censure did not emerge as 
a separate source of covariations in judg- 
ments. The consistent finding of the impor- 
tance of severity of censure as a separate 
source of variation and covariation in predic- 
tive judgments, especially in comparison with 
the expectancy of censure, strongly suggests 
that the concept of “ethical risk” in predict- 
ing unethical behavior should be interpreted 
more in terms of the risk of public exposure 
and censure than in terms of the risk of 
“getting caught.” Interpreted in the above 
manner, the concept of ethical risk appears 
to be the single most important antecedent 
condition for the engagement in unethical 
behavior. 

However, before such a conclusion is 
reached, one must evaluate the possibility 
that the range of the high and low levels of 
censure (public exposure versus private settle- 
ment) as expressed in the present stimulus 
items, may have been greater for the severity 
of censure than for the other five sources of 
variation. Such uneven range in the different 
sources of variation could seriously affect the 
range of predictive judgments associated with 
each source. However, since the severity of 
censure also proved to be the greatest source 
of covariation in judgments, the possibility 
of such a stimulus artifact is somewhat dis- 
counted since covariations in judgments are 
relatively little affected by the range of 
stimuli. Moreover, this problem brings up the 
perhaps more meaningful question as to why 
the range in the severity of censure may have 
been perceived as being greater than that of 
the other sources of variation. This question, 
of course, is closely associated with the more 
general question of why severity of censure 
altogether assumes such great importance in 
the prediction of unethical behavior. In a 
reply to the question, only speculation is 
possible at this time. The authors would sug- 
gest that one possible reason for the severity 
of censure to assume such importance lies in 
the fact that the administration of high 
censure (public expulsion from the univer- 
sity) would block the individual in an almost 
irreversible and permanent fashion from at- 
taining his more important goals. Since the 
implementation of high censure leaves the 
individual with practically no further chance 
to secure his more important objectives, e.g., 
studying at a university, the risk involved to 
the individual is at a maximum. However, it 
should be clear that it is not the irreversible 
and permanent consequence per se which may 
be the deterring aspect but the irreversibility 
and permanence of certain types of social 
censure, since otherwise the potential loss of 
health or life (e.g., through the nonperform- 
ance of a crucial operation) would have a 
deterring effect equal to that of social censure. 
This distinction is an important one since it 
suggests that theorems of utility and reward 
(Brandt, 1954) may be insufficient in ac- 
counting for ethical propositions. To explain 
the significance of certain types of social 
sanctions in deterring unethical behavior 
one must, perhaps, resort to the additional 
theorem that publicity (public censure) favors 
universally acceptable rather than particu- 
laristic behavior (Stouffer, 1949; Sutcliffe & 
Haberman, 1956). 

Since the present study has been confined 
to students and to one particular mode of un- 
ethical behavior (taking money illegally), 
further generalizations are unwarranted at 
this time. However, these very limitations im- 
mediately suggest the further experimental 
exploration and extension of the ethical risk 
hypothesis to other samples and to other 
modes of ethically disputable behaviors. 
Similar research is now in progress. Among 
the questions further explored are the rela- 
tionship of verbal predictive judgments to 
nonverbal unethical behavior, the cross- 
cultural applicability of the ethical risk hy- 
pothesis, and the degree to which the impor- 
tance of goals other than the one pursued by 
the unethical conduct relates to the potential 
effect of public social censure. The last prob- 
lem may be of particular interest since it is 
not unreasonable to conjecture that if a 
person does not have alternate goals in life 
which are of greater importance to him than 
the one he pursues by his unethical conduct, 
he has little to lose by this conduct, and any 
form of censure may become a meaningless 
proposition to him. 
COMPARISON OF THE EFFECTS OF INTROVERSION- 
EXTRAVERSION AND ANXIETY ON CONDITIONING¹ 


HARVEY A. SWEETBAUM 
Montgomery County Health Department, Rockville, Maryland 


This study investigates the effects of Eysenck’s introversion-extraversion theory 
and Spence-Taylor’s concept of anxiety on the eyeblink conditioned response 
by varying these factors independently. 56 patients were subjected to con- 
ditions of high and low anxiety. Group I (anxious group) were about to 
undergo major surgery; Group II (nonanxious group) were patients who 
had successfully recovered. These patients were divided into high introverts 
and high extraverts, yielding 4 groups (anxious introverts, anxious extraverts, 
nonanxious introverts, nonanxious extraverts). The anxious groups showed 
greater conditioning than the nonanxious groups and no significant differences 
were found between the high and low introvert groups, supporting the 
hypothesis that total drive is, in part, a function of internal anxiety. 


In recent years there has been increasing 
attention given to experimental studies of 
anxiety and conditioning. Using the Manifest 
Anxiety (MA) scale as a selective device, a 
number of studies have been conducted by 
Spence and Taylor in an attempt to test hy- 
potheses derived from basic Hullian theory, 
concerning the relationship between the per- 
formance of human subjects in certain learn- 
ing situations and the level of total effective 
drive (Farber & Spence, 1953; Spence, 1953; 
Spence & Beecroft, 1954; Spence & Farber, 
1953, 1954; Spence, Farber, & Taylor, 1954; 
Spence & Taylor, 1951, 1953; Taylor, 1951, 
1956; Taylor & Spence, 1954). Their use of 
the Taylor MA scale has rested on the as- 
sumption that the scores on this scale are 
related to the level of emotionality or drive 
(D) (Taylor, 1951). 

Eysenck has specifically criticized the 
Spence and Taylor work on the grounds that 
drives do not summate in the manner postu- 
lated by Hull to provide a general fund of 
energy (Eysenck, 1957). Eysenck does ac- 
cept, as do Spence and Taylor, that anxiety 
has the qualities of a drive. However, he 


1 This article is based on a dissertation submitted 
to the University of Pennsylvania in partial fulfillment 
of the requirements for the PhD. The author is indebted 
to the staff at the Perry Point Veterans Administration 
Hospital, specifically Clinton Brown, William Marcus, 
and James Anker. Appreciation is extended to Seymour 
Feshbach and William Shaw for their extremely 
helpful suggestions and encouragement. 


particularly takes exception to the concept 
of anxiety as an irrelevant drive. 

Eysenck (1955a, 1955b) has developed two 
postulates which he calls the “Postulate of 
Individual Differences” and the “Typological 
Postulate.” The second of these postulates 
states that individuals in whom excitatory 
potential is generated slowly, and in whom 
excitatory potentials so generated are rela- 
tively weak, are thereby predisposed to develop 
extraverted patterns of behavior, and to de- 
velop “hysterical-psychopathic” disorders in 
cases of neurotic breakdowns; individuals in 
whom excitatory potential is generated quickly 
and in whom excitatory potentials so generated 
are strong, are thereby predisposed to develop 
introverted patterns of behavior and to develop 
“dysthymic” disorders in case of neurotic 
breakdown. “Dysthymia” is Eysenck's designation 
for neurotics suffering from anxiety, 
obsessions, compulsions, phobias, or reactive 
depressions. From his Law of Excitation and 
Inhibition, Eysenck determined that intro- 
verts should form conditioned reflexes quickly 
and strongly. Conversely, extraverts will form 
conditioned reflexes slowly and with difficulty. 
There is no indication in this system that 
differences in irrelevant drive should produce 
any differences in conditionability. Conse- 
quently, his prediction points to a correlation 
between conditionability and extraversion- 
introversion, but not to any correlation 
between neuroticism and conditionability. 
Eysenck further points out that his prediction 
of finding introverts to be more conditionable 
should therefore apply to neurotic 
introverts (the dysthymic group) just as 
much as to normal introverts, and his prediction 
that extraverts would be difficult to condition 
would apply to neurotic extraverts 
(hysterics and psychopaths) just as 
much as to normal extraverts (Eysenck, 
1957). 

Eysenck criticizes the Spence and Taylor 
theory for what he terms the failure of the 
MA scale to be a pure measure of neuroti- 
cism. Eysenck, citing experiments by Franks 
(1956, 1957a, 1957b), states that the MA 
scale, although a good measure of neuroticism, 
also correlates to some extent with intro- 
version. 

It is Eysenck's contention that, according 
to Spence's theory, hysterics should have high 
scores, since they are neurotic. Eysenck states 
hysterics should be very difficult to condition, 
in view of their extraverted personalities. 
Eysenck explains the positive results obtained 
by Spence and Taylor as being due to the 
fact that the Taylor scale is an inaccurate 
measure of neuroticism. He says that, besides 
being a measure of neuroticism, it also has 
a projection on the introversion axis. 

It is thus seen that Eysenck's arguments 
rest on the assumption that his dysthymic or 
introverted neurotic group is drawn from a 
different population than a highly anxious 
group would be. If, however, it is found that 
Eysenck's dysthymic personality is, in essence, 
a highly anxious one, then his theory receives 
little or no support from these previously 
cited experiments. Taylor and Spence (1954) 
have stated: 


In the typical textbook treatment, the relationship 
between internal anxiety and overt symptomatology 
is described for each behavior disorder separately. 
It can be seen from these discussions that the as- 
sumption has not been made by clinical theorists 
that defenses against internal anxiety operate in 
uniform fashion and with equal effectiveness in all 
diagnostic groups, or that the presence of defense 
reactions precludes the appearance of some manifest 
anxiety responses. In obsessive-compulsive disorders, 
for example, manifest anxiety is frequently observed, 
disappearing only when the obsessive thought and/or 
compulsive act have occurred. In the classical hyster- 
ical syndrome, on the other hand, manifest anxiety 
is almost completely absent . . . . Furthermore, the 
internal anxiety postulated to lie behind the symptoms 
is kept at a minimal level. Thus, the hysterical 
defense system may be considered to be particularly 
effective in checking internal anxiety and the 
obsessive-compulsive syndrome, in comparison, far 
less so [Italics added] (p. 498). 


This formulation is in agreement with current 
psychiatric and psychological concepts, i.e., 
that the group of psychiatric disorders classified 
as obsessive-compulsives, depressives, and 
anxiety neurotics (Eysenck’s introverted neurotics 
or “dysthymics”) are considered to be 
more anxious than those groups termed 
hysterics and psychopaths (Eysenck's extraverted 
neurotics or “hysterics”). 

The objectives of the present study are to 
vary the effects of anxiety and introversion 
independently of each other. In order to 
accomplish this, subjects must be selected on 
a basis not determined by psychiatric diag- 
nosis, since any such attempt fails to avoid 
the possibility of confounding the groups. 
Similarly, the Taylor MA scale is subject to 
the same criticism. However, the problems 
inherent in the previous studies can be elimi- 
nated by selecting subjects from an essentially 
normal population and subjecting them to a 
gross stress situation, designed to raise their 
anxiety level. According to Eysenck's theory, 
the anxiety factor will have no bearing on the 
subjects’ conditionability. Spence and Taylor, 
on the other hand, would predict that those 
individuals subjected to the stressful situa- 
tion would condition more quickly than a 
similar group not subjected to such stress. 
Furthermore, if in addition, these groups are 
subdivided into introverted and extraverted 
groups, Eysenck would predict that only the 
introverted subjects, irrespective of the stress 
situation, would condition more quickly than 
the extraverted subjects. Such a gross stress 
situation is already provided in subjects about 
to undergo major surgery (Price, Thaler, & 
Mason, 1957). 

On the basis of the foregoing, the following 
differentiated predictions can be made, ac- 
cording to whether one adopts the Spence- 
Taylor or the Eysenck view: 

1. According to the Spence-Taylor view, 
when the effects of introversion are held constant, 
a group of subjects drawn from an 
essentially normal population who are made 
anxious in a stressful situation, such as that 
experienced by patients about to undergo major 
surgery, demonstrates significantly greater 
conditioning than a similar group of patients, 
less anxious by virtue of their having re- 
covered from major and minor surgery. 

2. According to Eysenck's theoretical posi- 
tion, the following hypothesis would be as- 
serted: Introverted normals demonstrate sig- 
nificantly greater conditioning than Extra- 
verted normals where the effects of anxiety 
are held relatively constant. 

One should note that Eysenck specifically 
rejects the first hypothesis and maintains that 
anxiety level bears no relationship to ease of 
conditioning. 

PROCEDURE 
Subjects 


All subjects were selected randomly from the surgi- 
cal ward of the Veterans Administration Hospital at 
Perry Point, Maryland. Upon admittance to the 
ward, those patients who were found to have no his- 
tory of psychiatric illness, were administered the 
Guilford R scale, which was used as a measure of 
introversion-extraversion. Because this was a Vet- 
erans Administration Hospital, the patient popula- 
tion proved to be an entirely male group. The range 
of intelligence varied considerably, but because of 
the rather sophisticated vocabulary required for the 
Guilford R scale, most of the patients could be 
considered to have about average or better intelli- 
gence. These patients were then assigned randomly 
into two groups, Group I, the presurgery group, and 
Group II, the postsurgical group. In order to intensify 
the effects of anxiety, only patients about to 
undergo major surgery within a 48-hour period were 
selected for the experiment in the presurgery group. 
This special selective feature was not utilized for the 
postsurgical group, with the result that 3 minor surgery 
procedures, along with 25 major surgery procedures, 
were included in the postsurgical group. All 
conditioning in the presurgical group was administered 
within the 48 hours before the operation. All conditioning 
in the postsurgical group, on the other 
hand, was administered at least a week 
following surgery and, in each case, the patient 
was considered by the attending surgeon to have 
made a satisfactory recovery from the operation. 
Fifty-six patients were finally selected, 14 in each 
of the four groups of anxious introvert, anxious ex- 
travert, nonanxious introvert, nonanxious extravert. 
Eighteen additional patients who were originally 
tested had to be rejected for the following reasons: 
1 because he failed to cooperate after the experiment 
had begun, 1 because he fell asleep during the experiment, 
14 because of instrument failure, and 2 because 
they failed to meet the latency criteria. 

Following the conditioning procedure for all sub- 
jects, the Guilford R scores were tabulated. Subjects 


who scored 38 and below, using Guilford’s scoring 
procedure, were designated as the introverted group. 
All subjects who scored 40 and above were desig- 
nated as the extraverted group. 
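
The grouping rule just stated can be written as a small function. This is only a sketch of the cutoffs as given; the function name is hypothetical, and since the text does not say how a score of exactly 39 was handled, the sketch leaves such a score unassigned.

```python
def classify_guilford_r(score):
    """Assign a Guilford R score to a group using the cutoffs in the text.

    Scores of 38 and below are treated as introverted, 40 and above as
    extraverted. A score of 39 falls between the stated cutoffs, so it is
    returned as None (an assumption; the report does not address it).
    """
    if score <= 38:
        return "introvert"
    if score >= 40:
        return "extravert"
    return None


for score in (30, 38, 39, 40, 55):
    print(score, classify_guilford_r(score))
```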


Apparatus and Method of Recording 


Subjects were seated in a straight-back chair, in a 
semidarkened room. They were told to fixate on a 
6-inch square smoky glass window, at a distance of 
3 feet in front of the subject. The conditional stimu- 
lus (CS) consisted of an increase in brightness of the 
window, which was provided by a 7-watt bulb situ- 
ated immediately behind the window. The duration 
of the CS on each trial was 750 milliseconds. Following 
the onset of the CS by 700 milliseconds, the 
unconditioned stimulus (UCS), a puff of air, was 
administered in the region of the outer iris of the 
right eye. The puff of air was produced by a sole- 
noid, driving an arm against a tambour. The arm 
depressed a thin rubber sheet, stretched tightly over 
the tambour, thereby forcing air through a 4-foot 
rubber hose, a sixteenth of an inch internal diame- 
ter, the end of which was affixed to a pair of ordi- 
nary spectacle frames worn by the subject. Intertrial 
intervals of 15 seconds were employed. The onset of 
the CS was controlled by the closing of a switch on 
an electronic timer. The latter also controlled the 
duration of the CS and the onset of the UCS at the 
appropriate interval. 

The eyelid movement was recorded by a very small 
and sensitive photoelectric cell, .5 inch in length and 
.25 inch in diameter, which was attached to the 
spectacles just above the right eye so that the con- 
ditioning and recording of the blink were obtained 
from the same eye. The photoelectric cell measured 
the difference in the intensity of light reflected off 
the pupil, there being more light reflected when the 
eye was open and a decrease in light during the 
blink. The impulse obtained from this photoelectric 
cell was amplified and recorded on a Brush pen and 
ink, two-arm recording apparatus. A conditioned re- 
sponse was recorded whenever the record showed a 
deflection of one-eighth inch or more, in the interval 
200 to 700 milliseconds following the onset of the 
CS. 
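
The scoring rule just described can be restated as a small check. The Python sketch below is illustrative only: the `Deflection` record, the function name, and the decision to treat both ends of the 200 to 700 millisecond window as inclusive are assumptions made for the example, not details given in the original report.

```python
from dataclasses import dataclass

# Thresholds taken from the text; the record format is hypothetical.
MIN_DEFLECTION_INCHES = 1 / 8   # smallest pen deflection counted as a response
WINDOW_START_MS = 200           # earliest scorable latency after CS onset
WINDOW_END_MS = 700             # UCS onset; assumed inclusive here


@dataclass
class Deflection:
    """One pen deflection read off the paper record (assumed format)."""
    latency_ms: float    # time from CS onset to the start of the deflection
    size_inches: float   # amplitude of the deflection


def is_conditioned_response(d: Deflection) -> bool:
    """Return True when a deflection meets the stated scoring criterion."""
    in_window = WINDOW_START_MS <= d.latency_ms <= WINDOW_END_MS
    large_enough = d.size_inches >= MIN_DEFLECTION_INCHES
    return in_window and large_enough
```

Under these assumptions a deflection of 0.2 inch at 450 milliseconds would be scored as a conditioned response, whereas one at 150 milliseconds, or one smaller than one-eighth inch, would not.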


Conditioning Procedure 


In order to further intensify the desired effects, a 
varied instruction technique was given to the pre- 
and postsurgery groups. The presurgery subjects were 
given little or no explanation as to the purpose or 
intent of the procedure they were about to undergo. 
Any questions they raised were deferred until the end 
of the conditioning session. At the same time, the 
complex recording equipment was in their view and 
the examiner made no attempt to allay their anxiety 
about it. 

For the postsurgery group, a partial explanation 
of the experiment was outlined to them with empha- 
sis on the fact that this study was being made for 
scientific purposes and had little or no relevance for 
them as individuals. All care was taken to allay any 
fears or anxieties they might have about the study. 
(The presurgery group was provided with the same 
explanation following the conditioning session.) All 
subjects were conditioned in a single session. 


TABLE 1 

COMPARISON OF GUILFORD R SCALE SCORES BETWEEN 
ANXIOUS-NONANXIOUS INTROVERTS AND ANXIOUS- 
NONANXIOUS EXTRAVERTS 

Pre-Post df t 
Introvert 1 1.984 
Extravert 3 1.984 

Each subject first received three presentations of 
light alone in order to check for any initial condi- 
tioning responses to the light. A puff of air was then 
administered and spontaneous blinking recorded for 
20 seconds. Immediately following the preliminary 
trials, all subjects were given 60 experimental trials 
with the tenth of each block of 10 a test trial. Two 
measures of recording the conditioned response were 
employed. The first measure is number of condi- 
tioned responses on the six test trials for each sub- 
ject. The second measure is reported in Woodworth 
(1938) and will hereafter be called the Anticipatory 
Response. To paraphrase from Woodworth, the sub- 
ject sees the light and the light is followed by a UCS 
to which he automatically blinks. After a number 
of such presentations, the response begins to occur 
slightly before the onset of the reflex to the UCS, 
and the reflex blink finds the lid already in motion. 
Test trials are unnecessary for recording this response. 


RESULTS 


A simple t test was made between the 
preoperative and postoperative extraverted 
groups, in order to confirm the assumption 
that these groups were equated in terms of 
their Guilford R scores. Table 1 presents the 
results of this test. No significant differ- 
ences were found between the two intro- 
verted groups and the two extraverted groups. 

Table 2 shows the total number of condi- 
tioned responses on the test trials, their mean 
and standard deviation for the four groups. 


TABLE 3 
CONDITIONED ANTICIPATORY TRIALS 

                          Sum R     M     SD 
High anxious introvert     259    18.5    9.7 
High anxious extravert     255    18.2    4.9 
Low anxious introvert       70     5.0    4.0 
Low anxious extravert       83     5.9    4.8 


Table 3 shows the total number of condi- 
tioned responses as measured by the Antici- 
patory Response measure. Only the 54 non- 
test trials were considered, the test trials be- 
ing excluded in this measure. The results are 
striking. The total number of conditioned re- 
sponses in the test trials were 49 and 52, re- 
spectively, for the high anxious introverted 
and the high anxious extraverted groups, and 
their means 3.5 and 3.7. This is compared to 
a total conditioned response of 32 in the low 
anxious introvert and 34 in the low anxious 
extraverted group. The respective means were 
2.3 and 2.4. This result is borne out and em- 
phasized even more dramatically in the an- 
ticipatory conditioned response. As shown in 
Table 3, there is a marked similarity between 
the high anxious introverted and the high 
anxious extraverted groups, with total condi- 
tioned responses of 259 and 255, respectively, 
with means of 18.5 and 18.2. Again, in the 
low anxious introverted group, a total condi- 
tioned response of 70, as compared to a total 
of 83 in the low anxious extraverted group, 
was obtained. The means here were 5 and 5.9. 
Analysis of variance for the test trials shows 
a significant difference, p < .01 between the 
presurgical and postsurgical groups. There 
were no significant differences found between 
the introverted and extraverted groups, as 
well as no interaction effects (see Table 4). 
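
The F ratios in Table 4 follow directly from the tabled sums of squares and degrees of freedom. The Python sketch below recomputes them as a consistency check; it assumes only the values printed in Table 4, and the helper name `f_ratio` is hypothetical.

```python
def f_ratio(ss_effect: float, df_effect: int,
            ss_error: float, df_error: int) -> float:
    """F = effect mean square divided by within-groups mean square."""
    ms_effect = ss_effect / df_effect
    ms_error = ss_error / df_error
    return ms_effect / ms_error


# Sums of squares and degrees of freedom as printed in Table 4.
SS_WITHIN, DF_WITHIN = 156.375, 52
EFFECTS = {
    "Anxious-Nonanxious (A)": (21.875, 1),
    "Introvert-Extravert (B)": (0.446, 1),
    "A x B": (0.286, 1),
}

for name, (ss, df) in EFFECTS.items():
    print(f"{name}: F = {f_ratio(ss, df, SS_WITHIN, DF_WITHIN):.3f}")

# The component sums of squares should add up to the tabled total (178.982).
total_ss = SS_WITHIN + sum(ss for ss, _ in EFFECTS.values())
```

The recomputed ratios agree with the tabled values to within rounding, and the component sums of squares add to the tabled total, which suggests the table is internally consistent.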


TABLE 2 
CONDITIONED TEST TRIALS 

                          Sum R     M     SD 
High anxious introvert      49     3.5    .64 
High anxious extravert      52     3.7    .69 
Low anxious introvert       32     2.3    .46 
Low anxious extravert       34     2.4    .41 


TABLE 4 
ANALYSIS OF VARIANCE FOR TEST TRIALS 

Source                     Sum of squares   df   Mean square      F 
Anxious-Nonanxious (A)          21.875       1      21.875      7.275* 
Introvert-Extravert (B)           .446       1        .446       .148 
A × B                             .286       1        .286       .095 
Within groups                  156.375      52       3.007 
Total                          178.982      55 

*p < .01. 


TABLE 5 
ANALYSIS OF VARIANCE FOR ANTICIPATORY TRIALS 

Source                     Sum of squares   df   Mean square      F 
Anxious-Nonanxious (A)          55.618       1      55.618     23.320* 
Introvert-Extravert (B)          1.001       1       1.001       .420 
A × B                             .440       1        .440       .184 
Within groups                  124.021      52       2.385 
Total                          181.080      55 

*p < .01. 


Bartlett’s test for homogeneity of variance 
was found to be nonsignificant. 

Table 5 shows the analysis of variance for 
the anticipatory response with the same rela- 
tionships holding true. 

Bartlett’s test for homogeneity of variance 
was found significant on the Anticipatory Re- 
sponse measure, and a transformation factor, 
using the square root of X + .5 was made. 
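
To illustrate why such a transformation is applied when group variances are heterogeneous, the Python sketch below applies the square root of X + .5 to two sets of counts. The counts are invented for the example and are not the study's data; the function name is likewise hypothetical.

```python
import math
from statistics import variance


def sqrt_transform(count: float) -> float:
    """Square-root transformation sqrt(X + .5) named in the text."""
    return math.sqrt(count + 0.5)


# Hypothetical response counts for a high-responding and a low-responding
# group (illustrative values only, not taken from the experiment).
high_counts = [25, 12, 30, 8, 19, 22, 14]
low_counts = [3, 7, 0, 9, 5, 2, 8]

# Ratio of group variances before and after the transformation.
ratio_before = variance(high_counts) / variance(low_counts)
ratio_after = (variance([sqrt_transform(x) for x in high_counts])
               / variance([sqrt_transform(x) for x in low_counts]))

print(f"variance ratio, raw counts:        {ratio_before:.2f}")
print(f"variance ratio, transformed counts: {ratio_after:.2f}")
```

With count data the variance tends to grow with the mean, so the transformed scores in this made-up example show a variance ratio much closer to 1 than the raw counts do.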

An overall Pearson correlation between the 
introversion and extraversion scores and ease 
of conditioning proved to be nonsignificant 
(r = -.059). 

DISCUSSION 


The results of this experiment strongly sup- 
port the first hypothesis. This hypothesis as- 
serts that total drive is, in part, a function of 
the level of internal anxiety and consequently, 
subjects with a high degree of anxiety would 
possess more drive and therefore, according 
to the Spence-Taylor position, should condi- 
tion more quickly. The data from the pre- 
surgery or anxious group and the postsurgery 
or nonanxious group show the former to be 
consistently superior in the amount of condi- 
tioning in both measures, with the differences 
being highly significant. These results are 
contrary to Eysenck's assertion that level of 
anxiety has no effect upon ease of condition- 
ing. The second hypothesis, which is based 
upon Eysenck’s Introversion-Extraversion the- 
ory of conditioning, is not supported by the 
data. There was no significant difference in 
the conditionability between the introvert 
and the extravert groups under conditions of 
either high or low anxiety. Furthermore, the 
overall Pearson correlation between intro- 
version-extraversion scores and ease of condi- 
tioning was not significant. 

These results suggest that Eysenck’s dys- 
thymic group and hysteric group could more 
parsimoniously be conceived in terms of high 
and low anxiety groups, respectively. Thus, 
it would appear that Eysenck has confounded 
his factor of introversion-extraversion with 
the concept of anxiety. Eysenck’s factor ana- 
lytic techniques appear to have provided a 
more clear-cut continuum of predisposition to 
anxiety, rather than introversion-extraversion. 

The fact that these results are consistent 
with the Spence-Taylor position is not neces- 
sarily evidence for the particular mechanism 
which they suggest is responsible for the ef- 
fects of anxiety upon conditioning. The ques- 
tion of whether anxiety facilitates condition- 
ing because it operates as an irrelevant drive, 
or because of some other characteristic of the 
anxiety state (e.g., type of response made to 
anxiety), is a problem which needs to be re- 
solved by subsequent investigation. 


CONCEPTUAL AREA ANALYSIS OF THE TEST PERFORMANCE 
OF MENTALLY RETARDED ADULTS ¹ 


A. B. SILVERSTEIN AND PHILIP J. MOHAN 


Pacific State Hospital 


The Object Sorting Test from the Goldstein-Scheerer battery and the Similar- 
ities subtest of the WISC were administered to 50 mentally retarded adults 
and scored by means of McGaughran and Moran's conceptual area analysis. 
Age, IQ, and length of hospitalization were significantly related to a number 
of the conceptual area measures, whereas sex and diagnosis were generally 
unrelated to these measures. There was some evidence of consistency in 
performance on the 2 conceptual tests, but the 2 parts of the Object Sorting 
Test seemed not to be measuring the same thing. It was suggested that for 
retardates, at least, the basic dimensions of conceptual area analysis are not 
independent. The abnormal concreteness of the retarded appears to differ 
from that of both schizophrenics and brain damaged individuals. 


The conceptual behavior of the mentally 
retarded is often characterized as abnormally 
concrete, but retardates are not alone in this 
respect, for schizophrenics and brain damaged 
individuals are also said to suffer from an 
impairment of abstract ability. Abstract and 
concrete are very general terms, however, and 
a series of studies by McGaughran and Moran 
has shown the fruitfulness of specifying some 
of the variables which these terms imply. 
These authors have postulated two dimen- 
sions of conceptual behavior—public-private 
and open-closed—the interaction of which 
yields the four conceptual areas—open-public, 
closed-public, open-private, and closed-private. 

The public-private dimension is defined in 
terms of the extent of social agreement on a 
concept, its communality of meaning. A rela- 
tively high degree of publicness implies that 
the principle underlying the concept is shared 
and freely communicated by the majority 
of persons in our culture. For example, 
“triangularity” is a more public concept than 
“attractiveness,” since the boundaries for the 
former are more culturally denotable than 
those for the latter. The open-closed dimen- 
sion is defined in terms of the number of 
common attributes which are used or implied 
in a concept, its order of conceptual classifica- 
tion. A concept based on a single attribute 
common to all the objects in a sorting, e.g., 
their red color, would be maximally open, 
whereas a concept binding together all the 
perceptible attributes of the objects, e.g., their 
absolute identity, would be maximally closed. 
Every concept can be scored on both dimen- 
sions, and thus falls into one of the four 
conceptual areas. 


1 This investigation was supported by a research 
grant from the California State Department of 

One study has already applied conceptual 
area analysis to the test performance of the 
retarded. Iscoe and Giller (1959) adminis- 
tered Rapaport’s (Rapaport, Gill, & Schafer, 
1945) modification of the Object Sorting Test 
(Goldstein & Scheerer, 1941) to a group of 
“garden variety” retardates, to test the hy- 
pothesis that “there would be an increase in 
abstractive ability according to an acceptable 
classification system as chronological age 
increases” (p. 112). The authors gave no 
data for the open-closed dimension, but re- 
ported that the subjects had few open con- 
cepts, i.e., concepts based on a single common 
attribute. With respect to the public-private 
dimension, they found that the older subjects 
had significantly fewer public concepts than 
the younger ones, a finding which went 
directly counter to their initial hypothesis. 

The present study was designed to seek 
answers to four questions: 

1. What are the effects of age, sex, IQ, 
diagnosis, and length of hospitalization on 
the conceptual behavior of retarded adults? 
To date, the only information on any of these 
factors comes from the study cited above, 
which dealt primarily with age changes. 

2. What degree of consistency do the 
retarded show in their performance on dif- 
ferent conceptual tests? Previous studies have 
shown that normals, schizophrenics, and brain 
damaged individuals each tend to adopt a 
specific approach to a variety of tests. 

3. What are the relationships among the 
measures of conceptual area analysis? The 
available evidence suggests that there is a 
relatively high negative relationship between 
the closed-public and open-private measures, 
but that the public-private and open-closed 
dimensions are independent. 

4. What are the similarities and differences 
between the conceptual behavior of the re- 
tarded and that of the other groups? Earlier 
investigations have demonstrated that the 
public-private dimension most clearly distin- 
guishes schizophrenics from normals, whereas 
the open-closed dimension shows the greatest 
difference between schizophrenics and brain 
damaged individuals. 


METHOD 
Subjects 


The sample consisted of 50 resident patients at 
Pacific State Hospital, 26 males and 24 females. 
From each of five diagnostic categories,² 10 patients 
were selected who had passed their twenty-first 
birthday, and whose most recently obtained IQ 
placed them at the mild or moderate level of retarda- 
tion (Heber, 1959). The mean Wechsler IQ of the 
sample was 56.4 (SD = 8.05); their mean age, 36.2 
years (SD=10.66); and their mean length of 
hospitalization, 15.2 years (SD = 9.92). 


Procedure 


The two tests employed were Rapaport's 
(Rapaport et al., 1945) modification of the Object 
Sorting Test (Goldstein & Scheerer, 1941) and the 
Similarities subtest of the Wechsler Intelligence Scale 
for Children (Wechsler, 1949). The Object Sorting 
Test was administered according to the procedure 
described by Rapaport: on the 7 items of Part I 
(active concept formation), the patient sorted the 
objects and defined his sorting; on the 12 items of 
Part II (passive concept formation), the patient 
defined the examiner's sorting. 


2 Encephalopathy due to postnatal cerebral infec- 
tion, mechanical injury at birth, and postnatal 
injury; cultural-familial mental retardation; and 
mental retardation, other, due to uncertain cause 
with the functional reaction alone manifest (Heber, 
1959). 

3 In studies conducted before the introduction of 
conceptual area analysis, Bolles (1937) and Stacey 
and Portnoy (1951) used the Object Sorting Test, 
and Stacey and Markin (1951) the Similarities sub- 

The procedure for administering Similarities dif- 
fered somewhat from that described by Wechsler: 
first the 4 Analogies items were given to each patient 
as a warm-up, and then all 12 Similarities items 
were given, regardless of the number of failures as 
judged by Wechsler’s scoring criteria. The WISC was 
used, rather than the Wechsler Adult Intelligence 
Scale, because its items were presumed to be easier. 

In previous studies, scoring agreement for the 
Object Sorting Test has ranged from a high of 93% 
(McGaughran & Moran, 1956) to a low of 83% 
(Iscoe & Giller, 1959). The present writers inde- 
pendently scored the complete records of 10 randomly 
selected subjects, i.e., 190 items, and obtained only 
70% agreement. Consequently, the remainder of the 
Object Sorting Test records and all of the Similarities 
records (excluding the Analogies items) were 
scored jointly. It should be noted that the relatively 
low reliability was obtained despite the use of 
McGaughran and Moran’s revised supplementary 
scoring instructions, and despite a considerable 
amount of discussion and practice beforehand. 


RESULTS AND DISCUSSION 
Effects of Background Variables 


Table 1 gives the means and standard 
deviations for the conceptual area measures. 
The first question which this study sought to 
answer was that of the effects of a number 
of background variables on conceptual be- 
havior. Iscoe and Giller (1959) have reported 
some evidence of a negative relationship be- 
tween age and the public-private dimension 
in the retarded. These same authors found 
no sex differences in the conceptual area 
measures for a portion of their sample. 
McGaughran and Moran (1956, 1957) have 
noted substantial relationships between intel- 
ligence and most of the conceptual area meas- 
ures in both normal and clinical groups, and 
have consequently controlled the effects of 
this variable in their analyses. In the present 
study, correlations were calculated between 
age, sex, IQ, and length of hospitalization, 
on the one hand, and the conceptual area 
measures, on the other. Analyses of variance 
were also performed to compare the five 
diagnostic groups on each measure. The .05 
level of significance was adopted for all sta- 
tistical tests reported in this paper. 


test of the Wechsler-Bellevue Intelligence Scale, to 
study the conceptual behavior of the retarded. 


TABLE 1 

MEANS AND STANDARD DEVIATIONS OF CONCEPTUAL 
AREA MEASURES FOR MENTALLY 
RETARDED ADULTS 

                          Object Sorting Test                 WISC 
                   Part I       Part II        Whole       Similarities 
Measure           M     SD     M     SD      M     SD       M     SD 

Open-public      0.9   1.05   1.9   1.79    2.8   2.31     1.6   1.40 
Closed-public    2.1   1.77   2.3   2.29    4.4   3.56     3.6   2.15 
Open-private     2.3   1.36   4.0   2.85    6.4   3.21     3.1   1.82 
Closed-private   1.7   1.34   3.7   2.85    5.4   2.97     3.7   2.14 
Public           2.9   1.61   4.3   2.59    7.2   3.26     5.2   2.39 
Open             3.2   1.75   6.0   3.33    9.2   4.07     4.6   2.20 
Autistic indexᵃ  5.2   2.78   6.7   4.45   12.0   6.00     4.4   3.46 

Note. N = 50. 

ᵃ Open-private minus closed-public, plus 5 for Parts I and II 
of the Object Sorting Test and Similarities; plus 10 for the 
Object Sorting Test as a whole. 
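
The autistic index defined in the note to Table 1 is a simple difference score with an added constant. The Python sketch below applies that definition to the group means printed in Table 1; the function name and the `whole_test` flag are assumptions made for the example, and working from group means rather than individual records is a simplification that holds only because the index is linear in its components.

```python
def autistic_index(open_private: float, closed_public: float,
                   whole_test: bool = False) -> float:
    """Open-private minus closed-public, plus a constant.

    The constant is 5 for Part I, Part II, and Similarities, and 10 for
    the Object Sorting Test as a whole, as stated in the table note.
    """
    constant = 10 if whole_test else 5
    return open_private - closed_public + constant


# Group means (open-private, closed-public) as printed in Table 1.
part_i = autistic_index(2.3, 2.1)
part_ii = autistic_index(4.0, 2.3)
whole = autistic_index(6.4, 4.4, whole_test=True)
similarities = autistic_index(3.1, 3.6)

for label, value in [("Part I", part_i), ("Part II", part_ii),
                     ("Whole", whole), ("Similarities", similarities)]:
    print(f"{label}: {value:.1f}")
```

The values for Part I, Part II, and the whole test reproduce the tabled means of the index; the Similarities value differs from the tabled mean by 0.1, which is consistent with rounding of the component means.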

Age was significantly related to several of 
the conceptual area measures: closed-public 
(r = .30), open (r = -.33), and the autistic 
index (r = -.31) on Part I of the Object 
Sorting Test; closed-public (r = .30) on the 
Object Sorting Test as a whole; and open- 
public (r = -.33) and open (r = -.30) on 
Similarities. In general, with increasing age 
the patients tended to have fewer concepts 
based on a single common attribute. These re- 
sults differ from those obtained by Iscoe and 
Giller (1959), where the primary effect of age 
appeared to be a decrease in the number of 
concepts based on principles that are freely 
communicated in our culture. The findings of 
the two studies do agree to the extent that 
in both the effects of age were found to be 
deleterious, i.e., the conceptual behavior of 
the older retardates became more “concrete” 
on either the public-private or open-closed 
dimension. Unfortunately, no comparable 
data are yet available on age changes in the 
conceptual area measures for normals. 

The only conceptual area measure to show 
a significant sex difference was the closed- 
private measure on the Object Sorting Test as 
a whole (r = -.30, t = 2.22). In view of 
the number of relationships tested, and in the 
absence of a ready explanation for this find- 
ing, it appears best to postpone any attempt 
at interpretation until such time as further 
research may confirm that this was not a 
chance effect. 

It is noteworthy that even within the rela- 
tively narrow range of IQs represented in the 
sample (43-69), a number of the conceptual 
area measures were significantly related to 
intelligence: open-private (r = .29) on Part I 
of the Object Sorting Test; open-public 
(r = .45) and public (r = .38) on Part II; 
open-public (r = .37) and closed-private 
(r = .32) on the Object Sorting Test as a 
whole; and open-public (r = .37) and public 
(r = .36) on Similarities. The only one of 
these that is not consistent with the results 
of McGaughran and Moran (1956, 1957) 
is the positive relationship with the open- 
private measure. The remaining results agree 
in indicating that the less retarded patients 
tended to have more concepts based on a 
single common attribute, and more concepts 
based on principles that are freely com- 
municated in our culture. 

Length of hospitalization was significantly 
related to the closed-public measure on 
Part II of the Object Sorting Test (r = .31) 
and on the Object Sorting Test as a 
whole (r = .28); and to the open measure 
(r = -.29) on Similarities. Since there was 
a highly significant relationship between age 
and length of hospitalization (r = .62), an 
attempt was made to assess the contribution 
of each of these variables, independent of the 
other. Partial correlations were calculated in 
the case of the seven conceptual area measures 
that were significantly related to age or 
length of hospitalization, or both. With a 
single exception (the closed-public measure 
on Part II of the Object Sorting Test), age 
was found to be more closely related to the 
conceptual area measures than was length 
of hospitalization, when the effects of the 
other variable were partialed out. 
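
The partialing step can be illustrated with the standard first-order partial correlation formula. The Python sketch below uses the three zero-order correlations reported for the closed-public measure on the Object Sorting Test as a whole (.30 with age, .28 with length of hospitalization) and the correlation between age and length of hospitalization (.62, assumed positive). The function name is hypothetical, and the results are an illustration computed from rounded coefficients, not values reported by the authors.

```python
import math


def partial_r(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return numerator / denominator


# Zero-order correlations taken from the text (closed-public, whole test).
R_AGE_MEASURE = 0.30      # age with closed-public
R_LENGTH_MEASURE = 0.28   # length of hospitalization with closed-public
R_AGE_LENGTH = 0.62       # age with length of hospitalization (sign assumed)

# Age with the measure, length of hospitalization partialed out.
age_partial = partial_r(R_AGE_MEASURE, R_AGE_LENGTH, R_LENGTH_MEASURE)

# Length of hospitalization with the measure, age partialed out.
length_partial = partial_r(R_LENGTH_MEASURE, R_AGE_LENGTH, R_AGE_MEASURE)

print(f"age, length held constant:  {age_partial:.3f}")
print(f"length, age held constant:  {length_partial:.3f}")
```

For this measure the partial correlation for age comes out larger than that for length of hospitalization, in line with the pattern described above, although both shrink considerably once the other variable is held constant.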

Diagnosis proved not to be significantly 
related to any of the conceptual area meas- 
ures. While somewhat disappointing, this 
negative finding was not surprising, for the 
literature contains many studies comparing 
exogenous and endogenous, brain damaged 
and familial, organic and nonorganic re- 
tardates, in which positive findings are rela- 
tively few in number, and inconsistent from 
study to study. 


Consistency of Performance 


The second question with which this study 
was concerned was that of the consistency 
shown by the retarded in their performance 
on different conceptual tests. Moran, Mc- 
Gaughran, and Leventhal (1957) adminis- 
tered the Object Sorting Test, along with 
specially devised similarities and synonyms 
tests, to several normal and clinical groups, 
and obtained significant relationships between 
corresponding measures on the three tests. 
The authors concluded that intraindividual 
consistency was sufficient to merit the use of 
the term “conceptual trait.” Again, Leventhal, 
McGaughran, and Moran (1959) reported 
closely equivalent results in comparing schizo- 
phrenics and brain damaged individuals on 
the Object Sorting Test and a similarities test. 

Actually, the consistency question arises 
within the Object Sorting Test itself, although 
it has not previously been studied in this 
context by means of conceptual area analysis. 
Rapaport et al. (1945), who used their own 
analytic schema, consistently distinguished 
between active and passive concept forma- 
tion, and placed some emphasis on the diag- 
nostic significance of differences in perform- 
ance on Parts I and II of the test. Silverstein 
(1960a) performed a cluster analysis of a 
portion of Rapaport’s data, and concluded 
that the distinction between active and pas- 
sive concept formation was quite justified, 
since two of the three clusters yielded by the 
analysis corresponded closely to the two parts 
of the test. 

In a second study based on the same data, 
Silverstein (1960b) found that IQ was not 
significantly related to the measures on Part I, 
but that it had a significant positive rela- 
tionship with the number of abstract defini- 
tions, and a significant negative relationship 
with the number of concrete definitions on 
Part II. Accordingly, measures on the two 
parts of the test have been treated separately 
throughout the present paper, although meas- 
ures on the test as a whole have also been 
reported for purposes of comparison with the 
results of earlier studies. 

Table 2 gives the correlations between cor- 
responding conceptual area measures on the 
various tests. It can be seen that with the 


TABLE 2 


CORRELATIONS BETWEEN CORRESPONDING CONCEPTUAL 
AREA MEASURES ON VARIOUS TESTS 


Part I Part I Part II Whole 
Measure versus versus versus versus 
Part II |Similarities |Similarities | Similarities 
CIS qub. 28 AS 4e ASHE 
Closed-public 15394: 16 Ant 34. 
Open-private .04 .05 Aper Ape 
Closed-private | —.14 446 .33* .39* 
Public 16 .04 E od 369. 
Open 21 .20 Agree Ed 
Autistic index | .34* 40 AGP 3o 


Note. Part I, Part II, and Whole refer to the Object 
Sorting Test. 


exception of the closed-public measure and 
the autistic index, corresponding measures on 
the two parts of the Object Sorting Test were 
not significantly related. This finding is con- 
sistent with Silverstein's previous results, 
based on a very different sample (members of 
the Kansas Highway Patrol) and a different 
analytic schema, and supports the notion that 
in general, Parts I and II of the test are 
not measuring the same thing. 

Additional evidence on the consistency 
question is provided by the correlations be- 
tween corresponding measures on the Object 
Sorting Test and Similarities. None of the 
measures on Part I was significantly related 
to its corresponding measure on Similarities, 
whereas each measure on Part II was sig- 
nificantly related to its corresponding Simi- 
larities measure. The measures on the Object 
Sorting Test as a whole were also significantly 
related to their corresponding Similarities 
measures, but these relationships tended to 
be slightly lower than those for the measures 
on Part II alone. To sum up, the findings offer 
some support for the consistency of retardates 
in their performance on different conceptual 
tests, but suggest that measures on the two 
parts of the Object Sorting Test should 
probably not be combined. 


Relationships among Measures 


The third question which this study sought 
to answer was that of the relationships among 
the conceptual area measures. McGaughran 
and Moran (1957) have stated that a rela- 
tively high negative relationship between the 
closed-public and open-private measures has 
been found consistently. They cited correla- 
tions of -.48 and -.63 for brain damaged 
individuals and schizophrenics, respectively, 
as examples of this relationship. In the present 
study, corresponding values ranged from -.50 
to -.57 for the various tests, all highly 
significant. 

What McGaughran and Moran failed to 
observe, however, is that by the very nature 
of conceptual area analysis, the relationships 
between any two area measures will tend to 
be negative. DuBois (1957) has shown that 
the mean intercorrelation in a closed system 
of k measures, i.e., one in which the sum of 
the k measures for each subject is a constant, 
will approximate -1/(k - 1). In the case of 
conceptual area analysis, since there are four 
areas, the expected mean intercorrelation is 
-1/(4 - 1) or -.33. The value obtained in 
the present study was -.32 for each test. 
The point is that relationships between area 
measures are difficult, if not impossible, to 
interpret because of the statistical properties 
of closed systems. 
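
The closed-system property can be demonstrated with a small simulation. The Python sketch below assigns each of 19 items at random to one of four areas for a large number of simulated subjects, so that every subject's four area scores sum to 19, and then averages the six pairwise correlations. The equal-probability allocation, the sample size, the seed, and all function names are assumptions made for illustration; nothing here reproduces the study's data.

```python
import math
import random
from itertools import combinations


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


def simulate_closed_system(n_subjects, n_items, k, rng):
    """Each subject's k area scores are counts that sum to n_items."""
    rows = []
    for _ in range(n_subjects):
        counts = [0] * k
        for _ in range(n_items):
            counts[rng.randrange(k)] += 1   # item falls in a random area
        rows.append(counts)
    return rows


def mean_intercorrelation(rows):
    """Average of all pairwise correlations among the k measures."""
    columns = list(zip(*rows))
    pairs = combinations(range(len(columns)), 2)
    rs = [pearson_r(columns[i], columns[j]) for i, j in pairs]
    return sum(rs) / len(rs)


K = 4                              # four conceptual areas
rng = random.Random(0)             # fixed seed so the run is repeatable
rows = simulate_closed_system(2000, 19, K, rng)
observed = mean_intercorrelation(rows)
expected = -1 / (K - 1)

print(f"observed mean intercorrelation: {observed:.3f}")
print(f"expected -1/(k - 1):            {expected:.3f}")
```

Even though the simulated subjects have no conceptual preferences at all, the mean intercorrelation comes out close to -.33, which is the reason negative correlations between area measures cannot by themselves be given a psychological interpretation.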

A second aspect of the question of the 
relationships among the conceptual area 
measures is that of the relationship between 
the public-private and open-closed dimensions. 
Moran et al. (1957), and Leventhal et al. 
(1959) have reported that the public and 
open measures appear to vary independently; 
the relationships between them did not ap- 
proach significance for either normal or clin- 
ical groups. In the present study, the rela- 
tionships between these measures were all 
negative, and were significant for Part I of 
the Object Sorting Test (r = -.37) and 
for the Object Sorting Test as a whole 
(r = -.33). The reason for this discrepancy 
with previous findings is not at all clear, but 
for the retarded, at least, the basic dimensions 
of conceptual area analysis do not appear to 
be independent. 


Comparison of Groups 


The fourth question with which this study 
dealt was that of the similarities and dif- 
ferences between the conceptual behavior of 
the retarded and that of other groups. In 
this connection, it is instructive to consider 
a graphic presentation of the Object Sorting 
Test performance of the various groups that 
have been studied by means of conceptual 
area analysis. Figure 1 provides such a graph, 
the public-private and open-closed dimensions 
being shown as orthogonal axes on which the 
mean scores of the different groups have 
been plotted. The brain damaged individuals 
occupy a position in the closed-public area; 
normal subjects lie in the open-public area; 
schizophrenics are in one of the private areas, 
although there is some question as to whether 
their conceptual behavior is predominantly 
open or closed⁴; and the retardates of the 
present study fall in the closed-private area— 
their behavior is doubly “concrete.” While 
some of the plotted points lie very close to 
the axes, it is nevertheless remarkable that 
conceptual area analysis should differentiate 
these groups as clearly as it does. 


4 The discrepancy in the performance of the two 
schizophrenic groups was noted by Leventhal, 
McGaughran, and Moran (1959) who attributed it 
to a greater degree of chronicity for the group we 
have designated as Schizophrenic II. 


THE ACCURACY OF PARENTAL RECALL OF ASPECTS 
OF CHILD DEVELOPMENT AND OF CHILD 
REARING PRACTICES ¹ 


LILLIAN CUKIER ROBBINS ² 
New York University 


Retrospective accounts of child rearing obtained from parents of 3-year-olds 
were compared with reports they previously gave in the course of a 
longitudinal study begun with the birth of the child. The parents were quite 
inaccurate in their memory of details about child rearing practices and early 
development progress, in spite of the frequent rehearsal of these data due to 
their participation in the longitudinal study, and in spite of the relatively 
young age of the children. Inaccuracies were greatest for items dealing with 
the age of weaning and toilet training, the occurrence of thumbsucking, and 
demand feeding. Inaccuracies tended to be in the direction of the recommenda- 
tions of experts in child rearing, especially on the part of the mothers. In 
general, mothers recalled more correctly than fathers. 


The accuracy with which parents can re- 
call and report, after a period of years, de- 
tails of the early functioning of their children 
and of their own child care practices is a sig- 
nificant issue in view of the large number of 
investigations which have used retrospective 
parental reports as their primary data. Tests 
of theories of personality development, and 
discussions of similarities and differences in 
the child rearing patterns of various class and 
color groups (see, e.g., Davis & Havighurst, 
1946; Miller & Swanson, 1958; Sears, Mac- 
coby, & Levin, 1957), have been based on 
such data, often without sufficient regard for 
possible factual errors. 

Only three studies have so far attempted 
to evaluate the extent to which retrospective 
reports of developmental data are accurate. 
The first two (McGraw & Molloy, 1941; 


1 This paper is based on a PhD dissertation sub- 
mitted to the Graduate School of Arts and Science 
at New York University. The author wishes to thank 
Elsa E. Robinson and M. Brewster Smith for their 
valuable suggestions and helpful editorial criticism, 
Sidney Weinstein for his advice on statistical design, 
and Edwin S. Robbins for his invaluable assistance. 
The author is also grateful to Stella Chess and Alex- 
ander Thomas for making the data and sample of 
their longitudinal study of child development avail- 
able. The research was supported in part by the 
Margaret E. Maltby Fellowship of the American As- 
sociation of University Women. 

2 Now USPHS Post-Doctoral Research Fellow, De- 
partment of Psychiatry, New York University Medi- 
cal Center. 


Pyles, Stolz, & Macfarlane, 1935) demon- 
strated that inaccuracy is frequent, by com- 
paring mothers’ recollections with the obser- 
vations of experts. Since there is no way of 
knowing which mothers, if any, had ever 
been aware of the highly specific material 
they were asked to remember, it is impossible 
to determine whether discrepancies were due 
to errors in the mothers’ original perceptions, 
to differences in the frame of reference of ex- 
perts and mothers, or to errors in recall. A 
more recent longitudinal study (Haggard, 
Brekstad, & Skard, 1960) has examined the 
reliability of maternal memory, as well as its 
validity. Maternal reports and independent 
observations were compared with the recollec- 
tions of mothers as much as 8 years later. For 
items dealing with such attitudinal questions 
as anxiety regarding childbirth and strength 
of wish for a given sex, reliability was lower 
than for the more factual questions, such as 
the birth weight. Where validity was ascer- 
tainable, R ranged from .71 to .89. 

The present study endeavors to examine ac- 
curacy of recall by comparing retrospective 
accounts of child rearing obtained from par- 
ents of 3-year-olds with reports they previ- 
ously gave in the course of a longitudinal 
study begun with the birth of the child 
(Thomas & Chess, 1957). It differs from 
other appraisals of parental accuracy by fo- 
cusing on objective, nonattitudinal, yet long- 
term aspects of child development, such as 
the onset of toilet training and the duration 
of weaning—items which form the basis for 
most descriptive and comparative studies of 
child rearing patterns. Since it is not feasible 
to have an observer live with families for pro- 
longed periods of time, there can only be as- 
sessment of the consistency of parental re- 
ports for material of this sort, and not of its 
validity. 

The sample used is particularly favorable 
for maximizing the accuracy of recall since 
continued participation in the longitudinal 
study ensured parents’ original awareness of 
the information sought. In addition, this is a 
middle-class group, whose practices are pre- 
dominantly child centered, with the mothers 
accepting care of the baby as their prime re- 
sponsibility, so that it can be presumed that 
maximum attention has been paid to details 
of the child’s functioning. 

This report will deal with two aspects of 
parental recall: the accuracy of retrospective 
parental reports,3 and the relationship be- 
tween errors in retrospective parental reports 
and recommendations made by authorities in 
the child rearing field. 


METHOD 
Subjects 


The broader longitudinal study, in progress since 
March 1956, has been mainly directed at exploring 
the relationship between initial patterns of reactivity 
in infants, and subsequent personality development 
(Chess, Thomas, & Birch, 1959; Chess, Thomas, 
Birch, & Hertzig, 1960; Thomas, Birch, Chess, & 
Robbins, 1961; Thomas & Chess, 1957; Thomas, 
Chess, Birch, & Hertzig, 1960). Parents have been 
seen at frequent intervals and their participation 
in the study has been extensive. Interviews have 
been held, usually with both mother and father 
present, every 3 months during the child’s first 
year, and at 6-month intervals thereafter. Questions 


3 While it could be argued that only the consist- 
ency of retrospective reports is being measured when 
parental reports at two points in time are compared, 
it should be borne in mind that the original data 
were gathered close to the time of occurrence of 
events and are objective and noninterpretative in 
content. In addition, independent observations of 
children in this study showed that concurrent pa- 
rental descriptions are valid (Thomas, Chess, Birch, 
& Hertzig, 1960). As a result, agreement between pa- 
rental recall and original parental descriptions of the 
same events and behaviors can be held to represent 
accuracy rather than mere consistency of report. 


were designed to elicit objective and specific descrip- 
tion of the child's behavior during the normal rou- 
tines of daily living, with items of a type which par- 
ents can usually answer easily and without defensive- 
ness. Parents were used as the observers since their 
intimate contact with the child serves to make them 
the best reporters of consistent patterns and long- 
range activities. Other procedures have included in- 
dependent observations of the children in order to 
check the validity of concurrent parental reports, the 
Stanford-Binet at 3 years, and direct observations in 
nursery school. 

When the retrospective interviews were conducted, 
in the spring of 1959, 49 children of the families un- 
der study were approximately 3 years old. Recurrent 
illness and a recent divorce made scheduling impos- 
sible in two cases, so that only 47 families could be 
interviewed. Six fathers were not seen: four were no 
longer living with their wives, who had custody of 
the child; no appointment could be made with an- 
other because of the pressures of his business; and 
one family had moved to the midwest so that only 
the mother could be interviewed on one of her visits 
to New York. 

In order to make this study more directly com- 
parable to others, the data for three nonwhite fami- 
lies, among them one of the divorced couples, have 
not been included. As a result, the sample consists of 
44 mothers and 39 fathers. All but five of the moth- 
ers have attended college, and many have advanced 
degrees in medicine, law, or psychology. The ma- 
jority of the men work in a professional or execu- 
tive capacity, and, with the exception of five, have 
gone to college. Most hold advanced degrees. The 
group is predominantly Jewish and is located in the 
Greater New York area. Its socioeconomic status is 
uniformly high, with the two men who have had no 
schooling beyond high school at the upper extreme. 
The median age of the children at the time of inter- 
view was 37.0 months, with a range of 20-45 months. 


Procedure 


Appointments were arranged through the project 
secretary, who had a long-standing acquaintance with 
the parents in the study. On the night of the inter- 
view, the author and her husband, a psychiatrist, 
arrived at the parents' home and introduced them- 
selves. Arrangements were made so that interviews 
could simultaneously be held in two different rooms. 
This technique made possible the assessment of in- 
dividual attitudes and recall in a way that joint in- 
terviews with both parents could not achieve. At the 
same time, it eliminated the contamination which 
might ensue if parents were interviewed one after 
the other. 

On the basis of a prearranged scheme, each inter- 
viewer spoke to a father on one visit and to a mother 
on the next. Tape recorders were used to ensure a 
complete record of the interviews and to eliminate 
the need for writing lengthy verbatim notes as well 
as the danger of selective recording. The interviewers 
possessed no information beyond the name and age 
TABLE 1 
DESCRIPTION OF THE 19 MEMORY ITEMS 

Area and item                              Quant.  Qual.  Recommendation 

Feeding 
  C3.  Schedule vs. demand?                          x     demand 
  C4.  Breast vs. bottle?                            x     breast 
  C4b. If breast—how long?                   x             7-10 months 
  C5.  Cereal—when introduced?               x 
Weaning 
  D1.  Cup—when introduced?                  x             5 months 
  D2a. Bottle—when stop?                     x             by 24 months 
  D2b. If stop—type of transition?                   x 
Sucking 
  D3.  Ever suck thumb?                              x     opposes 
  D4.  Ever suck pacifier?                           x     favors 
Toilet training 
  F1a. Bowel training—when begun?            x             18-24 months 
  F1a. Bladder training—when begun?          x             18-24 months 
Sleeping 
  G5a. 2 a.m. feeding—when stopped?          x 
  G5b. 10 p.m. feeding—when stopped?         x 
  G5c. How well sleep at 6 months?                   x 
Motor development 
  I1.  When first stood alone?               x 
  I1a. When first walked alone?              x 
Medical history 
  L2.  First injection—when?                 x 
  L4.  Birth weight?                         x 
  L4a. Birth length?                         x 

Note.—Key: Quant. = Quantitative item; Qual. = Qualitative item. Recommendation = Advice of Spock (1957) regarding 
parental practices. Spock has been used as the authority, since all the parents in this sample were familiar with his writings. 
x signifies the presence of the category heading a given column. 


of the child, and no previous attempt had been made 
to collect retrospective information. 

The interviews, based on a specially designed in- 
terview guide, were primarily concerned with obtain- 
ing descriptions of parental behavior from which 
attitudes could be inferred. The responses to the 
Memory items, which took but a small fraction of 
the time spent in talking with the parents, were writ- 
ten in the course of the interview so as to expedite 
analysis of the present data. Table 1 cites the 19 
Memory questions that will be discussed in this re- 
port. 

Immediately after the interview, which lasted any- 
where from 1.5 to 3 hours, each parent was asked to 
fill out a modified version of the Schaefer and Bell 
(1957 unpublished, 1958) Parental Attitude Research 
Instrument. The scale was administered in order to 
obtain an objective and rapid measure of parental 
attitudes, which could be compared with the ratings 
based on the more extensive focused interviews. 

While the parents were thus occupied, each inter- 
viewer had the opportunity to write a summary de- 
scribing his impressions of the parent’s personality 
and attitudes. 

4 The interview guide and all measures pertaining 
to recall are described and presented in Robbins 
(1961). An analysis of the attitudinal material is 
currently under way. 
Scoring 


After editing the retrospective records for am- 
biguities, the interview responses were tabulated in 
accordance with the following conventions: 

1. Ages were converted to weeks and weight to 
ounces, so as to obviate fractions. 

2. Ranges were transformed into means. For ex- 
ample, if a parent said that something had happened 
between 3 and 4 months, his response was scored as 
“15 weeks.” 

3. Guesses were counted as actual responses but 
were set off by parentheses, so as to make possible, 
eventually, an analysis of the relationship between 
certainty and accuracy of recall. 
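
The first two conventions above can be sketched as a pair of small conversion helpers. This is only an illustration: the paper does not state the conversion factor or the rounding rule, so the factor of 52/12 weeks per month, the rounding to the nearest whole week, and the function names are all assumptions. Under those assumptions a range of 3 to 4 months comes out as 15 weeks, the value given in the text.

```python
# Assumed conversion factor; the paper does not say which one was used.
WEEKS_PER_MONTH = 52 / 12


def months_to_weeks(months: float) -> int:
    """Convert an age in months to whole weeks (nearest week, by assumption)."""
    return round(months * WEEKS_PER_MONTH)


def range_to_weeks(low_months: float, high_months: float) -> int:
    """Convention 2: score a reported range as its mean, expressed in weeks."""
    return months_to_weeks((low_months + high_months) / 2)


def weight_to_ounces(pounds: int, ounces: int = 0) -> int:
    """Convention 1 for weights: express pounds and ounces in ounces only."""
    return pounds * 16 + ounces


# "Between 3 and 4 months" is scored as 15 weeks.
print(range_to_weeks(3, 4))
# A hypothetical birth weight of 6 lb. 13 oz. is scored as 109 ounces.
print(weight_to_ounces(6, 13))
```

Whole weeks and whole ounces keep every score an integer, which is the stated purpose of the first convention.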

For most of the items (e.g., birth weight, breast 
or bottle fed, age of onset of walking) coding proved 
relatively easy. The parents had been asked for spe- 
cific information and by and large were able to give 
answers definite enough that no judgment was re- 
quired in evaluating responses. Accordingly, these 
items were scored by the author alone. 

There were only three questions where coding was 
more complex because the parental descriptions re- 
quired interpretation. These were the items referring 
to feeding on schedule or demand, ease of weaning, 
and the sleeping pattern at 6 months. The responses 
to each of these items were entered on a worksheet 
and coded independently by the two interviewers.5 
The codings were then compared. Where disagree- 
ments occurred, the two coders re-evaluated the item 
independently, without reviewing the scores previ- 
ously assigned. If the new scores were in agreement, 
this was the code entered. If not, both coders dis- 
cussed the response until consensus was reached. Re- 
liability of the first coding was computed by Cohen’s 
(1960) κ and the degree of agreement was high, 
ranging from .78 to .96. 
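
As an illustration of the agreement coefficient named above, the following is a minimal sketch of Cohen's κ for two coders who assign one nominal category to each response. The category labels and the ten codings are hypothetical and are not taken from the study's worksheets. Returning 1.0 when both coders use a single identical category throughout is an assumption made here to avoid division by zero.

```python
from collections import Counter


def cohens_kappa(coder_a, coder_b):
    """Cohen's kappa: chance-corrected agreement between two coders."""
    if len(coder_a) != len(coder_b) or not coder_a:
        raise ValueError("both coders must code the same non-empty set of responses")
    n = len(coder_a)
    # Observed proportion of responses on which the coders agree.
    observed = sum(a == b for a, b in zip(coder_a, coder_b)) / n
    # Agreement expected by chance, from each coder's marginal frequencies.
    freq_a, freq_b = Counter(coder_a), Counter(coder_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1:
        return 1.0  # assumption: one shared category throughout counts as perfect agreement
    return (observed - expected) / (1 - expected)


# Hypothetical codings of ten responses to the schedule vs. demand item.
first_coder = ["demand"] * 5 + ["schedule"] * 5
second_coder = ["demand"] * 4 + ["schedule"] * 6
kappa = cohens_kappa(first_coder, second_coder)
print(round(kappa, 2))
```

In this made-up case the coders agree on 9 of 10 responses, chance agreement is .50, and κ is therefore .80.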

When coding of the retrospective data was finished, 
the original longitudinal records were consulted for 
the first time. The complete folder on each child 
was read. Precautions were taken to prevent con- 
tamination by masking the retrospective responses 
while the information from the longitudinal his- 
tories was entered. 

In tabulating the longitudinal material, the fol- 
lowing conventions were added to the previous set: 

1. In the rare instances when the exact time at 
which a given behavior had begun or stopped was 
not specified, the midpoint between the date of the 
interview at which its presence or cessation was 
first noted, and that of the preceding interview, 
was tabulated. For example, if a child was described 
as not yet standing at 38 weeks, and as standing by 
the next interview, at 52 weeks, with no specific date 
of onset mentioned, the item was scored as “45 
weeks.” 

2. When the longitudinal data for a specific item 
were vague or incomplete, the entry ODM (Original 
Data Missing) was made. This occurred typically 
when, as a result of error, some information was 
either not sought or not entered in the longitudinal 
histories. 
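
The two longitudinal conventions above can be sketched as one helper that returns either a midpoint in weeks or the ODM entry. The function name, the use of `None` for an interview age that was not recorded, and the rounding of half weeks upward are assumptions made for illustration; the paper gives only the 38 and 52 week example.

```python
ODM = "ODM"  # Original Data Missing


def onset_from_interviews(previous_weeks, noted_weeks):
    """Midpoint between the interview at which a behavior was first noted
    and the preceding interview, in whole weeks; ODM if either is missing."""
    if previous_weeks is None or noted_weeks is None:
        return ODM
    if noted_weeks < previous_weeks:
        raise ValueError("the noting interview cannot precede the previous one")
    # Integer midpoint; half weeks are rounded upward (an assumption).
    return (previous_weeks + noted_weeks + 1) // 2


# Not yet standing at 38 weeks, standing by 52 weeks: scored as 45 weeks.
print(onset_from_interviews(38, 52))
# Hypothetical case in which the earlier record is missing.
print(onset_from_interviews(None, 52))
```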

Coding was done in the same manner as before, 
with κ ranging from .84 to 1.00. 

Once the data had been coded, two master charts, 
one for mothers and one for fathers, were set up to 
permit comparison of the retrospective and longi- 
tudinal material. Qualitative items, such as whether 
the child had been fed on schedule or demand, were 
scored as “correct” if the original and retrospective 
responses were identical, and “incorrect” if there 
was any difference between the two. Quantitative 
items, such as when the cup had been introduced, 
were scored in terms of the discrepancy between 
the retrospective and original statements. A positive 
discrepancy score (+) indicates that the parent 
recalled the event as having occurred later than it 
really did; a negative discrepancy score (−), that 
the parent dated the event earlier than was actually 
the case. 
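
The scoring rules just described can be sketched as two functions. The names are hypothetical and the ages in the usage lines are invented, not taken from the records. The sign convention follows the text: the discrepancy is the retrospective value minus the original value, so a positive score means the event was recalled as later than it was originally reported.

```python
def score_qualitative(original: str, recalled: str) -> str:
    """Qualitative items: identical responses are correct, anything else incorrect."""
    return "correct" if original == recalled else "incorrect"


def score_quantitative(original_weeks: int, recalled_weeks: int) -> int:
    """Quantitative items: discrepancy in weeks (retrospective minus original)."""
    return recalled_weeks - original_weeks


# Hypothetical record: cup introduced at 30 weeks, recalled as 35 weeks.
print(score_quantitative(30, 35))   # positive: recalled as later
# Hypothetical record: bottle stopped at 104 weeks, recalled as 78 weeks.
print(score_quantitative(104, 78))  # negative: recalled as earlier
print(score_qualitative("schedule", "demand"))
```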


Statistical Treatment 


The question concerned with the accuracy of 
parental recall was evaluated by comparing retro- 
spective responses with the original parental state- 
ments. For the quantitative items, t tests of the sig- 
nificance of differences between correlated means 


5 Criteria are presented in Robbins (1961, Ap- 
pendix B). 

were computed. For the qualitative items, Mc- 
Nemar’s (1949) test of the significance of changes 
was employed. Mothers and fathers were treated 
separately in order to compare their relative ac- 
curacy, and to facilitate comparison with other 
studies where mothers only have been questioned. 
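
Both tests named in this section can be sketched in a few lines of standard-library Python. This is an illustration under stated assumptions rather than the author's computation: the paired ages are invented, and the paper does not say whether a continuity correction was applied to the test of changes, so the uncorrected chi-square with one degree of freedom is assumed. The two change counts used below are the ones reported in the note to Table 2: seven mothers denied thumbsucking and none added it, while on the sleeping item four mothers erred in one direction and three in the other.

```python
import math
from statistics import mean, stdev


def correlated_t(original, recalled):
    """t test for the difference between correlated (paired) means.
    Returns (t, degrees of freedom); needs at least two pairs whose
    differences are not all equal."""
    diffs = [r - o for o, r in zip(original, recalled)]
    n = len(diffs)
    standard_error = stdev(diffs) / math.sqrt(n)
    return mean(diffs) / standard_error, n - 1


def mcnemar_change(b, c):
    """Test of the significance of changes for a 2 x 2 table, where b and c
    count the respondents who changed in each direction (uncorrected form,
    by assumption). Returns (chi-square, two-sided p) with 1 df."""
    if b + c == 0:
        raise ValueError("no changed responses to test")
    chi_square = (b - c) ** 2 / (b + c)
    # Upper tail of chi-square with 1 df, via the complementary error function.
    p = math.erfc(math.sqrt(chi_square / 2))
    return chi_square, p


# Invented ages in weeks for five parents (original vs. retrospective).
t, df = correlated_t([10, 10, 10, 10, 10], [11, 12, 13, 14, 15])
print(round(t, 2), df)

# Thumbsucking, mothers: 7 denials, 0 erroneous additions.
print(mcnemar_change(7, 0))
# Sleep at 6 months, mothers: 4 errors one way, 3 the other.
print(mcnemar_change(4, 3))
```

With changes running 7 to 0 the statistic is 7.0, whereas with changes of 4 and 3 it is about 0.14. This shows why significance here depends on the direction of the errors rather than on the percentage of accurate responses.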

The effect of experts’ recommendations on paren- 
tal recall was evaluated by comparing the kinds of 
errors made with the advice of authorities. For 
example, if a mother who had originally described 
feeding on schedule stated in the retrospective in- 
terview that she had fed her baby on demand, this 
was scored as an error in the direction of the experts’ 
recommendations. Spock (1957) was used as the 
basis of comparison since all the parents in the 
present group were more or less familiar with his 
work. However, his recommendations are so similar 
to those in other common sources of child rearing 
advice, such as Infant Care (United States Depart- 
ment of Health, Education, and Welfare, 1955), and 
women’s magazines, that it was felt that parental 
agreement with experts in general, rather than with 
Spock alone, was being tested. 
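
The comparison of errors with expert advice can be sketched as a small classifier. The rank table, function names, and response pairs below are hypothetical. The pairs are constructed only to match the split reported later for the feeding item (20 inaccurate mothers, 65% of them shifting toward demand feeding) and are not the study's records.

```python
from collections import Counter

# Assumed ranking of responses by closeness to the recommendation;
# Spock (1957) is taken in the paper to favor demand feeding.
FEEDING_RANK = {"schedule": 0, "demand": 1}


def error_direction(original, recalled, rank):
    """Classify one retrospective answer against the original report."""
    if original == recalled:
        return "accurate"
    if rank[recalled] > rank[original]:
        return "toward recommendation"
    if rank[recalled] < rank[original]:
        return "away from recommendation"
    return "neutral"


def tally(pairs, rank):
    """Count answers by direction for (original, recalled) pairs."""
    return Counter(error_direction(o, r, rank) for o, r in pairs)


# Illustrative pairs only: 13 shifts toward demand feeding, 7 away from it.
pairs = [("schedule", "demand")] * 13 + [("demand", "schedule")] * 7
counts = tally(pairs, FEEDING_RANK)
errors = counts["toward recommendation"] + counts["away from recommendation"]
percent_toward = 100 * counts["toward recommendation"] / errors
print(counts["toward recommendation"], counts["away from recommendation"], percent_toward)
```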


RESULTS 


Question 1. How Accurate Are Retrospective 
Parental Reports of Child Rearing Practices? 


In Figure 1, the percentage of parents 
whose reports were accurate, i.e., whose inter- 
view responses and original records were in 
exact agreement, is contrasted with the per- 
centage whose retrospective reports were off 
by less, or more, than a month, and with the 
percentage who stated they were unable to 
recall. For the qualitative items, the percent- 
age of parents who were correct is compared 
with the percentage giving incorrect or “don’t 
know” responses. It may be seen that certain 
items, such as whether initial feeding was by 
breast or bottle, and the birth weight, were 
well-remembered by the great majority of 
the parents, both mothers and fathers. Other 
items, such as the cessation of bottle feeding 
and the onset of bowel and bladder training, 
were less accurately recalled by either parent, 
while two items—the introduction of the cup 
and the date of the first injection—were not 
recalled correctly by any of the fathers. 
Table 2 further demonstrates that parental 
recall tends to be inaccurate. On 4 of the 13 
quantitative items—those dealing with the 
age of weaning, beginning of bowel and 
bladder training, and the stopping of the 2 
A.M. feeding—mean retrospective reports by 
both mothers and fathers differed significantly 

[Figure 1: bar chart, with separate panels for mothers and fathers, giving the percentage of respondents for each Memory item. For the quantitative items the bars are divided into accurate report, report deviating by 1 month or less, report deviating by more than 1 month, and don't know. For the qualitative items the bars are divided into accurate report, inaccurate report, and don't know. N does not include ODM (Original Data Missing) or DNA (Does Not Apply) responses.] 

FIG. 1. Accuracy of parental recall. 


from the original records. Fathers' reports 
were also significantly inaccurate on the items 
dealing with the introduction of cereal, the 
introduction of the cup, and the child's age 
when he first stood alone. For two of the 
six qualitative items—those dealing with feed- 
ing on schedule vs. demand, and thumb- 
sucking—retrospective reports by both par- 
ents differed significantly from the original 
records. 

For certain of the quantitative items, the 
mean discrepancy scores showed extreme dis- 
tortion in recall. For instance, with regard 
to the onset of bowel training, mothers erred 
by an average of + 14.2 weeks and fathers 
by an average of + 22.7 weeks, both recalling 
the event as occurring later than was origi- 
nally reported. As for bladder training, the 
average discrepancies were + 22.3 and + 25.5 
weeks, respectively. These discrepancies of 
3-6 months seem especially large when one 
considers that they are for behaviors that 
typically do not begin much before the end 
of the first year (Thomas et al., 1961), so 
that they are relative to a maximum time 
span of approximately 2 years. 


The age of first standing was recalled cor- 
rectly by a far smaller percentage of parents 
than the onset of walking. Furthermore, the 
mean discrepancy scores were opposite in 
sign, with both mothers and fathers tending 
to recall standing as having occurred later 
than it did and walking as having started 
earlier. The parents considerably shortened 
the time required for the progression from 
one stage to the next, with the fathers be- 
lieving that it took 14.8 weeks on the average 
and the mothers, 16.8 weeks. The average in- 
terval between standing and walking was 
actually 21.8 weeks. 

In the case of weaning, the same tendency 
to shorten the learning process was found, 
with the age of introducing the cup recalled 
as having been later than was really the 
case, while the age of completion of wean- 
ing was placed significantly earlier. The 
former discrepancy was significant for the 
fathers only (at the .05 level); the latter 
for fathers and mothers alike (at the .001 and 
.01 levels, respectively). 

With regard to bowel and bladder func- 
tions, the initiation of training was also re- 
TABLE 2 
ACCURACY OF PARENTAL RECALL 

                                                   Mothers                            Fathers 
Parental recall                          N (a)  Mean discrepancy (b)   t       N (a)  Mean discrepancy (b)   t 

Quantitative items 
C4b. If breast—how long?                  20      +.4                  <1       16      −.4                  <1 
C5.  Cereal—when introduced?              41      −.3 [?]              <1       29      +7.3                 3.09** 
D1.  Cup—when introduced?                 26      +1.1                 <1       21      +12.7                2.48* 
D2a. Bottle—when stop?                    25      −13.6                3.52**   19      −18.2                4.74*** 
F1a. Bowel training—when begun?           44      +14.2                3.49**   38      +22.7                4.92*** 
F1a. Bladder training—when begun?         42      +22.3                5.23***  36      +25.5                4.86*** 
G5a. 2 A.M. feeding—when stopped?         34      +2.8                 2.13*    25      +5.5                 2.45* 
G5b. 10 P.M. feeding—when stopped?        28      +2.3                 <1       16      −5.9                 1.23 
I1.  When first stood alone?              35      +1.9                 1.31     31      +9.1                 3.08** 
I1a. When first walked alone?             43      −1.4                 1.36     32      −1.4                 1.08 
L2.  First injection—when?                27      −1.6                 1.55     22      +4.0                 <1 
L4.  Birth weight?                        43      +.02 ounces          <1       37      −1.1 [?] ounces      <1 
L4a. Birth length?                        19      +.1 inch             1.56     10      +.2 inch             <1 

                                         N (a)  % Accurate  Significance (c)   N (a)  % Accurate  Significance (c) 

Qualitative items 
C3.  Schedule vs. demand?                 43      53          .05 (d)           38      37          .05 (d) 
C4.  Breast vs. bottle?                   44      95          —                 39      97          — 
D2b. If stop bottle—transition?           23      61          —                 19      74          — 
D3.  Ever suck thumb?                     44      84          .01 (c)           39      78          .05 (c) 
D4.  Ever suck pacifier?                  43      88          —                 38      87          — 
G5c. How well sleep at 6 months?          42      79          —                 38      66          — 

a Ns vary because of ODM (Original Data Missing), DK (Don't Know), or DNA (Does Not Apply) responses. 
b In weeks except Items L4 and L4a. 
c Significance depends upon the direction of errors, rather than the percentage of accurate responses. Thus, seven mothers denied 
thumbsucking and none erroneously added it in the retrospective reports, so that the responses on original and retrospective inter- 
views significantly differ from one another. For Item G5c, where the percentage accuracy is actually lower, change was not significant 
since four mothers said the child had slept well when he had slept poorly and three erred in the opposite direction. 
d Significantly more modified demand ... schedule than had occurred were mentioned in the retrospective interview than ... 



called as significantly later by both sets of 
parents. Since a majority of the children 
had not yet been fully trained at the time of 
the retrospective interviews, information con- 
cerning the date of completion was not sought. 
However, in those families where training was 
complete, the parents tended to describe its 
accomplishment as taking place virtually over- 
night, and to gloss over the lengthy process of 
acquisition. 

A larger percentage of both mothers and 
fathers were accurate in recalling the termina- 
tion of the 2 a.m. feeding, which interrupts 
sleep, than of the 10 p.m. feeding. However, 
only the mean discrepancies for the 2 a.m. 
feeding were significant, because they were 
predominantly in the same direction, that of 
interfering longer with parental rest. In con- 
trast, the discrepancies for the 10 p.m. feed- 
ing were random, with the fathers, who were 
less often involved, recalling its cessation as 


earlier, on the average. Finally, it should be 
noted that there was better recall of birth 
weight than birth length. The birth weight 
is considerably more important to the sur- 
vival of the child, and is one of the most fre- 
quently mentioned items in descriptions or 
discussions of newborn babies, while the birth 
length is rarely brought up. However, the 
comparatively few parents who did recall the 
birth length were likely to be accurate, so that 
the mean discrepancy score was negligible. 

As Bronfenbrenner (1958) anticipated, the 
recall of whether or not a particular practice 
was employed is somewhat more reliable than 
a parent's estimate of when the practice was 
begun or discontinued. However, even for the 
qualitative items, no fact was recalled ac- 
curately by all the parents, and there are two 
items with significant discrepancies. Both 
mothers and fathers reported a far greater 
incidence of demand feeding than had been 
described when their children were infants, 
and several stated that there had been no 
thumbsucking in instances where the evidence 
in the original records was undeniably to the 
contrary. 

Table 3 presents the median ages for all 
quantitative items and the percentages for 
all qualitative items, as derived independently 
from the original records and retrospective 
interviews. These data should be utilized in 
comparing the present results with those ob- 
tained by other investigators. 


Question 2. Is There any Relationship be- 
tween the Recommendations of Experts in 
Child Rearing and Retrospective Parental Re- 
ports of Their Own Practices? 


Although no cause and effect relationship 
can positively be established, the parallelism 
between experts’ advice and prevailing di- 
rection of distortion is suggestive. As is shown 
in Figure 2, mothers tended to be inaccurate 
in the direction of the recommendations on 
every one of the nine items for which sug- 
gestions were clearly made in the child 
rearing literature. For instance, of the 20 
mothers who gave inaccurate responses re- 
garding the mode of infant feeding they em- 
ployed, 65% shifted in the direction of more 
demand feeding and only 35% toward less. 
This discrepancy suggests that mothers were 
desirous of appearing to have fed on demand 
even when they had not actually engaged in 
the practice. The age of weaning was 
markedly reduced in the retrospective reports, 
while the onset of toilet training was re- 
called as later than was really the case. In 
both instances, the shifts paralleled the 
recommendations of Spock (1957; see Table 
1) regarding the ideal timing of these prac- 
tices. Finally, with regard to thumbsucking, 
of which Spock disapproves, and the pacifier, 
whose use he favors, errors again reflected the 
recommendations. All seven of the mothers 
who were inaccurate in their reports of 
thumbsucking denied that their child had 
ever sucked his thumb. The original records 
showed not only that thumbsucking had oc- 


TABLE 3 

MEDIAN VALUES AND PERCENTAGES GIVEN IN THE ORIGINAL AND RETROSPECTIVE INTERVIEWS FOR 
THE 19 MEMORY ITEMS; NUMBER OF “DON’T KNOW” RESPONSES GIVEN IN THE 
RETROSPECTIVE INTERVIEWS 

                                                        Mothers                                          Fathers 
                                          Original (a)       Retrospective               Original (a)       Retrospective 
Parental recall                          N (b)  Median (c)   N     Median        DK     N (b)  Median (c)   N     Median        DK 

Quantitative items 
C4b. If breast—how long?                  18     2.7          20    2.7           0      16     2.9          16    2.8           1 
C5.  Cereal—when introduced?              43     2.0          42    2.0           2      38     1.8          29    3.0           10 
D1.  Cup—when introduced?                 30     8.0          39    6.0           4      26     8.0          29    10.6          10 
D2a. Bottle—when stopped? (d)             25     23.8         25    17.4          0      20     24.0         19    17.4          1 
F1a. Bowel training—when begun?           44     16.0         44    17.4          0      39     15.7         38    22.0          1 
F1a. Bladder training—when begun?         44     16.4         42    19.4          1      39     15.9         36    21.4          3 
G5a. 2 A.M. feeding—when stopped?         41     1.9          36    2.0           6      37     1.8          27    3.4           11 
G5b. 10 P.M. feeding—when stopped?        41     2.9          29    3.0           13     37     3.0          18    4.0           19 
I1.  When first stood alone?              4[?]   8.2          38    8.6           6      36     8.0          34    9.4           5 
I1a. When first walked alone?             44     12.8         44    12.5          0      39     12.7         33    11.9          6 
L2.  First injection—when?                33     3.0          37    2.8           7      30     3.3          26    3.4           13 
L4.  Birth weight?                        43     109.0 ounces 44    108.0 ounces  0      39     109.5 ounces 37    108.0 ounces  2 
L4a. Birth length?                        23     19.8 inches  33    20.2 inches   9      22     19.5 inches  22    19.4 inches   16 

                                         N (b)  %            N     %             DK     N (b)  %            N     %             DK 

Qualitative items (e) 
C3.  Schedule vs. demand?                 43     56           44    77            0      38     53           38    76            1 
C4.  Breast vs. bottle?                   44     41           44    46            0      39     41           39    44            0 
D2b. Transition easy?                     23     65           25    72            0      19     63           19    68            1 
D3.  Suck thumb? Yes.                     44     57           44    41            0      39     59           39    44            0 
D4.  Suck pacifier? Yes.                  43     42           44    50            0      38     42           39    41            0 
G5c. How well sleep at 6 months?          42     74           40    78            2      38     71           36    72            2 

a “Original” medians and percentages for fathers and mothers are not always identical because only the records of parents who 
participated in the retrospective interviews were included in these computations. 
b N varies from item to item because of differences in the number of ODM (Original Data Missing), DK (Don't Know), or 
DNA (Does Not Apply) responses. The total number of respondents equals N plus DK. 
c In months except Items L4 and L4a. 
d For those already weaned. 
e Percentages are given for the italicized categories, e.g., percentage fed on demand; percentage reported as sucking the thumb. 
[Figure 2: bar chart, with separate panels for mothers and fathers, covering the nine items on which a recommendation was made. For each item the bars show the percentage of respondents whose inaccuracy was in the direction of the recommendation and the percentage whose inaccuracy was in the direction opposite to the recommendation. The numbers printed beside the bars are listed here.] 

                                          N inaccurate 
Items                                      responses     Total N (a) 

MOTHERS 
C3.  Schedule vs. demand?                     20             43 
C4.  Breast vs. bottle?                        2             44 
C4b. If breast--how long?                     13             20 
D1.  Cup--when introduced?                    22             30 
D2a. Bottle--when stop?                       23             25 
D3.  Ever suck thumb?                          7             44 
D4.  Ever suck pacifier?                       5             43 
F1a. Bowel training--when begun?              40             44 
F1a. Bladder training--when begun?            39             43 

FATHERS 
C3.  Schedule vs. demand?                     23             38 
C4.  Breast vs. bottle?                        1             39 
C4b. If breast--how long?                     11             17 
D1.  Cup--when introduced?                    21             31 
D2a. Bottle--when stop?                       16             20 
D3.  Ever suck thumb?                          8             39 
D4.  Ever suck pacifier?                       5             38 
F1a. Bowel training--when begun?              37             39 
F1a. Bladder training--when begun?            35             39 

a N does not include ODM (Original data missing) or DNA (Does not apply) responses. 

FIG. 2. Inaccuracies in parental recall as related to experts' recommendations. 


curred, in three cases for as long as a year, 
but also that several of these mothers had 
expressed concern about it at the time. In 
contrast, of five mothers who erred in their 
recall of the use of the pacifier, four stated 
that their child had used one when, according 
to the longitudinal records, he had not. 

On items for which expert advice was rela- 
tively less specific or irrelevant, maternal er- 
rors were random in direction. Thus, with 
respect to the introduction of cereal, 17 
mothers recalled the date of onset as later and 
18 as earlier, while the birth weight was 
understated by 8 and overstated by 9. 

As appears in Figure 2, the fathers tended 
to be less affected by expert advice than the 
mothers. While they all reported some ac- 
quaintance with Spock (1957), the fathers 
felt they were less thorough in their reading 
than their wives, and that he influenced them 
little. 
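For readers who want the overall inaccuracy rates behind Figure 2, the following sketch converts the printed counts into percentages. It is a reading aid under stated assumptions: the dictionaries simply transcribe the counts beside the bars, the function name is hypothetical, and the counts do not carry the direction of error, which is the aspect on which mothers and fathers are compared above.

```python
# Counts transcribed from Figure 2: (N inaccurate responses, total N).
MOTHERS = {
    "Schedule vs. demand": (20, 43),
    "Breast vs. bottle": (2, 44),
    "Ever suck thumb": (7, 44),
    "Ever suck pacifier": (5, 43),
    "Bowel training begun": (40, 44),
}
FATHERS = {
    "Schedule vs. demand": (23, 38),
    "Breast vs. bottle": (1, 39),
    "Ever suck thumb": (8, 39),
    "Ever suck pacifier": (5, 38),
    "Bowel training begun": (37, 39),
}


def percent_inaccurate(counts):
    """Return the percentage of inaccurate responses for each item."""
    return {
        item: round(100 * wrong / total, 1)
        for item, (wrong, total) in counts.items()
    }


mothers_pct = percent_inaccurate(MOTHERS)
fathers_pct = percent_inaccurate(FATHERS)
for item in MOTHERS:
    print(f"{item}: mothers {mothers_pct[item]}%, fathers {fathers_pct[item]}%")
```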


Discussion 


In a widely quoted review of the literature,
Bronfenbrenner (1958) inferred, on the basis 
of data which showed a rise in reports of de- 
mand feeding from 7% in 1932 to 71% in 
1949-50, that self-demand feeding is becom- 
ing more common in the middle class. His 
conclusion must be assessed in light of the 
fact that 77% of the mothers in the current 
sample reported having fed their infants on
demand 3 years previously. While this sup-
ports the trend noted by Bronfenbrenner,
since the data were gathered subsequent to 
the last study he surveyed, an examination
of the original records shows that only 56%
of the mothers had actually fed on demand
(see Table 3). The number of mothers whose
reports were erroneous was sufficient to make
this one of the items where inaccuracies were
significant (see Table 2).


In the studies cited by Bronfenbrenner 
(1958), the age of weaning ranges from 10.3 
to 14.4 months. This is considerably earlier 
than the recalled median age for completion 
of weaning found in the current sample— 
17.4 months. The difference again appears to 
be consistent with the trend to permissiveness 
identified by Bronfenbrenner. Since the sub- 
jects of the other studies include a substantial 
number of children who were reared in the 
late thirties and early forties, when early 
weaning was commonly advocated (Wolf- 
enstein, 1953), it is quite conceivable that a 
genuine change in practices has occurred. 
However, the actual median age of weaning 
of children in this study was 23.8 months. 
Parents were even more permissive in their 
practices than in their reports, perhaps be- 
cause they had been influenced by such 
recommendations as Spock’s (1957) to com- 
plete weaning before 24 months. In addition, 
the direction of error suggests that the ten- 
dency to report weaning as occurring earlier 
than was really the case may have been in- 
tensified for mothers in other studies, who 
were questioned about this event as much as 
17 years after it took place. 

The degree of inaccuracy found in the 
present study is only a minimal indication of 
the errors that may be present in other 
research. In part this is so because the chil- 
dren in this sample were considerably younger 
than those in other studies. In addition, the 
parents as a group were better educated and 
probably more intelligent than those inter- 
viewed by previous investigators. There is 
some indication that higher education is 
correlated with more accurate recall of de- 
tails about child rearing (Pyles et al., 1935). 
Because this sample has been participating 
in a longitudinal study in which extensive 
reports of the child’s behavior and develop- 
ment have been made at relatively frequent 
intervals, parents could be expected to ob- 
serve, and therefore to retain, material relat- 
ing to child rearing with greater precision 
than mothers whose first contact with an in- 
terviewer occurred some years after the events 
had taken place. Another factor that probably 
serves to enhance the relative accuracy of 
parents in this sample is that both parents 
were interviewed, rather than mothers alone. 
They might have shown greater care in their
responses, knowing that their spouses were 
answering the same questions. Finally, the 
parents were aware that records were avail- 
able against which their recollections could 
be checked—knowledge that might serve both 
as a deterrent to distortion and as an intel- 
lectual challenge for demonstrating their 
acuity to the personnel of the study. 

The differences between original records 
and subsequent reports presented in Table 3 
are in almost every instance as large as, or 
larger than, the differences between middle- 
and lower-class samples tabulated by Bron- 
fenbrenner (1958, pp. 408-410). This is true 
even though the parents in the current study 
were talking about the same child at two 
points in time, while the class comparisons are 
based on different children. It is clear that a 
meaningful discussion of class differences in 
child rearing practices must await detailed 
longitudinal observations of representative 
samples of each group. Regardless of the ac- 
curacy of the reports reviewed by Bronfen- 
brenner, however, the way the mothers pre- 
sented their child rearing practices at various 
points in time remains an important indicator 
of the values they have held. 

As is apparent in Table 3, child rearing 
practices in the current sample by and large 
conformed to the recommendations of experts 
(see Table 1), in that demand feeding, rela- 
tively late weaning and toilet training, and a 
permissive attitude toward sucking are the 
mode. However, the experts also materially
influenced parental recall, both in the sense 
of providing knowledge about child rearing 
around which memories could be organized, 
and in setting standards for appearing to be 
a good parent in the ways approved by “en- 
lightened” judgment. This was especially evi- 
dent for the mothers, who prefaced many of 
their responses with accurate statements of 
Spock’s (1957) recommendations. In addi- 
tion, many of the parents apologized for not 
always following the approved techniques, 
even when their modification seemed well- 
suited to the particular child. 

It is possible that some of the inaccuracies 
in recall were due to more or less deliberate 
distortion. Yet the experts’ recommendations
are such a potent influence that in all likeli-
hood many parents were not aware of the
fact that they were incorrect in their reports. 
This factor should be kept in mind when 
evaluating the results of other studies, since 
it suggests that neither good rapport (Sears 
et al., 1957), the asking of several questions 
in a given area (Miller & Swanson, 1960), 
nor internal consistency in maternal responses 
(Bronfenbrenner, 1958; Miller & Swanson, 
1958, 1960; Sears et al., 1957) is sufficient 
to insure accuracy. 

The interplay between experts’ opinions 
and parental practices is a complex one. The 
experts look to research for justification of 
their recommendations and for substantiation 
of their theories. Ironically, however, the 
responses of parents who furnish data for 
such research may already be colored by the 
advice that they have received and may not 
be accurate accounts of what they have actu- 
ally done. As a result, there may be a self- 
perpetuating “validation,” with experts’ opin- 
ions influencing parental reports, which in 
turn influence the experts. 
IMBALANCE IN NEED STRUCTURE AND MALADJUSTMENT:

4 studies are reported which replicate the test of the hypothesis that extreme
imbalance within a need structure is associated with poorer adjustment.
The Rotter Incomplete Sentences Blank (ISB) was the adjustment-maladjust-
ment measure. Imbalance in need structure was measured by Liverant's Goal
Preference Inventory which assesses the relative strength of Affection and 
Recognition needs. Extremely imbalanced and extremely balanced need groups 
were compared on ISB scores in each of the 4 independent studies. All 
results were in the predicted direction, with 11 out of 16 comparisons statis- 
tically significant. Within the limitations of the measures, the studies provide 
support for the hypothesis that a structural characteristic of a personality 
need system—extreme imbalance—is associated with poorer adjustment. 


The aim of the present research has been 
to coordinate adjustment-maladjustment to a 
characteristic of the need structure of an
individual. Four separate and independent
studies, modified replications of each other,
will be reported. The general hypothesis has
been that where the structure of needs is such
that one need is prepotently stronger than
other needs in that structure, the index of
maladjustment will be higher than in the con- 
dition where there is relative balance among 
needs in the structure. 

Although this hypothesis is compatible with 
several interpretations, the reasoning which
led to its development and testing in the
present context was that an overvalent or
prepotent need would dominate the goal
oriented behavior of an individual and lead
him to seek goals relevant to the strong need
in all or most situations, including situations
in which such goals are not considered by
others to be socially appropriate or even
potentially available. The consequence of such


1 Financial assistance from the University of
Colorado Council on Research and Creative Work
and Grant 3M-9156 from the National Institute of
Mental Health, United States Public Health Service,
is gratefully acknowledged. The following persons
have provided invaluable assistance in scoring proto-
cols or in the statistical analyses of the data from
the various studies: Theodore Graves, Lee Jessor,
Jaswant Khanna, Laurie MacTavish, Sandra Nugent,
and Robert Titley.
a process over time would be: conflict, since
prepotent goal seeking is at the expense of 
other needs; frustration, since goals are often 
sought in situations where they are un- 
available or unlikely (e.g., seeking affection 
in a college classroom situation); and social 
rejection, since interpersonal reactions to in- 
appropriate goal seeking (e.g., seeking recog- 
nition in social affection situations) would 
ultimately tend to be negative. This general 
reasoning follows from Rotter’s (1954) Social 
Learning Theory, and the procedures employed 
to test the hypothesis were developed more 
or less systematically within the framework 
of that theory.
METHOD 


Four separate studies have been carried out over 
the past 5 years, each employing the same measuring 
instruments and general procedure, with minor modi-
fications.


Maladjustment Measure 


The Rotter Incomplete Sentences Blank (ISB), 
college or high school form where appropriate (Rot- 
ter & Rafferty, 1950; Rotter, Rafferty, & Lotsof, 
1954), was the measure of adjustment-maladjustment 
in each study. The ISB is a well-standardized in- 
strument with established reliability and validity for 
discriminating, among students, maladjustment lead- 
ing to clinic referral or clinic self-presentation. It 
consists of 40 stems, completions to each of which 
are scored on a scale from 0 to 6 with higher scores
being assigned where completions indicate conflict,
feelings of isolation, and low expectation of success.
Interscorer reliability (above .90) is easily obtained
even among undergraduate student scorers trained
only by instructions and examples in the ISB manual 
(Churchill & Crandall, 1955). In the present research 
interscorer reliability was either directly assessed, or 
scoring was done by a psychologist whose interscorer 
reliability had been previously established. In all 
cases, Pearson product-moment r’s between scorers,
for total ISB score, were above .95. 


Measure of Need Structure 


The Goal Preference Inventory (GPI), a forced- 
choice procedure in which behavioral referents from 
different need areas are paired into items, was used 
in all four studies. The original version of this in- 
ventory consisted of 120 items made up into six sub- 
scales measuring relative strength of two major needs, 
Recognition and Love and Affection. These two ma-
jor needs were constituted from four lower-order
needs: Academic Recognition (ACR) and Social
Recognition (SOR) on the one hand, and Academic 
Love and Affection (ALA) and Social Love and Af- 
fection (SLA) on the other. Referents for each of 
these four needs, when paired against each other, 
yield the six subscales mentioned above. The full de- 
velopment and construct validation of this inventory 
by Liverant (1958) is described in his monograph, 
and this form of the inventory was used in Studies I, 
II, and III. Subsequently, the GPI was revised to 
eliminate the Academic Love and Affection need, 
which resulted in a 60-item inventory comprised of 
only the three remaining lower-order needs (ACR, 
SOR, and SLA) which, when paired against each 
other, yielded three subscales. This revision was then
modified further, primarily in wording of the items, 
to make it suitable for students in a small town high 
school, the subjects in Study IV. 
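The way pairing needs against each other produces the subscale counts above can be sketched as follows. This is an illustration only: it assumes that items are divided equally among subscales, which the text does not state, and the helper name `subscales` is hypothetical.

```python
from itertools import combinations


def subscales(needs):
    """Every unordered pairing of needs defines one forced-choice subscale."""
    return list(combinations(needs, 2))


original = subscales(["ACR", "SOR", "ALA", "SLA"])   # four lower-order needs
revised = subscales(["ACR", "SOR", "SLA"])           # ALA eliminated

# Assumption: the 120 items are split evenly across the original subscales.
items_per_subscale = 120 // len(original)
revised_items = items_per_subscale * len(revised)
# Each need appears in every subscale that pairs it with another need.
items_per_need_revised = items_per_subscale * sum("ACR" in pair for pair in revised)

print(len(original), len(revised), revised_items, items_per_need_revised)
```

Under the equal-split assumption, the four-need form gives six subscales of 20 items, and dropping ALA leaves three subscales totalling 60 items, consistent with the revision described above.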

Need structure in these studies, then, refers to re- 
lations among those needs measured by the GPI. 
Clearly, the GPI measures only a small number of 
needs, and structural relations among these few needs 
may not represent characteristics of the overall need 
structure. Nevertheless, the needs dealt with by the
GPI are central ones and are particularly pertinent 
to student life and to this age range. It was, there- 
fore, deemed satisfactory as a partial assessment of 
need structure for a student group. 

The procedure of identifying subjects with a pre- 
potent need was similar but not identical in the four 
studies. Generally the procedure involved administer-
ing the GPI to a large sample of subjects and then,
from the distribution of scores, selecting three com-
parison groups: a group scoring extremely high on
need for Academic Recognition (ACR group); a
group scoring extremely high on need for Social Love
and Affection (SLA group); and a group scoring
relatively in the middle range, i.e., having ACR and
SLA scores about equal (Balance group). It should
be clear to the reader that scoring high on one need
automatically lowers the scores on one or more of
the other needs by virtue of the pairing which oc-
curs in forced-choice measures. The criterion of ex-
tremeness of score was generally a score 1 standard
deviation or more from the total sample's mean score
on one or on several subscales. The criterion for bal- 
ance of needs was generally a score, for each need, 
within +1 and -1 standard deviation from the total
sample’s mean for that need. In all studies, efforts 
were made to maximize the separation between ex- 
tremeness and balance by selecting the most ex- 
tremely imbalanced subjects and the most extremely 
balanced subjects. The success in this effort varied, 
of course, with the size of the original sample to 
which the GPI was administered in each study. 
When extreme-need groups and a balance-need 
group were established in each study, all subjects in 
these groups were administered the ISB. Scores on 
the ISB were then compared among the groups. The 
specific hypothesis in all studies was that ISB scores 
for all extreme-need groups (irrespective of the con- 
tent of the extreme need) would be higher than ISB 
scores for the comparable balance-need group. 
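The general selection rule can be sketched in code. The version below is a simplified illustration, not the authors' exact procedure, which varied across the four studies: it assumes scores are standardized against the total sample with the population standard deviation, treats a subject as extreme when exactly one need is 1 standard deviation or more above the mean, and treats a subject as balanced when every need lies within 1 standard deviation of its mean. The names `z_scores` and `classify` are hypothetical.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Standardize raw scores against the total sample (population SD assumed)."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def classify(z_by_need, cutoff=1.0):
    """Assign one subject to an extreme-need group, the balance group, or neither.

    z_by_need maps a need label (e.g. "ACR") to that subject's z score.
    """
    high = [need for need, z in z_by_need.items() if z >= cutoff]
    if len(high) == 1:
        return f"extreme:{high[0]}"
    if all(abs(z) < cutoff for z in z_by_need.values()):
        return "balance"
    return "unclassified"


# Hypothetical subjects, for illustration only.
print(classify({"ACR": 1.4, "SOR": 0.2, "SLA": -0.3}))   # extreme:ACR
print(classify({"ACR": 0.1, "SOR": -0.4, "SLA": 0.3}))   # balance
print(classify({"ACR": 1.2, "SOR": 0.0, "SLA": 1.1}))    # unclassified
```

Subjects left unclassified correspond to those who fit neither criterion and so would not enter either comparison group.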


Subjects 


In Study I, the GPI was administered to 283 stu- 
dents in an introductory psychology class at the 
University of Colorado. In Study II, the GPI was 
given to 153 students in introductory psychology at 
Ohio State University. In Study III, the GPI was 
administered to 931 students in an introductory psy- 
chology class at the University of Colorado. In Study 
IV, the GPI was administered to 68 juniors and 
seniors in a small high school in southwestern Colo- 
rado. (This group was made up of Anglo-Americans, 
Spanish-Americans, and Indians.) The size and sex 
breakdown of the extreme-need and balance-need
groups established in each study will be shown in
the next section.

RESULTS 
Study I 


The Academic Recognition (ACR) extreme 
group was composed of subjects scoring 1 
standard deviation or more above the mean 
when Academic Recognition was paired with 
Social Recognition, Academic Love and Affec- 
tion, and Social Love and Affection and, at 
the same time, scoring less than 1 standard 
deviation above the mean on each of the three 
remaining need areas, SOR, ALA, and SLA.
Likewise, the Social Love and Affection (SLA) 
extreme group was composed of subjects scor- 
ing 1 standard deviation or more above the 
mean when Social Love and Affection was 
paired with Academic Recognition, Social 
Recognition, and Academic Love and Affec- 
tion and less than 1 standard deviation above 
the mean on each of the three remaining need 
areas. The balance group is established when
ACR = SLA (Subscale III) and when scores
are between +1 and -1 standard deviation
from each of the means on the other five sub-
scales. The results of this analysis are shown
in Table 1. The Mann-Whitney U test was
used in assessing results in all four studies
and all p levels reported are for one-tailed
tests.


TABLE 1

COMPARISON OF EXTREME GROUPS WITH BALANCE GROUPS ON ISB SCORES

              Extreme          Balance
Sex           M       N        M       N       U
ACR versus Balance
Females       134.6   13       121.8   12      (illegible)
Males         132.9    9       121.4   10      18.5*
SLA versus Balance
Females       127.6   10       121.8   12      44.5
Males         128.1    8       121.4   10      30

* p < .025.

On this analysis, although all differences are 
in the predicted direction, only two of the 
four comparisons yield statistically significant 
support for the hypothesis. 
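For readers unfamiliar with the statistic, the Mann-Whitney U computation can be sketched as below. This is a generic textbook version using midranks for ties, not a reconstruction of the authors' calculations, and the ISB scores in the usage example are invented for illustration. Small values of U indicate little overlap between the two groups.

```python
def mann_whitney_u(x, y):
    """Return the smaller of the two U statistics, using midranks for ties."""
    pooled = sorted((v, g) for g, sample in enumerate((x, y)) for v in sample)
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        # Tied values at positions i..j-1 share the average of ranks i+1..j.
        midrank = (i + 1 + j) / 2
        for k in range(i, j):
            ranks[k] = midrank
        i = j
    rank_sum_x = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    u_x = rank_sum_x - len(x) * (len(x) + 1) / 2
    u_y = len(x) * len(y) - u_x
    return min(u_x, u_y)


# Hypothetical ISB totals, not data from the studies.
extreme_group = [130, 141, 128, 150]
balance_group = [118, 125, 121, 129]
print(mann_whitney_u(extreme_group, balance_group))
```

The resulting U would then be referred to one-tailed critical values for the two group sizes, as the text describes.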


Study II 


A replication was carried out at Ohio State 
University as part of a larger project. Be- 
cause of requirements stemming from the 
larger project, only male subjects were used. 
The criteria for group selection are identical 
to those used in Study I. The results are pre- 
sented in Table 2. Again, only the compari- 
son with the ACR group is significant. 


TABLE 2

COMPARISON OF EXTREME GROUPS WITH BALANCE GROUPS ON ISB SCORES

                      Extreme          Balance
Comparison            M       N        M       N       U
ACR versus Balance    132.4   10       121.2    8      (illegible)
SLA versus Balance    121.9   10       121.2    8      39.5
Study III


The large population (N = 931) adminis-
tered the GPI in this study permitted some-
what more stringent criteria of selection of
the various groups. Here the procedure was 
to compose the ACR extreme groups from 
subjects scoring 1 standard deviation or more 
above the mean when ACR was paired with 
ALA and SLA and, at the same time, scoring 
close to the mean on the remaining subscales. 
Likewise, the SLA extreme group was estab- 
lished as scoring 1 standard deviation or more 
above the mean when SLA was paired with 
ACR and SOR and scoring close to the mean 
on the other subscales. The balance group 
is established when ACR = SLA and when 
scores are close to the mean on the other five 
subscales. The results of this analysis are 
shown in Table 3.

These data provide strong support for the 
hypothesis that the index of maladjustment 
will be higher where need structure involves 
an extremely prepotent need than where need 
structure is extremely balanced. All differ- 
ences were statistically significant in the pre- 
dicted direction. This study provides the 
strongest confirmation of the hypothesis of 
any of the studies reported in this paper. 


TABLE 3

COMPARISON OF EXTREME GROUPS WITH BALANCE GROUPS ON ISB SCORES

              Extreme          Balance
Sex           M       N        M       N       U             z
ACR versus Balance
Females       145.4   14       125.4   16      (illegible)
Males         (illegible)      130.7   15      (illegible)
SLA versus Balance
Females       140.0   23       125.4   16      115.5**       1.96
Males         140.9    8       130.7   15      33*           -

Note. Entries marked (illegible) and the definitions of * and ** are not
recoverable from the source; the final footnote reads p < .01.


Study IV

A final investigation of the hypothesis was
made possible by the availability of the
relevant measures as part of a larger community
study. The GPI used in this investigation
consisted of the revision described earlier, a form
including only three subneeds: Academic
Recognition, Social Love and Affection, and Social
Recognition. Each is paired against both of
the others: ACR versus SOR, ACR versus
SLA, and SOR versus SLA. The score for any
single subneed is based, then, upon 40 items.
Since the total sample to which the GPI was
administered was so small in this study
(N = 68), the establishment of extreme and
balance groups was handled in a less stringent
way. All students obtaining a score 1 standard
deviation or more above the mean on any
of the three subneeds measured by this form
of the GPI were considered to have an
extreme need. All other students were considered
to have balance among the three needs.
Results of this analysis are shown in Table 4.
Again, all differences are as predicted and
those for males alone and for the combined
sexes are statistically significant.


TABLE 4

COMPARISON OF EXTREME GROUPS WITH BALANCE GROUPS ON ISB SCORES

              Extreme ACR,
              SLA, and SOR     Balance
Sex           M       N        M       N       U        z
Female        137.2   15       136.4   16      -        -
Male          131.8   14       124.0   23      97.5     1.82*
Combined      134.6   29       129.1   39      433.5    1.64*

* p < .05.


Further analysis was made by breaking down the
combined extreme need group into the separate
extreme needs and comparing each of
these groups with a more refined balance
group. The latter was constituted of those
students whose scores on all three need areas
were within .5 of a standard deviation, or less,
of the respective mean for each need. These
results are shown in Table 5.


TABLE 5

COMPARISON OF EXTREME GROUPS WITH BALANCE GROUP ON ISB SCORES

Extreme       Balance
Measure       M       N        U
ACR           126.0   14       52.5*
SLA           126.0   14       29.0***
SOR           126.0   14       31.5**

Note. The extreme-group means and Ns and the footnote definitions are not
recoverable from the source; one footnote is marked approximate and
another reads p < .01.

From this final analysis it is clear again 
that all differences are in the expected direc- 
tion, and two out of three are statistically 
significant. 

DISCUSSION 


The hypothesis which was independently 
examined in four separate studies reported in 
this paper was that a substantial departure 
from balance in a structure of needs or mo- 
tives will be associated with a higher level of 
maladjustment (or a lower level of adjust- 
ment). Given the measures employed, the re- 
sults are generally supportive of the hypothe- 
sis. Although not all the predicted differences 
were statistically substantiated, all showed 
differences in the expected direction, and 11 
out of 16 comparisons did reach an acceptable 
level of statistical significance. 

A closer analysis of the data suggests, how- 
ever, that the content of the need may not be 
entirely irrelevant to our findings. The signifi-
cant relationships within Studies I and II are
entirely accounted for by the extreme ACR 
groups. None of the SLA comparisons is sig- 
nificant. While this observation does not hold 
up at all in Studies III and IV, it is worth 
speculating upon for a moment. 

The greater likelihood for ACR dominated 
college students to have higher maladjust- 
ment scores than SLA dominated students 
may be a function (at least in part) of the 
differential reinforcements provided by their 
respective college cultures. That is, the cul- 
ture at large state universities is perhaps more 
accepting of the person who is strongly affec- 
tion oriented, even in achievement-demand 
situations, and relatively less accepting of one 
who pursues academic recognition goals at 
the expense of social interaction. Another 
possible explanation for the higher maladjust- 
ment scores of academically oriented college 
students may lie in the ISB scoring system 
itself. A review of the sentence completions 
suggests that the SLA dominated students 
say, to a greater degree, that they like peo-
ple and tend to give brief and socially con-
forming answers (responses which yield low
scores), whereas the ACR dominated indi-
viduals tend to be more introspective and less
socially conforming in their responses. 

The overall conclusion to be reached from 
the data, despite the foregoing, is that there 
is a relationship between need structure and 
maladjustment within the limited assessment 
of need structure and maladjustment made in 
this research.

Even though small Ns were used, it is sup- 
portive of the selection procedure employed 
to find that in most instances the mean ISB 
scores of the college balance groups are below 
the means of the normative sample estab- 
lished by Rotter and Rafferty (1950) (fe- 
males = 127.4, males = 127.5), whereas the 
means of the extreme groups tend to be above 
the norms. Also of interest is that in Study 
III, wherein the most stringent criteria were 
employed, all of the extreme-need groups had 
mean ISB scores higher than the maladjust- 
ment cutoff point of 135 suggested in the ISB 
manual for the most efficient discrimination 
of students in need of counseling. 

The higher scores of all groups selected 
from the high school population relative to the 
normative high school sample (Rotter et al., 
1954) (females = 116.9, males = 120.9) may 
well reflect the unusual ethnic composition 
and lower socioeconomic level of this group 
compared to the middle-class suburban high
school subjects used in establishing the norms. 

A final caution should be offered perhaps. 
Our measures suggest a relation between a 
structural characteristic of a need system— 
imbalance—and a particular index of adjust- 
ment-maladjustment. The implications of such 
a relationship are, on the face of it, neither 
good nor bad, and one should be wary of ap- 
plying value judgments. Need-imbalance may 
be as much a source of creative accomplish- 
ment, despite feelings of frustration and re- 
jection, as it is of personality debilitating 
maladjustment as that term is used in com- 
mon parlance. 
CRITIQUE AND NOTES


INFLUENCE OF PARTIALLY VESTED INTERESTS ON JUDGMENT 1


JANE SRYGLEY MOUTON AND ROBERT R. BLAKE


University of Texas 


In a study of the evaluation of group positions under various conditions of 
vested interests, 16 groups composed of 118 Ss spent approximately 12 hr. 
in autonomous ingroup activities prior to intergroup problem solving. An 
assigned problem served as the basis for creating different positions among 
groups. Competition was between pairs of groups. The judging pairs, who 
evaluated competing solutions, were composed with 2 degrees of vested 
interests: no vested interest or the condition of neutrality and partially 
vested interests with 1 person holding membership in a group whose solution 
was being judged and the other person from a neutral group. A vested 
interest in an inferior position led to judgments different from those obtained 
from nonvested pairs. In the latter situation, when the judging pairs included 
persons from the group with an inferior position, they were successful in 
elevating the evaluations of their position significantly above those given 
under baseline conditions of neutrality to a point where it, in effect, tied the 


better solution for adequacy. 


Vested interests based on membership ties 
define a significant factor for understanding the 
perceptions and judgments of group members 
when operating outside the immediate framework 
of their own group. The definition of vested 
interest, in the context of its use here, is related 
to a member’s positive identification with his 
own group and with a product it has created 
which is being compared with a similar product 
by a competitive group. The prediction to be 
tested is that when an individual has a vested 
interest in a comparison judgment, he evaluates 
less objectively than when his attitude toward the 
object or action being judged is neutral (Sherif 
& Sherif, 1953).

A variety of types of vested interests can be 
identified for theoretical and experimental pur- 
poses. When the two individuals who are engaged 
in comparing two products do not hold member- 
ship in the groups which produced these products, 
their relationship to the judging situation is 
nonvested. This baseline condition of neutrality 
provides a control measure against which to 
evaluate the influence of pressures acting on 
individuals in the interest of own group victory. 
Partially vested interests arise when one of the 
two individuals has membership in one of the 


1 Studies reported concerned with intergroup be- 
havior were partially supported by Grant M-2447, 
“Behavior of Group Representatives under Pressure,”
National Institute of Mental Health, United States
Public Health Service, and by a grant from Esso 
Division, Humble Oil and Refining Company. 


groups that produced one of the two products 
being compared, but when the other does not 
hold membership in either of the contending 
groups. The operation of partially vested interests 
can be evaluated against the control condition 
which provides an objective assessment of the 
relative merits of any two group products. 


PROCEDURE 
General Design 


The design used was as follows. In each of four 
independent runs, four groups designated A, B, C, 
and D, engaged in intergroup competition. Each 
produced its own solution to a standard problem. 
Then, A was pitted against B, but not against C 
and D; C competed against D, but not A and B. 

Under the above circumstances, evaluations of the 
four products were made by every member of each 
group interacting in a pair relationship with one 
member from another group. Judgments by members 
from A and B, involving comparisons of solutions 
produced by C and D, constituted baseline or non- 
vested judgments. Judgments by members from C 
and D, contrasting solutions produced by Groups A 
and B, also constituted nonvested evaluations. For 
both of these conditions, no vested membership 
interests were involved. 

Comparison judgments of solutions created by 
Groups A and B by pairs composed of members 
from A and C, or judgments of A and B, by 
members from A and D were partially vested. One 
member was from a group that produced one of the 
solutions and the other member was neutral.
Similarly, pairs composed of individuals from B and
C, and B and D, judging solutions A and B were
partially vested pairs. Parallel conditions of partial
vestment also applied to judgments for Solutions 
C and D. 
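The pairing rules of the general design can be summarized in a short sketch. It is an illustration under stated assumptions: the function name `vested_status` is hypothetical, and the label for pairs in which both judges belong to the contending groups is included only for completeness, since that condition is not part of the comparisons described here.

```python
def vested_status(judge_groups, contest):
    """Classify a judging pair by its membership ties to the solutions judged.

    judge_groups: the home groups of the two judges, e.g. ("A", "C").
    contest: the two groups whose solutions are being compared, e.g. ("A", "B").
    """
    vested_judges = sum(1 for group in judge_groups if group in contest)
    if vested_judges == 0:
        return "nonvested"
    if vested_judges == 1:
        return "partially vested"
    return "both judges vested (outside the conditions described here)"


# Members of A and B judging the C-versus-D solutions: baseline condition.
print(vested_status(("A", "B"), ("C", "D")))
# A member of A paired with a member of C, judging the A-versus-B solutions.
print(vested_status(("A", "C"), ("A", "B")))
```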


Hypothesis 


The hypothesis is that when judgments are made 
of competing products by a partially vested pair, 
the evaluation of the product "belonging" to the
member with the vested interest will be higher than 
under nonvested or neutral conditions. The evalua- 
tion of the competing solution will be lower than the 
judgment it receives under neutral conditions. 

The rationale for this prediction is that the 
judgment of the neutral member will be influenced 
by the person with a vested interest in the direction 
of his own personal gain. The direction of influence 
will be to evaluate one's own position and to
de-evaluate one's adversary. 


Setting and Subjects 


The experiments were undertaken in connection 
with four human relations training programs con- 
ducted according to the same basic design. Any given 
laboratory was composed of four groups of 7 or 
8 members in each group, with a total of 118 
subjects used. 


Evolution, Opposition, and Resolution of Differ-
ences between Groups


The arrangements for creating intergroup competi- 
tion were: 


1. Ten to 12 hours, divided into five or six 2-hour 
sessions occurring over a 3-day period, were devoted 
to autonomous ingroup development. The purpose of 
individuals meeting together was to study decision 
making in groups through developing and then 
evaluating their own ingroup structures. 

2. During the second phase, each group had 3 
hours to develop its own solution for an assigned 
problem, concerning some basic issue with which
members of the contending groups were equally
familiar, such as ways to improve the effectiveness
of an organization or how to improve union-
management relations. Group members understood
that their group products were to be evaluated as 
an indication of which group was the “better” of 
the two. 

3. Then, 1-2 hours were spent in ingroup discussion
and evaluation of the merits of each of the four
solutions, followed by a cross-questioning period for
clarification purposes between pairs of groups which
had been pitted against one another.

4. Judging pairs were composed of members
according to the general design presented above.
Twenty pairs were nonvested. The remaining 39
were partially vested. Each pair evaluated the competing
solutions and determined a joint rating for
each solution on an “adequacy of solution scale.”
This scale ranged from totally inadequate, 1, to
totally adequate, 9.


RESULTS


Results for conditions differing in vested interest 
combinations are shown in Table 1. Because no 
one of the four repetitions was significantly dif- 
ferent from the others, results have been pooled 
for analytical purposes. Average judgments by 
nonvested pairs provide the basis for determining 
which solution was “objectively” better and, 
therefore, which group produced the winning 
solutions. Data in Table 1 show that, in terms 
of the average ratings by the nonvested pairs, 
the better solutions, judged 6.6 in terms of over- 
all adequacy, were evaluated as being significantly 
better than the poorer solutions, judged 5.0 
(t = 3.72). These averages indicate that when
the solutions were rated by persons without 
vested interests, there was a clear difference 
between the two solutions in terms of perceived 
adequacy. 

The evaluation of competing solutions by 
vested interest pairs resulted in an elevation of 
the judgment of the solution belonging to the 
member of the pair with a vested interest. Also, 
the competitor's solution was consistently de- 
evaluated. In addition, an interaction was found 
between objective quality of a solution and the 
impact of vested interest on the judgment. 

When one member had a vested interest in the 
solution that was judged better by the nonvested 
pairs and the other had no vested interest because 
his group's solution was not being evaluated, the 
average judgments did not differ significantly 
from those made by nonvested pairs. The trend, 
however, was in the predicted direction. The
highest average, 6.7, was awarded to the solution 
“owned” by the vested member, with an average 
of only 4.6 assigned to the “poorer” solution. 
These ratings are not significantly different from 
the ones assigned by the neutral pairs, but they 
are significantly different from one another. 

When one member had a vested interest in
the objectively poorer solution, he was able to
persuade the nonvested person to give a significantly
higher average rating (6.3) to his own
group's solution than it received under the nonvested
judging conditions (5.0) (t = 3.53). In
addition, the judgment of the better solution was
the lowest of any condition (6.1), although it
was not significantly different from the judgment
of the poorer solutions.


TABLE 1
JUDGMENTS OF COMPETING SOLUTIONS

Condition                      Objectively        Objectively        t
                               better solution    poorer solution
Nonvested                           6.6                5.0           3.72*
Partially vested interests          6.7                4.6           4.12*
  (better solution)
Partially vested interests          6.1                6.3            .43
  (poorer solution)

*p < .001.

These results demonstrate that the judgment 
assigned to the better solution is different under 
partially vested conditions than is the judg- 
ment assigned to the objectively poorer position 
under partially vested conditions. The better solu- 
tion elicited a more uniform judgment across all 
conditions. The average ratings given by each 
of the three kinds of vested pairs were not 
significantly different from one another. 

Results were quite different with regard to the 
solution of objectively poorer quality. When 
evaluated under partially vested conditions the 
vested member exerted influence on the pair in 
the direction of elevating his inferior product 
to a level significantly higher than the judgment 
it received under nonvested conditions. When the 
member with a vested interest in the better 
solution was judging with a neutral, he tended 
to exert influence in the direction of lowering
the evaluation given by the pair to the poorer 
product.


CRITIQUE AND NOTES


In evaluating the merits of solutions to a
problem, the results above indicate that the
operation of vested interests stemming from
group membership is influenced by the quality
of one’s group position. If a person is a member
of a group which produces a superior product,
then there is no conflict of interests in the evaluation
situation. The tendency to embrace one’s
group position as being better is consistent
with the perceived quality of the position by
neutral individuals, so that a person can operate
consistently within the framework of logic and
loyalty. However, when individuals are confronted
with the inferiority of their group's
position, a threat is posed to the competence and
cohesion of the group. Under these conditions,
loyalty overwhelms logic. In the evaluation situa- 
tion, group interests are paramount. Judgments 
are biased in the direction of heightening the 
merit of one’s own proposal.


A UBIQUITOUS SEX DIFFERENCE IN DREAMS


CALVIN HALL AND BILL DOMHOFF


Institute of Dream Research 


Dream narratives from 1399 men and from 1418 women indicate that men 
dream more about other men than they do about women, whereas women 
dream about men and women in almost equal proportions. This finding is 
explained in terms of (a) a conflict theory of dreaming and (b) Freud’s 
theory of the male and female Oedipus complex. 


In 3874 dream narratives collected from 1399 
male dreamers, 5811 characters, exclusive of 
the dreamer, could be identified as to sex. Of 
these 5811 characters, 3695 are males and 2116 
are females. The percentage of male characters
is 64.

In 3065 dream narratives collected from 1418 
female dreamers, 5342 characters, exclusive of 
the dreamer, could be identified as to sex. Of 
these 5342 characters, 2782 are males and 2560 
are females. The percentage of male characters 
is 52.

The foregoing figures were obtained by pooling
data from 11 separate investigations. The results
for these 11 studies are presented in
Table 1. The consistency among the percentages
for the various populations is striking.
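
The pooled percentages can be rechecked from the character counts stated above. What follows is a minimal Python sketch, not part of the original report: the dictionary layout and the helper name `percent_male` are illustrative assumptions, and only the four counts come from the text.

```python
# Character counts as reported in the text (the dreamer is excluded).
counts = {
    "male dreamers": {"male": 3695, "female": 2116},
    "female dreamers": {"male": 2782, "female": 2560},
}


def percent_male(male: int, female: int) -> int:
    """Percentage of male characters, rounded to the nearest whole number."""
    return round(100 * male / (male + female))


# Hypothetical summary structure: one rounded percentage per dreamer group.
percentages = {
    group: percent_male(c["male"], c["female"]) for group, c in counts.items()
}

for group, pct in percentages.items():
    total = sum(counts[group].values())
    print(f"{group}: {pct}% male characters out of {total}")
```

Rounded to whole numbers, the two values agree with the percentages given for the combined groups.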

In addition to the principal fact established 
by these 11 investigations, namely, that in 
male dreams there are more male characters than 
female characters, and in female dreams there 
are nearly as many females as males, other 
conclusions may be drawn. These are: 

1. The sex difference holds throughout the
age range from 2 to 80 (Groups 1 through 6). 

2. The sex difference is found for dream 
series as well as for collections of individual 
dreams (Group 7 compared with Groups 1-6).


TABLE 1
MALE AND FEMALE CHARACTERS IN DREAMS

                                        Number of        Number of        Percentage of
                                        dreamers         dreams           male characters
Description of population               Male   Female    Male   Female    Male   Female
 1. College students                      78       80     841      978      64       52
 2. College students                     199      499     357      670      63       51
 3. College students                     225      225     658      662      61       54
 4. College students                     200      200     200      200      60       53
 5. Adults (ages 30-80)                  281      281     281      281      66       52
 6. Children (ages 2 through 12)         119      133     217      274      69       49
 7. College students; average
    percentage of male characters
    obtained from individual
    dream series                          30       34     543      629      68       55
 8. Neurotic patients                     15              400               64
 9. Epileptic Negro patient (age 25)       1              340               64
10. White adult (age 37-52)                1              300               65
11. College students                     280              280               65
All groups, except Group 7, combined    1399     1418    3874     3065      64       52


3. The same or approximately the same per- 
centage of male characters is found for neu- 
rotic males as for normal males (Group 8 
compared with Groups 1—6). 

4. The same result is found for two long 
dream series of two males as for collections of 
individual dreams (Groups 9 and 10 compared 
with Groups 1-6). 

5. Approximately the same percentage of male 
characters is found for a young adult Negro
epileptic man and a middle-aged white man 
(Group 9 compared with Group 10). 

That this differential proportion of male and 
female characters in dreams may be a uni- 
versal phenomenon is suggested by evidence from 
the dreams of two widely separated ethnic 
groups, the Hopi of southwest United States 
and the Yir Yoront of Cape York, Australia. 
The Hopi dreams were collected by Dorothy
Eggan during the years 1939-1942, and the Yir
Yoront dreams by R. L. Sharp during 1933-1935.
We are indebted to Dorothy Eggan and David
Schneider for providing us with these collections.


TABLE 2
MALE AND FEMALE CHARACTERS IN HOPI DREAMS

Dreamer    Age    Number of    Number of male    Number of female    Percentage of
                  dreams       characters        characters          male characters
Males
  1         60       24             49                  5                  91
  2         38       18             36                 11                  77
  3         36       10             10                  4                  71
  4         36        6              6                  3                  67
  Total              58            101                 23                  81
Females
  1         60       18             11                 14                  44
  2         28       60             56                 60                  48
  3         36       18             21                 20                  51
  Total              96             88                 94                  48

The Hopi collection consists of the dreams 
of four adult males and three adult females. A 
character analysis of each dream series was 
made and results are presented in Table 2. 

Although the percentage of male characters in 
the dreams of Hopi males is considerably higher 
than any of the groups in Table 1, the percentage 
for Hopi females is about the same as for 
white females. 

The Yir Yoront dreams were collected from
43 men and 8 women. The men contributed 139 
dreams; one man contributed 14, one 11, and 
the rest 1-5 each. The 8 women contributed 10 
dreams. An analysis was made only for the men's 
dreams (see Table 3). 

The percentage of male characters is higher 
than it is for the white male groups and lower 
than the Hopi males. 

In spite of the differences between these two 
ethnic groups and the white samples, the fact 
remains that Hopi and Yir Yoront males dream 
more of males than they do of females, and Hopi 
females dream of males and females with 
equal frequency. 

TABLE 3
MALE AND FEMALE CHARACTERS IN YIR YORONT MALE DREAMS

Number of    Number of    Number of male    Number of female    Percentage of
dreamers     dreams       characters        characters          male characters
   43          139            181                 67                  73


We would like to report three supplemental
studies which have some bearing on the foregoing
data. These studies were not done on dreams.


Study I


Thirty-five male college students and 40 female 
students were asked to write down the initials of 
people they like for 1 minute. After 1 minute 
they were asked to write down the initials of 
people they dislike. Half of the group started with 
liked and half started with disliked. They were 
then asked to identify the initials by age and sex. 
For males, the percentage of male names is 69; 
for females, the percentage of male names is 41. 


Study II


Nine short stories written by well-known con- 
temporary male writers and 18 stories written by 
female writers were read and the characters ap- 
pearing in the stories were classified by sex. One 
hundred and six single characters appeared in 
stories written by men; 80 of these are males, and 
26 are females, which yields a percentage of male 
characters of 75. One hundred and ninety char- 
acters were identified in the stories written by 
women; 103 of them are males, 87 are females, 
which gives a percentage of male characters of 54. 


Study III


This study was reported by Handel (1948). He 
conducted 100 depth interviews to determine the 
drawing power of male and female stars upon 
movie-goers. Seventy-six percent of the men inter- 
viewed expressed more interest in male stars whereas 
54% of the women showed more interest in female
stars. In this connection, David Ogilvy of New 
York City informs us that when he was working 
with George Gallup in the early forties, they dis- 
covered that the 10 most popular movie stars with 
male movie-goers were all men, whereas the 10 
most popular movie stars with female movie-goers 
were equally divided among men and women. 


Discussion 


What is the explanation for this ubiquitous 
sex difference? Why do men dream more about 
other men whereas women dream about men 
and women with almost equal frequency? 

Virtually all of the current dream theories 
with which the writers are familiar stress the 
principle that dreams are primarily concerned 
with the preoccupations, anxieties, conflicts, and 
unsatisfied wishes of the dreamer. We dream 
during the night as we think during the day 
about our unresolved problems.

If this is the case, then our findings suggest 
that the unresolved problems of males center 
more around their relationships with men than 
with women, and those of females are focused
upon their relationships with both sexes about
equally.

We have some evidence which supports this
hypothesis. An analysis of aggressive interactions 
between the dreamer and male and female 
dream characters shows that male dreamers 
have a higher proportion of aggressions with 
other males whereas female dreamers divide 
their aggressions about equally between male 
and female characters. These data are reported 
by Hall and Domhoff (1962a). A comparable 
analysis for friendly interactions revealed that 
male dreamers have a higher proportion of 
friendly encounters with females than with male 
characters, whereas female dreamers have about 
an equal proportion of friendly interactions with 
male and female characters. These data are 
reported by Hall and Domhoff (1962b). 

If it is assumed that aggression is a sign of 
conflict and that friendliness is not, then the 
foregoing data support the hypothesis that men 
dream more about men because male-male 
relationships produce more conflict than male- 
female relationships, and that women dream 
about males and females with almost the same 
frequency because their relationships with both 
sexes are equally conflictual. 

This hypothesis derives theoretical support 
from Freud’s theory of the male and female 
Oedipus complex. Briefly stated, this theory 
asserts that the first conflict of the child is 
with the father who is seen as an intruder in 
the satisfying relationship between mother and 
child. In the course of development, the girl,
because she blames her mother for not having
a penis, turns to the father. Her original positive
feelings for the mother and negative feelings
for the father persist, however, and account
for feelings of ambivalence towards both males
and females. The boy, on the other hand, undergoes
no such reversal in object cathexis. Conflict
with the father and, as a result of identification
and displacement, conflict with other males
continue throughout life. His relationships with
women are fraught with fewer intense conflicts 
because he does not ordinarily turn against his 
mother with the bitterness that a girl does.


RELATIONSHIP OF EGO DISJUNCTION AND MANIFEST ANXIETY
TO CONFLICT RESOLUTION


DENNIS K. KAMANO¹
Galesburg State Research Hospital 


The purpose of this study was to discover the influence of 2 determinants, 
ego disjunction (ED) and manifest anxiety (MA), on conflict resolution. 
ED, defined as antagonistic response tendencies within an individual, was 
measured by the Edwards Personal Preference Schedule, and MA by the 
Taylor MA scale. From a sample of 56 college Ss, it was found that Ss high 
in both ED and MA required significantly more time to resolve approach- 
approach and avoidance-avoidance conflicts than Ss high in only one 
determinant but low in the other and Ss low in both determinants. No 
significant differences in conflict scores were found when Ss low in both 
determinants were compared with Ss high in only one determinant but low 
in the other. The results were interpreted as indicating that ED and MA 
interact, producing interfering responses which hinder efficient resolution of 
conflicts when ED and MA are both high within an individual. High MA 
and high ED scores, when taken alone, did not yield any conclusive results. 
It is only when both determinants are taken into consideration that greater 
confidence can be invested in those measures to predict conflict performance. 


This study represents an effort to discover 
the relationship of two personality variables, 
ego disjunction and manifest anxiety, to be- 
havior in conflict situations. Trehub (1959) 
developed a measure of ego disjunction, defining 
the concept as “the concurrence in an individual 
of two or more relatively long enduring needs 
or characterological dispositions with mutually
incompatible objectives and with a relatively 
high level of joint strength" (p. 191). Ego dis- 
junction was measured by the Edwards (1954) 
Personal Preference Schedule, on which four 
incompatible need pairs were taken to indicate 
the degree of ego disjunction. Each incompatible 
need pair represented a disjunction indicator, 
and a need pair (e.g., need Aggression-need
Deference) was considered disjunctive if the 
intensity of both needs was high within an in- 
dividual. Ego disjunction was regarded by 
Trehub as “a particular kind of intrapsychic 
conflict” (p. 191). Since ego disjunction may 
be regarded as an indicant of intrapsychic con- 
flict, or as antagonistic response tendencies within 
an individual, we would expect it to be related 
to behavior in conflict situations. However, since 
the method of measurement of ego disjunction is 
relatively recent, data concerning its empirical
correlates are not available. The present study
proposes to investigate the relationships of ego
disjunction and manifest anxiety to conflict
behavior.


¹ The author wishes to express his appreciation
to Philip M. Merikle and Robert Peters for their
assistance in the collection of the data.

According to Taylor (1953, 1956), scores on 
the Manifest Anxiety (MA) scale are related to 
the individual’s level of emotionality or drive. 
Applying certain notions from Hullian theory, 
she has found that high drive level facilitates 
performance where a single response tendency 
is dominant and inhibits performance on tasks 
where competing response tendencies are evoked. 
Since increase in drive raises the intensity of 
the competing response tendencies evoked by a 
conflict situation, it was assumed that high MA 
scorers experience a greater degree of inter- 
ference and blocking than low MA scorers. Since 
ego disjunction determines the likelihood that 
a given situation will produce conflict, High Ego 
Disjunctive (HED) subjects should be less 
efficient in resolving conflict situations. 

On the assumptions that HED subjects emit 
more interfering responses (i.e., oscillation),
and high MA scorers experience more response 
interference in a conflict situation, and assuming 
an interactive relation between ego disjunction, 
emotionality, and the conflict task variables, 
several predictions may be derived. With the 
conflict task variables held constant for all 
subjects, the effect of either of the variables— 
degree of ego disjunction or level of emotion- 
ality—depends on the level of the other. The 
specific predictions of interaction are stated
below.

1. It was predicted that HED-High Anxious
(HA) subjects would require more time to re- 
solve successively presented approach-approach 
and avoidance-avoidance conflict situations than 
Low Ego Disjunctive (LED)-Low Anxious (LA), 
HED-LA, and LED-HA subjects. 

2. It was predicted that there would be 
relatively little difference in performance between 
HED-LA subjects and LED-HA subjects in the 
two conflict situations, but that the LED-LA 
subjects would require less time to resolve the 
conflicts than the other subjects. 


METHOD 


The Edwards Personal Preference Schedule
(EPPS) and MA scale were administered to groups 
varying in size from 3 to 10. A total of 76 male 
undergraduate students ranging in age from 18 to 21 
at Knox College were tested and classified, leaving 
56 subjects for the experiment proper. The 56 
subjects were classified as either High or Low in 
ED, and within each of these groups, either High 
or Low in manifest anxiety on the basis of their 
scores on the MA scale. The measures of ED were 
obtained by summing over four disjunctive need 
pairs (aggression-deference, succorance-nurturance, 
autonomy-abasement, and order-change) in the 
EPPS, following the procedure outlined by Trehub 
(1959). The T score distribution for college students 
on the EPPS was used to compute the ED scores. 
The degree of ED was considered to be reflected 
in the magnitude of the scores. 

ED scores for the 56 subjects ranged from 0 to 59, 
with a median of 19.5. Subjects with scores falling 
below 20 were designated as Low ED subjects, 
while those with scores of 20 and above were 
designated as High ED subjects. MA scale scores 
secured from the same subjects ranged from 3 to 35, 
with a median of 16.5. Subjects whose scores 
were less than 17 were classified as LA subjects, 
while those with scores of 17 and above were 
classified as HA subjects. On the basis of the 
distribution of scores on both measures the sub- 
jects were classified into four different groups, each 
composed of 14 subjects: High Ego Disjunction- 
High Manifest Anxiety (HED-HMA), High Ego 
Disjunction-Low Manifest Anxiety (HED-LMA), 
Low Ego Disjunction-High Manifest Anxiety (LED- 
HMA), and Low Ego Disjunction-Low Manifest 
Anxiety (LED-LMA). Table 1 presents the mean 
ED and MA scores in each of the four groups. 
No significant differences in mean ED scores were 
noted between the groups both high in ED, as 
well as the groups both low in ED. No significant 
differences in mean MA scores were noted when 
similar comparisons were made with the mean MA 
scores.
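
The median-split classification described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the author's procedure: the score pairs are hypothetical, the function name `split_labels` is invented for the example, and scores strictly above the sample median are labeled High, which for whole-number scores and a half-point median (19.5 or 16.5) matches the cutoffs reported in the text.

```python
from statistics import median

# Hypothetical (ED, MA) score pairs for illustration only; not the study's data.
subjects = [(28, 25), (31, 10), (12, 22), (9, 11),
            (24, 30), (22, 8), (15, 19), (5, 14)]


def split_labels(scores, high, low):
    """Label each score as high or low relative to the sample median."""
    cut = median(scores)
    return [high if s > cut else low for s in scores]


ed_labels = split_labels([ed for ed, _ in subjects], "HED", "LED")
ma_labels = split_labels([ma for _, ma in subjects], "HMA", "LMA")

# Crossing the two splits yields the four experimental groups.
groups = [f"{e}-{m}" for e, m in zip(ed_labels, ma_labels)]

for (ed, ma), group in zip(subjects, groups):
    print(f"ED={ed:2d}  MA={ma:2d}  ->  {group}")
```

Two independent median splits do not by themselves guarantee equal cell sizes; the hypothetical scores here were chosen so that each group receives two subjects.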

For the approach-approach and avoidance-avoidance
conflict series, each subject was tested
individually. Approach-approach and avoidance-avoidance
conflicts were created by replicating the
procedure outlined by Arkoff (1957), to whom
the reader is referred for a detailed description.


TABLE 1
MEAN SCORES IN EACH OF THE EXPERIMENTAL GROUPS 


Group Mean ED score Mean MA score 
HED-HMA 28.93 24.57 
HED-LMA 28.50 11.21 
LED-HMA 13.00 22.58 
LED-LMA 10.29 10.57 


Briefly, seven personal characteristics (adjustment, 
attractiveness, health, intelligence, popularity, talent, 
and wealth) were systematically paired to create 
conflicts. In the approach-approach conflicts, each 
subject was required to select the alternative that 
he desired to a greater degree than he had at 
present, and in the avoidance-avoidance conflicts 
he was required to select the alternative that he 
would rather have in lesser degree than he had 
at present. Forty-two conflicts, 21 describing ap- 
proach-approach situations and 21 describing avoid- 
ance-avoidance situations were presented. The 
amount of time to resolve the conflicts was the 
measure used in this study.
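
The count of 21 conflicts per type follows from pairing seven characteristics two at a time. The Python sketch below assumes that each unordered pair was used once for each conflict type; that assumption is consistent with the totals reported but is not spelled out in the text, and the variable names are illustrative.

```python
from itertools import combinations

# The seven personal characteristics named in the text.
characteristics = [
    "adjustment", "attractiveness", "health", "intelligence",
    "popularity", "talent", "wealth",
]

# Assumption: every unordered pair of characteristics forms one conflict item.
pairs = list(combinations(characteristics, 2))

# Each pair is presented once as approach-approach and once as avoidance-avoidance.
conflicts = [
    (kind, a, b)
    for kind in ("approach-approach", "avoidance-avoidance")
    for a, b in pairs
]

print(len(pairs), "pairs per conflict type;", len(conflicts), "conflicts in all")
```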


RESULTS 


Table 2 presents the mean resolution times 
for each of the conflicts by the four experimental 
groups. To test the significance of the results 
obtained, the conflict scores were subjected to 
a Type III design (Lindquist, 1953, p. 281) 
which took into account subject variance. Table 
3 summarizes the analysis of variance of the 
resolution times for approach-approach and 
avoidance-avoidance conflicts of the four ex- 
perimental groups. Looking first at the conflict 
variable, the significant main effect for con- 
flict indicates that approach-approach conflicts 
required significantly less time to resolve than 
avoidance-avoidance conflicts for all groups. Also 
of note in this analysis is the significance of the 
variance contributed by the interaction of con- 
flict conditions with ED. This indicates that the 
conflict effects are more variable at one level 
of ED than the other. In this case, the variance 
of the interaction effect was greater at the HED 
level with avoidance-avoidance conflict. Avoidance-avoidance
conflicts, as contrasted with
approach-approach conflicts, are more conflictual
at both ED levels, but their interactive effect is
more marked at the HED level than at the LED
level. No significant interaction was noted for
the conflict condition with the MA levels.


TABLE 2
MEAN RESOLUTION TIMES FOR APPROACH-APPROACH
AND AVOIDANCE-AVOIDANCE CONFLICTS

                       Conflict
Group        Approach-Approach    Avoidance-Avoidance
HED-HMA           160.64                197.43
HED-LMA           123.07                161.07
LED-HMA           124.14                148.71
LED-LMA           133.07                166.36

Note.—In seconds.


TABLE 3
ANALYSIS OF VARIANCE FOR CONFLICT
RESOLUTION TIME DATA

Source                 df          SS            MS           F
Between subjects       55      198,462.64
  ED                    1        8,557.50     8,557.50       2.64
  MA                    1        3,924.72     3,924.72       1.21
  ED X MA               1       17,675.45    17,675.45       5.46*
  Error (b)            52      168,304.97     3,236.63
Within subjects        56       37,188.42
  Conflict (C)          1       30,789.72    30,789.72     284.56**
  C X ED                1          501.52       501.52       4.64*
  C X MA                1          172.51       172.51       1.59
  C X ED X MA           1           98.43        98.43        .91
  Error (w)            52        5,626.24       108.20
Total                 111      235,651.06

*p < .05.
**p < .01.

As Table 3 shows, when the analysis of the 
data was confined to the ED and MA variables, 
respectively, no significant differences were 
found. No significant differences in conflict 
resolution times were noted between subjects high 
and low in ED, and between subjects high and 
low in MA. Of major interest in this analysis 
is the significance of the variance contributed
by the interaction of ED with MA to the total 
variance of conflict scores. The four experimental 
groups based upon differences in ED and MA 
combinations were differentiated, and the data 
were further analyzed by the application of the 
t test for significant difference between conflict
mean scores. It was found, as shown in
Table 4, that there were no significant differences
among means for the LED-LMA, HED-LMA,
and LED-HMA groups in both approach-approach
and avoidance-avoidance conflicts.
However, significant mean differences were found
when the LED-LMA, HED-LMA, and LED-HMA
groups were compared with the HED-HMA
group.


TABLE 4
SUMMARY OF t VALUES FROM MEAN MEASURES OF
APPROACH-APPROACH AND AVOIDANCE-AVOIDANCE
CONFLICTS

Group      HED-HMA    HED-LMA    LED-HMA        LED-LMA
HED-HMA       —       3.45**     3.01**         2.11*
HED-LMA    2.05*         —       [illegible]    [illegible]
LED-HMA    2.50**      .77          —            .83
LED-LMA    1.72*       .29        .91              —

Note.—Approach-approach above diagonal, avoidance-avoidance
below diagonal.
*p ≤ .05, one-tailed test.
**p ≤ .01, one-tailed test.

DISCUSSION 


The results of the present study suggest some 
clear relationships regarding the effects of ED 
and MA in the resolution of conflicts. The results 
were in agreement with Arkoff’s (1957) finding 
that approach-approach conflicts are more 
easily resolved than are avoidance-avoidance 
conflicts. The order of effectiveness of the two 
conflict situations is the same within each ED 
level, but the effect of avoidance-avoidance 
conflicts is more marked at the HED level. 
The MA variable revealed no such interaction 
effect with the conflict conditions. Further 
consideration of the role of ED in determining 
conflict behavior led to the finding that sub- 
jects high in ED did not require more time 
to resolve the conflicts than subjects low in ED. 
Confined to the ED variable alone and dis- 
regarding the MA variable, no significant dif- 
ference in conflict resolution times was found 
between the HED and LED subjects. Concerning 
the relationship of MA to conflict behavior, no 
significant difference was noted between HMA 
and LMA subjects in the resolution of conflicts. 
In a conflict situation where competing response 
tendencies are evoked, the results of previous 
studies would suggest that HMA subjects, as 
defined by the MA scale, should experience more 
difficulty in resolving conflicts than LMA sub- 
jects. Contrary to this expectation, no significant 
difference in conflict resolution times was found 
between the HMA and LMA subjects defined 
by the median of the distribution of scores on 
the MA scale. The MA variable in this study
was not effective in itself in producing differ- 
ential performance in the resolution of conflicts. 

Of the specific predictions about the inter- 
action effects of ED and MA in the resolution 
of conflicts, one was borne out clearly. The 
HED-HMA subjects required significantly more 
time to resolve approach-approach and avoid- 
ance-avoidance conflicts as compared with the 
LED-LMA, HED-LMA, and LED-HMA groups. 
This trend was expected since it had been 
assumed that successful resolution of conflicts 
is a function of two interacting determinants, 
degree of ED and level of MA. The result of 
the combined effect of HED and HMA was 
conceived as producing interfering responses 
which hinder efficient handling of conflicts.

It was also anticipated that subjects high 
in ED but low in MA, and subjects high in 
MA but low in ED would experience more dif- 
ficulty in resolving conflicts than subjects both 
low in ED and MA. The results yielded no
evidence to support this anticipation. If any
statement can be made, it is that high MA scale 
scores and high ED scores cannot be taken alone 
to predict performance in conflict situations 
with any high degree of confidence. It can 
also be said that this provides further support 
for the plausibility of the interaction hypothesis 
of ED and MA. When both determinants are 
taken into consideration greater confidence can 
be invested in those measures to predict conflict 
performance. The general results of this study 
suggest that, if a number of people are con- 
fronted with the problem of resolving conflict 
situations, each individual will perform in an 
idiosyncratic way dependent, to a great extent, 
on two determinants. On the one hand, an 
individual’s level of emotionality or drive de- 
termines the potency of any situation producing 
conflict, i.e., determines the intensity of conflict 
produced by a particular situation. On the 
other hand, the likelihood that a given situation 
will produce conflict depends upon the individual's
characteristic ways (i.e., response tendencies)
of dealing with such situations. If
antagonistic response tendencies are prominent
within an individual's response hierarchy,
more situations are likely to be conflictual in
nature. Factors both of emotionality and of
response tendencies therefore need to be included
in the systematic study of conflict.


CONFORMING BEHAVIOR AND PERSONALITY VARIABLES
IN COLLEGE WOMEN¹


MORTIMER H. APPLEY AND GEORGE MOELLER
York University and United States Naval Submarine Base, New London 


Conforming behavior of 41 college freshman women, as measured in an 
Asch situation, was compared with the scales of the Edwards Personal 
Preference Schedule, the Gough California Psychological Inventory, and the 
Gordon Personal Profile. In the sample studied, only the Edwards’ Abasement 
scale was able to generate a small but significant relationship (+ .33) with a 
kind of behavior which should have been predictable from some 12 to 
perhaps 27 of the 38 measures used. Excepting the 5 scales of Gough’s 18, 
on the basis of questionable comparability of samples, a total of 33 measures 
was correlated with conforming behavior. Only 1 was found to have 
construct validity for this type of social conformity criterion when tested on 
a sample of college women.


Barron (1953) has reported some consistent 
self-descriptions of independents and yielders in 


¹ The work on which this paper is based was
completed while both authors were at Connecticut
College. The paper is a revision of Technical Report
No. 8, Project NR 172-228, Contract Nonr 996(02)
between the Group Psychology Branch, Office of
Naval Research, and Connecticut College. Permission
is granted for reproduction in whole or in part for
any purpose of the United States Government.

An earlier version of this paper was presented at 
meetings of the Eastern Psychological Association, 
Philadelphia, Pennsylvania, April 1958. 


the Asch situation using the Gough (1950) 
Adjective Check List. For male subjects, he 
found that yielders tended to characterize them- 
selves as obliging, optimistic, efficient, deter- 
mined, patient, and kind, whereas independents 
saw themselves primarily as original, emotional, 
and artistic. Crutchfield (1955) found independ- 
ents to have “more intellectual effectiveness, ego 
strength, leadership ability and maturity of social 
relations, together with a conspicuous absence of 
inferiority feelings, rigid and excessive self-control, 
and authoritarian attitudes” (p. 194). He sup-
ported these generalizations by citing the follow-
ing correlations with conformity scores: intel-
lectual competence (based on ratings), −.63;
Barron's Ego Strength scale, −.33; leadership
ability (based on ratings), −.30; Gough's (1956)
California Psychological Inventory scales of Toler-
ance, Sociability, and Responsibility, −.30 to
−.41 (unspecified). In addition he reported that
yielding is correlated +.39 and +.35 with the F 
Scale and ratings of authoritarian behavior in 
a psychodrama. 

Tuddenham (1959) recently presented addi- 
tional data on personality correlates of yielding, 
noting the marked sex differences which he and 
others have found. He concluded that the
“. . . yielding-independence variable is less
closely related in women to attitudes toward the
self and the sex role, and is hence somewhat less
predictable from other variables” (p. 294).

We elsewhere demonstrated a relationship be- 
tween conformity and motivation (Moeller & 
Applezweig, 1957), but we were there able to 
show some of the subtle ways in which the mo- 
tivational and situational variables could interact. 
We here report a study of the relationship be- 
tween conforming behavior in an Asch situation 
and 38 personality variables derived from three 
standard inventories on a sample of 41 college 
sophomore women.²


METHOD 


Conformity scores. The judgmental situation used 
to obtain conformity scores was the familiar one 
developed by Asch (1956). Each subject chose one 
of three comparison lines as matching a standard 
line in length after she had heard all but one member 
of the four- to six-member majority express their 
judgments regarding the correct comparison stimulus. 
The classmates of the subject who comprised the 
majority had been instructed to give specific errone-
ous responses on 12 of the 18 trials. The procedure is
described in detail in a previous paper (Moeller &
Applezweig, 1957) dealing with another aspect of
the experiment. The number of occasions on which
the subject agreed with, or compromised in the
direction of, the unanimous majority response when
the majority was wrong determined her conformity
score.

Personality data. Scores on the Gough (1956) 
California Psychological Inventory (CPI), the 


Gordon (1953) Personal Profile (PP), and the 
Edwards (1954) Personal Preference Schedule 
(EPPS) had previously been obtained on all 
subjects.


² We wish to acknowledge with thanks the par-
ticipation of Joan Lake Kaiser as experimenter in the
conformity measurement procedure and Ross R. 
Thomas in the personality measurement procedures. 

³ Since no preselection of sample was possible for
the personality scales used, we determined for each 




TABLE 1

DISTRIBUTION OF CONFORMITY SCORES SHOWING
CUTOFF LOCATIONS FOR THREE GROUPS

Conformity score        N

High group (N = 14)
    12                  2
    11                  3
    10                  2
     9                  2
     8                  5
Middle group (N = 13)
     7                  2
     6                  4
     5                  3
     4                  2
     3                  2
Low group (N = 14)
     2                  3
     1                  2
     0                  9

Note.—Total N = 41.


Procedure. Conformity scores of the 41 subjects 
were classified as high, middle, and low, with regard 
to frequency of compliance with the erring majority. 
The cutting scores used (see Table 1) would have 
placed Asch’s (1956) control group with our lows, 
his upper quartile with our highs (p. 11). F tests 
were computed for the three groups on the 38 
personality variables, and Spearman rank-order cor- 
relations, corrected for ties, were also computed for 
all measures against conformity score.⁴
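
The grouping and the F ratio described in this paragraph can be sketched in Python as follows. This is an illustrative sketch only, not the authors' computation: the cutoffs (8 or more yielding trials for the high group, 2 or fewer for the low group) are read from Table 1, while the function names and any scores passed to them are hypothetical.

```python
def classify_conformity(score, high_cut=8, low_cut=2, max_score=12):
    """Place one conformity score in the high, middle, or low group.

    The default cutoffs follow Table 1; max_score is the number of
    critical trials on which the majority gave an erroneous response.
    """
    if not 0 <= score <= max_score:
        raise ValueError(f"score must lie between 0 and {max_score}")
    if score >= high_cut:
        return "high"
    if score <= low_cut:
        return "low"
    return "middle"


def one_way_f(groups):
    """Return the one-way analysis-of-variance F ratio for lists of scores."""
    k = len(groups)                      # number of groups
    n = sum(len(g) for g in groups)      # total number of subjects
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, means) for x in g)
    # Raises ZeroDivisionError when there is no within-group variation.
    return (ss_between / (k - 1)) / (ss_within / (n - k))
```

With three groups and 41 subjects the resulting F would be referred to a table with 2 and 38 degrees of freedom; the significance lookup itself is left out of the sketch.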

Prediction of conformity. The manuals of all three
inventories claim that scale titles were chosen to 
represent meaningfully the content of each scale, and 
that the combined titles and descriptions could be 
literally interpreted. Those scales which can be 
understood to describe conforming behavior, then, 
should be expected to be correlated with our Asch 
conformity scores. 


scale whether we had sampled the full range of 
scores and, from a sampling point of view at least, 
obtained proper validating data. Where reservations 
were indicated they are mentioned in the Discussion. 

⁴ Since N is relatively small and the conformity
score distribution is obviously nonnormal, measures 
of relationship which would use all of the informa- 
tion in the raw data (such as r or eta) could not 
be applied. Thus the F test provides a measure of 
relationship, linear or nonlinear, which uses all of 
the information in the inventory scores, but not all in 
the conformity scores. The Spearman coefficient, as- 
suming linearity, uses more of the information in the 
conformity scores than the F test, but less of that 
in the inventory scores. 

⁵ Joan Bopp Shor independently evaluated each
scale description, as did each of the writers. The 
Expected relationships shown (Table 2) represent 
complete agreement, or, where two predictions are 
TABLE 2

MEANS, SIGMAS, AND TWO TESTS OF RELATIONSHIP (F AND r's) FOR HIGH, MIDDLE,
AND LOW YIELDING GROUPS ON 38 PERSONALITY SCALES

Group entries are M (σ). Tuddenham r = Tuddenham's (1959) product-moment r for college women (N = 37).

Scale                         High              Middle            Low               Expected a  F         r         Tuddenham r

Edwards PPS
Achievement                   13.92 (3.63)      11.08 (4.10)      13.79 (3.47)      NP-N        2.40      −.02
Deference                     12.93 (4.10)      10.85 (3.78)      14.43 (2.64)      P           3.19      .00
Order                         11.21 (4.77)      8.23 (2.81)       10.14 (3.18)      NP          2.07      −.05
Exhibition                    14.93 (2.66)      14.46 (3.54)      15.21 (3.55)      N           <1        −.17
Autonomy                      11.57 (3.92)      12.69 (4.10)      12.00 (3.72)      N           <1        −.12
Affiliation                   18.86 (4.39)      18.92 (3.99)      17.36 (4.61)      NP-P        <1        +.17
Intraception                  16.57 (5.18)      16.54 (5.33)      18.43 (5.43)      NP          <1        −.11
Succorance                    12.14 (3.74)      13.69 (4.51)      10.57 (5.64)      P           1.38      +.15
Dominance                     11.71 (4.83)      12.92 (5.69)      13.57 (6.04)      N           <1        −.10
Abasement                     19.14 (4.07)      17.62 (4.52)      15.07 (4.70)      P           2.81      +.33*
Nurturance                    16.21 (4.35)      17.69 (3.62)      15.93 (4.53)      NP          <1        [illeg.]
Change                        [illeg.]          [illeg.]          17.71 ([illeg.])  NP          <1        [illeg.]
Endurance                     13.86 (5.77)      10.23 (3.74)      14.07 (4.56)      NP          2.51      −.12
Heterosexuality               10.71 (5.15)      15.23 (3.81)      12.57 (4.50)      NP          3.12      .00
Aggression                    [illeg.]          [illeg.]          9.43 ([illeg.])   N           1.42      [illeg.]

Gough's CPI
Dominance                     26.86 (6.07)      26.54 (5.18)      27.57 (6.90)      N           <1        −.04      −.20
Capacity for Social Status    21.64 (4.08)      20.85 (2.38)      22.29 (2.74)      NP          <1        −.02      −.33*
Sociability                   24.14 (5.21)      25.77 (3.60)      25.36 (4.10)      N           [illeg.]  [illeg.]  [illeg.]
Social Presence               32.21 (5.75)      36.15 (5.59)      34.57 (4.42)      N-[illeg.]  [illeg.]  [illeg.]  [illeg.]
TABLE 2 (Continued)

Group entries are M (σ). Tuddenham r = Tuddenham's (1959) product-moment r for college women (N = 37).

Scale                         High              Middle            Low               Expected a  F         r         Tuddenham r

Gough's CPI (continued)
Self-Acceptance               21.29 (3.61)      21.62 (3.39)      22.21 (3.73)      N           <1        +.10      −.22
Sense of Well Being           7.00 (4.00)       6.00 (3.01)       6.14 (3.66)       NP          <1        −.02      −.10
Responsibility                34.14 (3.80)      32.08 (3.41)      33.21 (3.75)      P-NP        <1        +.01      +.07
Socialization                 12.79 (3.26)      13.46 (5.21)      14.64 (5.42)      P           <1        −.09      +.07
Self-Control                  18.71 (6.99)      20.85 (5.64)      18.29 (5.34)      NP          <1        −.12      .00
Tolerance                     23.86 (4.76)      24.92 (2.34)      24.36 (3.75)      NP          <1        +.11      −.21
Good Impression               19.79 (6.50)      19.46 (5.21)      20.93 (5.64)      P-NP        <1        +.07      [illeg.]
Communality                   1.64 (1.76)       2.54 (1.39)       1.71 (1.33)       NP          1.34      −.23      +.25
Achievement via Conformance   27.93 (3.81)      27.08 (3.97)      29.00 (3.55)      P-NP        <1        −.09      [illeg.]
Achievement via Independence  19.86 (4.09)      18.69 (2.70)      20.93 (3.13)      N           1.38      +.07      [illeg.]
Intellectual Efficiency       38.50 (4.98)      38.00 (2.77)      40.36 (3.64)      N           1.27      +.03      −.23
Psychological Mindedness      10.50 (2.23)      10.46 (2.76)      11.21 (1.78)      N           <1        +.01      −.22
Flexibility                   9.50 (2.97)       9.77 (2.97)       10.14 (1.88)      N           <1        −.02      [illeg.]
Femininity                    24.57 (3.29)      22.62 (2.59)      [illeg.]          [illeg.]    1.42      +.07      −.02

Gordon's PP b
Ascendancy                    0.17 (6.16)       3.67 (3.50)       1.15 (6.68)       N           1.05      −.01
Responsibility                6.08 (4.47)       5.38 (4.37)       6.38 (3.34)       NP-N        <1        −.09
Emotional Stability           8.75 (3.32)       6.50 (5.95)       5.38 (5.06)       NP-N        <1        .00
Sociability                   [illeg.] (5.89)   [illeg.] (2.98)   3.08 (5.28)       NP-P        3.69*     +.22
Total                         17.17 (12.66)     24.33 (9.34)      15.85 (14.47)     N-NP        1.53      +.02

a Based on scale description: P = positive relationship expected; N = negative relationship expected; NP = no prediction (see Footnote 5).
b PP scores were available on only 37 of the 41 subjects studied.
* p < .05.
RESULTS


Preliminary comparisons were made between 
score distributions of the present sample and 
norm populations for the 38 personality scores. 
No significant variance differences were found for 
the EPPS, though the sample mean for the 
Abasement scale was significantly higher than the 
norm mean. Our sample differed on only one of
the 18 Gough CPI scales—Self-Acceptance, the 
sample sigma being significantly smaller than that 
for the norms. Four additional scales showed mean 
differences between sample and norm distribu- 
tions, namely, Social Presence, Achievement via 
Independence, Intellectual Efficiency, and Flexi- 
bility. For the Gordon PP, inspection suggested 
no variance differences, with perhaps slightly 
higher sample means on all but the Ascendancy 
scale. Unfortunately, no significance tests were
possible because sigmas were not available for the 
norm population. 

Table 2 shows the means, sigmas, and tests of 
significance (Fs and r’s) for the high, middle, 
and low conformity groups on the 38 personality 
variables. Our interpretations as to which scales
might be expected to relate to conformity scores 
—on the basis of their authors’ descriptions (see 
Footnote 5)—and the expected direction of these 
relationships, are also shown. In addition, to en- 
able direct comparison, the comparable correla- 
tions for college women obtained by Tuddenham 
(1959) on the Gough CPI are shown in the far 
right column.⁶

EPPS. As can be seen from Table 2, none of 
the F tests of the Edwards scales is significant. 
However, when the conformity scores are com- 
pared with scale scores by use of Spearman rank- 


indicated, the first represents the opinion of two 
judges and the second the opinion of the remaining 
judge as to what the Expected relationship should 
be. In no case were opinions in disagreement as to 
the direction of prediction, although there were some 
questions as to whether a single statement in the 
scale description was sufficient to warrant a clear 
expectation of a directional relationship. 

⁶ Tuddenham (1959) also reported correlations
between EPPS scales and yielding. Those data, 
however, were obtained from a sample of former 
women members of the Adolescent Growth Study 
(AGS). Since the relationships between the Gough 
CPI scales and yielding were not the same for college 
women and the AGS women, it seems inappropriate 
to use the AGS data here as an approximation of 
the relationship between yielding and EPPS scores 
for college women. There is no correspondence 
between our set of correlations and those significant
tetrachoric correlations obtained by Tuddenham for 
the AGS subjects. 
order coefficients (corrected for ties, following
the procedure described by Siegel, 1956), a sig- 
nificant relation appears between conformity and 
Abasement scores (r = +.33, p < .05). 
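
A rank-order coefficient corrected for ties can be sketched as follows. The sketch uses the standard tie-corrected formula, in which each set of t tied observations reduces that variable's sum of squares by (t³ − t)/12; we assume, without being able to confirm it from the text, that this matches the procedure the authors followed. Function names are hypothetical.

```python
from collections import Counter
from math import sqrt


def midranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return ranks


def tie_term(values):
    """Sum of (t^3 - t) / 12 over every set of t tied observations."""
    return sum((t ** 3 - t) / 12 for t in Counter(values).values())


def spearman_tie_corrected(x, y):
    """Spearman rank-order correlation with the correction for ties."""
    n = len(x)
    rx, ry = midranks(x), midranks(y)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rx, ry))
    base = (n ** 3 - n) / 12
    sum_x2 = base - tie_term(x)
    sum_y2 = base - tie_term(y)
    # Undefined (division by zero) if either variable is constant.
    return (sum_x2 + sum_y2 - sum_d2) / (2 * sqrt(sum_x2 * sum_y2))
```

The correction matters here because the conformity scores contain many ties (nine subjects scored 0, for example); the result equals the product-moment correlation computed on the average ranks.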

CPI. Neither F tests nor Spearman coefficients 
reveal statistically significant relationships for 
any of the Gough scales and conformity. Linear 
trends are noted, in comparison by inspection of 
means, for scales of Self-Acceptance, Socializa- 
tion, and Flexibility, although these trends are 
not confirmed in the sizes of the Spearman co- 
efficients found (+.10, —.09, and —.02, respec- 
tively). As was the case for the EPPS, the cor- 
relations we found for the Gough scales do not 
correspond with those obtained by Tuddenham 
(1959). However, we must note that on three 
of the scales in which Tuddenham found signifi- 
cant relationships we have questioned the com- 
parability of our sample with that of the norm 
group (see above) and thus our failure to con- 
firm these correlations may be due to sampling 
differences. 

PP. Neither significant Fs nor Spearman rank- 
order correlations were found for scales of As- 
cendancy, Responsibility, Emotional Stability, or 
Total score and conformity. A significant mean 
difference for the Sociability scale cannot be 
interpreted simply, in light of the fact that the 
means neither increase nor decrease in linear 
fashion. The Spearman correlation for Sociability 
was not statistically significant. 


DISCUSSION


Findings of this study. One may naively infer 
from the scale titles and descriptions of from 12 
to possibly 27 of the 38 personality scales which 
compose the three tests used that they would 
yield predictions of conforming behavior. 

For the PP, the description of the Ascendancy 
scale, and perhaps those for Total, Responsibility, 
and Emotional Stability, should be expected to 
relate negatively with conformity. These expecta-
tions were not confirmed. The Sociability scale,
which might have been positively related, showed 
a significant F, but as was noted above, the 
scores for high conformers actually fall between 
those for low and middle conformers, so the 
meaning of the relation is not clear. 

For the CPI, clear predictions were inferrable 
from scale descriptions of Socialization—positive 
—and Dominance, Sociability, and Psychological 
Mindedness—all negative. Although the sampling 
seemed to be adequate for these scales, none was
found to be related to conformity. Negative 
relations might also have been expected between 
conformity and the scales for Self-Acceptance, 
Achievement via Independence, Intellectual Ef-
ficiency, and Flexibility. The failure to find rela-
tionships for those scales might be attributed
to sampling differences between the norm samples 
and ours. The same explanation might be given 
for Social Presence, where a possible negative 
relation was not confirmed. 

The sampling for the scales of Responsibility, 
Good Impression, Achievement via Conformance, 
and Femininity does seem adequate, however, 
and the failure to find the positive correlations 
predicted for them would appear to cast doubt 
on their construct validity.⁷

In the Edwards schedule, scales of Deference, 
Succorance, and Abasement should have been 
positively related to conformity. Only Abasement 
showed a relationship in the predicted direction, 
though it may be inappropriate to generalize this 
finding, since our sample had a significantly 
higher mean than did the norm group. Negative 
relationships to conformity should have been 
expected for the scales of Exhibition, Autonomy, 
Dominance, and Aggression. None of these rela- 
tionships was found in our sample. No relation- 
ship was found for Achievement and Affiliation 
scales, either, when a possible relationship with
conformity might have been expected. 

To summarize our findings, then, we can report 
only one statistically significant relationship be- 
tween a personality measure and yielding be- 
havior: for the EPPS Abasement scale (+.33). 
Even this finding is questionable, however, in 
light of sampling differences noted above. 

Comparison of results. How do our findings 
compare with those of others? We have noted 
already that Crutchfield (1955) found statisti- 
cally significant negative correlations between 
three scales of the CPI (Tolerance, Responsibil-
ity, and Sociability ⁸) and yielding for adult males.
Tuddenham obtained six such correlations for a 
sample of adult males, but only one of the six 
(Responsibility) replicates Crutchfield’s results. 
Unfortunately, this correlation does not hold up 
for the college male and two female samples 
Tuddenham studied. In addition, only one CPI 
scale (Achievement via Independence) was re-
lated to yielding for Tuddenham's college men.
A similar lack of cross-sample agreement is found
on comparison of the CPI correlations with


⁷ The remaining scales of the CPI were not con-
sidered relevant to prediction of conforming behavior
on the basis of their published descriptions. No
relationships were in fact found.

⁸ Crutchfield (1955) refers to Gough’s measure of
Social Participation. Since the completion of his
study that scale has been renamed Sociability, but,
so far as we can tell, has not been changed in any
other way.
yielding for Tuddenham’s two samples of women
(adults and college students). Only one of the 
four significant negative correlations for college 
women (Capacity for Social Status) was repli- 
cated in the three reliable correlations reported 
for his sample of adult women. In the six 
samples, Crutchfield’s, Tuddenham’s, and ours, 
there are only two CPI scales (Achievement via 
Independence and Capacity for Social Status) 
which yield significant correlations for as many 
as three samples, and all of those samples come 
from Tuddenham’s study. 

As with the CPI, the stability of correlations 
between yielding and EPPS scores across samples 
is not impressive. Tuddenham (1959), Izard 
(1960), DiVesta and Cox (1960), Endler (1961), 
and we have studied the relation of the 15 EPPS 
scales to acquiescence. Across seven samples (our 
college women, Endler’s college males, DiVesta's 
undergraduates, Izard's college men and women, 
and Tuddenham's adult men and women) 10 of 
105 correlations differ significantly from zero, 
and only three pairs of those correlations are 
mutually confirmatory—Achievement is nega- 
tively correlated with conformity for Tudden- 
ham’s sample of adult women and DiVesta's 
undergraduates, Dominance is related negatively 
to conformity for Tuddenham's adult women and 
Izard's college men, and Order is positively re- 
lated to acquiescence for the latter groups. Since 
several authors, e.g., Izard, have tried to explain 
away discrepancies between correlations of per- 
sonality and conformity for men and women, it 
is difficult to argue that these replications, found 
on comparison of male and female samples, pro- 
vide convincing evidence for EPPS utility in 
prediction of conformity. In addition, two of 
the three replicated correlations arise from the 
studies which differed most methodologically 
from each other. Izard employed a Sherif-type 
task (Sherif, 1935) in which pressure to conform 
came from the experimenter in a subtle point in 
the instructions to the subject, rather than a 
real, or putative group; Tuddenham used a vari- 
ant of the Asch-type task, as did DiVesta and 
Cox, Endler, and the authors of this report. 
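
As a rough check on the tally above, one can ask how many of the 105 correlations (7 samples by 15 EPPS scales) would be expected to reach significance by chance alone. The sketch below assumes, purely for illustration, that every test was evaluated at the .05 level and that the tests are independent. Neither assumption is stated in the studies reviewed, and correlations among scales of a single inventory are unlikely to be fully independent, so the figures are only a loose benchmark. Function names are hypothetical.

```python
from math import comb


def expected_chance_hits(n_tests, alpha=0.05):
    """Expected number of 'significant' results when every null is true."""
    return n_tests * alpha


def prob_at_least(k, n_tests, alpha=0.05):
    """Binomial probability of k or more chance hits in n independent tests."""
    return sum(
        comb(n_tests, i) * alpha ** i * (1 - alpha) ** (n_tests - i)
        for i in range(k, n_tests + 1)
    )


n_tests = 7 * 15  # seven samples, fifteen EPPS scales each
chance_hits = expected_chance_hits(n_tests)
prob_ten_or_more = prob_at_least(10, n_tests)
```

Under these assumptions about five of the 105 correlations would be expected to reach significance by chance, which gives a baseline against which the observed 10 can be judged.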

Implications. The simplest hypothesis regarding
the discrepancies between and within studies is 
that the samples in some, or all, of them were 
biased with respect to each other and the norma- 
tive groups. Since this is the only report among 
those considered here which presents such sample 
comparison information (see Footnote 3), it is 
impossible to evaluate that hypothesis. 

A more meaningful explanation of the appar- 
ently contradictory findings is implicit in the 
discussion of motivation and conformity pre-
sented by Moeller and Applezweig (1957). The
substance of that discussion is that conformity 
per se is an act in the service of the individual's 
motivations—a means to an end rather than an 
end in itself for the normal person. Tuddenham 
(1957), Beloff (1958), Hollander (1960), and 
Endler (1961) present related views. In explana- 
tion of the differences in patterns of conformity 
and the relations of those patterns to other 
measures for men and women, Beloff ⁹ states that
further study of acquiescence should be con- 
cerned with “. . . clarification of the place of 
conformity in general personality organization 
and, more particular, with the meaning and 
significance that conformity has for men and 
women .. . to elucidate the part that conform- 
ity plays in their adjustment to their respective 
roles. This meaning or significance would appear 
to be very different for the two sexes” (p. 103). 
Tuddenham (1957) concludes, “The subject’s final 
choice (to agree with his peers or not) is the re- 
sult of numerous and complexly interacting varia- 
bles, . . . The outcome, whatever it may be, repre- 
sents the best solution for the particular subject, 
given his particular weightings of all the relevant 
variables” (p. 18). And Hollander writes: “Con-
formity behavior does not lend itself to segmental
treatment as either a feature of . . . personality
. . . or of the situational field in which the indi-
vidual is immersed. It would seem rather to be
a combined function of prevailing situational 
conditions and stable states which are likely to 
yield certain behavioral outcomes, at times in 
accordance with more or less explicit expectancies 
of relevant others” (p. 222). 

If it is true that conformity is a means, not 
an end, and that enduring personality character- 
istics should be conceptualized in terms of goals, 
then conformity is only one of many ways in 
which the more fundamental aspects of personal- 
ity may be expressed. Adults and adolescents, 
men and women, and other differentiable groups 
within the general population, may employ dif-
ferent means to achieve the same goals or similar
means to reach different goals. Unless particular 
cues were present to elicit certain types of re- 
sponse modalities (e.g., deferent behavior, suc- 
corant behavior, etc.) we would not expect to 


⁹ Beloff (1958) makes the further distinction
between acquiescence and conventionality as two 
forms of social conformity. We would expect that 
our study is concerned primarily with the former 
of these to the extent that the type of judgment 
required is free from social values in its content. 


find such behavior necessarily in the repertoire
of an individual who in another situation con- 
forms to the norm of a group response. It must 
be expected that attempts to replicate successful 
predictions of acquiescence from measures of 
supposedly enduring personality characteristics 
must meet with only occasional and adventitious 
success. 
A CROSS-CULTURAL STUDY OF CORRELATES OF CRIME ¹


MARGARET K. BACON, IRVIN L. CHILD, and HERBERT BARRY, III ²


Yale University


In a sample of 48 nonliterate societies, frequency of theft and personal crime 
were separately correlated with a number of variables which were suspected
to be causal factors in the development of crime. Lack or limitation of
opportunity for the young boy to form an identification with his father was 
associated with both types of crime. A high degree of socialization anxiety 
in childhood and a high degree of status differentiation in adulthood were 
significantly associated with theft only; a general adult attitude of suspicion 
and distrust was more decidedly associated with personal crime. 


A number of researchers have analyzed the 
sociological and psychological background of 
delinquents and criminals and compared them 
with a noncriminal control population, in 
order to discover what conditions give rise 
to criminal behavior; for a recent review, see 
Robison (1960). The present paper reports 
on variations among a sample of preliterate 
societies in the frequency of crime, in order 
to determine what other known features of 
these societies are associated with the occur- 
rence of crime. The cross-cultural technique 
(Whiting, 1954), in which each society is 
taken as a single case, is a unique method 
for studying crime and has certain advan- 
tages: The index of frequency of crime in a 
society represents the average among its many 
individuals and over a span of many years, 
so that the measure is likely to be more stable 
and reliable than a measure of criminal tend- 
ency in a single individual. Some of the cul- 
tural features which may be related to crime 
show wider variations among societies than 
within a single society, permitting a more 
comprehensive test of their significance.
Results which are consistent in a number of
diverse societies may be applied to a great
variety of cultural conditions instead of being
limited to a single cultural setting. 

If certain cultural features foster the de- 
velopment of criminal behavior, they should 
be found preponderantly in societies with a 
high frequency of crime; factors which 
inhibit crime should be found largely in 
societies which are low in crime. Thus the 
cross-cultural method may help us discover 
psychological and sociological variables which 
have a causal relationship to the development 
of crime; the importance of these variables 
may then also be tested intraculturally. On
the other hand, variables identified as possible
causes of crime within our society may be
tested for broader significance by the cross- 
cultural method. 

The possible causal factors which we have 
explored are principally concerned with child 
training practices, economy, and social struc- 
ture. Hypotheses concerning these factors, as 
they have been presented by other writers 
or as they have occurred to us, will be 
described in connection with the presentation 
of our results. 


METHOD 


Sample. The sample used in this study consists of 
48 societies, mostly preliterate, scattered over the 
world. They were taken from a larger group of 110 
societies which were selected on the basis of geo- 
graphical diversity and adequacy of information on 
aboriginal child training practices. The present sample 
of 48 consists of those societies whose ethnographies 
were searched and found to provide sufficient in- 
formation to permit comparative ratings on criminal 
behavior by three independent research workers.³


³ The 48 societies included in the study are as
follows: Africa—Ashanti, Azande, Bena, Chagga, 
Dahomeans, Lovedu, Mbundu, Thonga; Asia— 
Ratings. We have included two types of crime in
our study: theft and personal crime. These two were 
chosen because they are relatively easy to identify 
and almost universal in occurrence. Also, they repre- 
sent two quite different types of behavior. Thus we 
are able to clarify antecedents common to both types 
of crime and those characteristic of only one. Judg- 
ments were always made in relation to the norms of 
the culture under consideration. Theft was defined 
as stealing the personal property of others. Property 
included anything on which the society placed value, 
whether it was a whale’s tooth or a song. Personal
crime was defined by intent to injure or kill a 
person; assault, rape, suicide, sorcery intended to 
make another ill, murder, making false accusations, 
etc., were all included. 

The method of comparative ratings was used to
obtain measures of frequency. Three raters independ- 
ently analyzed the ethnographic material on each 
society and made ratings on a seven-point scale as 
to the relative frequency of the type of crime under 
consideration. Thus a rating of 4 on theft would
mean that the frequency of theft in a given society 
appeared to be about average for the sample of 
societies. Ratings of 5, 6, and 7 represented high 
frequencies and those of 3, 2, and 1 were low. 
Societies in which the behavior did not occur were 
rated as 0. Each rating was classified as confident
or doubtful at the time that it was made. No rating 
was made if the analyst judged the information to 
be insufficient. We have included all societies on 
which all three analysts made a rating, whether it 
was confident or doubtful, and we have used the 
pooled ratings of all three analysts. The reliability 
of these pooled ratings is estimated as +.67 for 
Theft and +.57 for Personal Crime. These estimates 
were obtained by averaging (using a z transforma- 
tion) the separate interrater reliabilities, and entering 


Andamanese, Baiga, Chenchu, Chukchee, Lepcha, 
Muria, Tanala, Yakut; North America—Cheyenne, 
Comanche, Flatheads, Hopi, Jamaicans (Rocky 
Roaders), Kaska, Kwakiutl, Navaho, Papago, 
Tepoztlan, Western Apache; South America—Aymara, 
Cuna, Jivaro, Siriono, Yagua; Oceania—Arapesh, 
Balinese, Buka (Kurtachi), Ifaluk, Kwoma, Lau 
Fijians (Kambara), Lesu, Manus, Maori, Puka- 
pukans, Samoans, Tikopia, Trobrianders, Trukese, 
Ulithians, Vanua Levu (Nakoroka). All information
was obtained from ethnographic studies available in
the literature or in the Human Relations Area Files.
Ratings were, so far as possible, of the aboriginal
practices of the group in order to reduce the influ-
ence of acculturation. All ratings used in this study
have been filed with the American Documentation
Institute. Order Document No. 7450 from the ADI
Auxiliary Publications Project, Photoduplication
Service, Library of Congress, Washington 25, D. C.,
remitting in advance $1.75 for microfilm or $2.50
for photocopies. Make checks payable to: Chief,
Photoduplication Service, Library of Congress. All
intercorrelations among variables for our sample of
societies appear in the same document.
this average into the Spearman-Brown correction
formula. 
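
The reliability estimate described here can be sketched as follows: the pairwise interrater correlations are averaged through Fisher's z transformation, and the average is stepped up with the Spearman-Brown formula for the number of raters pooled. This is a sketch under stated assumptions: the separate interrater correlations are not reported in the text, so any values passed in are hypothetical, as are the function names.

```python
from math import atanh, tanh


def mean_r_via_z(correlations):
    """Average correlations by transforming to Fisher's z and back."""
    mean_z = sum(atanh(r) for r in correlations) / len(correlations)
    return tanh(mean_z)


def spearman_brown(single_rater_r, n_raters):
    """Reliability of the pooled ratings of n_raters judges."""
    return n_raters * single_rater_r / (1 + (n_raters - 1) * single_rater_r)


def pooled_reliability(pairwise_correlations, n_raters=3):
    """Average the pairwise reliabilities, then apply Spearman-Brown."""
    return spearman_brown(mean_r_via_z(pairwise_correlations), n_raters)
```

With three raters there are three pairwise correlations. If all three were .50 (a hypothetical value), the pooled reliability would be 3(.50) / (1 + 2(.50)) = .75.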

Most writers in this field make a distinction 
between delinquency and crime, largely on the basis 
of the age of the offender. The nature of our evi- 
dence does not permit us to make such a clear 
distinction. Ratings were made in terms of the rela- 
tive frequency of specific types of criminal behavior 
in the adult population. Since the age at which 
adulthood is considered to have begun varies from 
one society to another, ratings may in some cases 
have included individuals young enough to be
considered adolescent in our society and therefore
delinquent rather than criminal. The distinction does
not appear to be crucial in this study. 

The measures of possible causal variables consist 
of ratings which have been derived from several 
sources. Each will be described in the following 
section. Except where noted (for certain variables in 
Tables 3 and 4), none of the three people who made 
the crime ratings participated in any of the other 
ratings. 


HYPOTHESES, RESULTS, AND DISCUSSION 


Our results will be presented under three 
main headings: Correlates of Crime in Gen- 
eral, Correlates Specific to Theft, and Cor- 
relates Specific to Personal Crime. As this 
classification suggests, we have found it useful 
to consider the antecedents of crime as either 
general or specific, i.e., leading to a general 
increase in criminal behavior, or associated 
with only one major category of crime. A cor- 
relation of +.46 was found between frequency 
of Theft and frequency of Personal Crime. 
This indicates that the two variables show a 
significant degree of communality (p < .01) 
and also some independence. 
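
The reading of this correlation as showing both communality and independence can be made concrete. The sketch below is illustrative only: the number of societies entering the correlation is not stated in this passage, so `n_assumed` is a hypothetical value, and the function names are introduced for this example. It computes the proportion of shared variance (r squared) and the t statistic commonly used to test a correlation coefficient.

```python
import math


def shared_variance(r):
    """Proportion of variance two variables have in common."""
    return r * r


def t_statistic(r, n):
    """t statistic for testing a correlation r against zero, with n - 2 df."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


r = 0.46          # reported correlation between Theft and Personal Crime
n_assumed = 40    # hypothetical number of societies (not given in the text)

print(f"shared variance   = {shared_variance(r):.2f}")
print(f"unshared variance = {1 - shared_variance(r):.2f}")
print(f"t({n_assumed - 2}) = {t_statistic(r, n_assumed):.2f}")
```

On these assumptions roughly one fifth of the variance is common to the two measures and four fifths is not, and the t statistic exceeds the two-tailed .01 critical value for 38 degrees of freedom (roughly 2.7).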


Correlates of Crime in General 


Our principal findings concerning common 
correlates of both Theft and Personal Crime 
are relevant to a hypothesis that crime arises 
partly as a defense against strong feminine 
identification. We will begin with an account 
of this hypothesis. 

In our society crime occurs mostly in men, 
and we have no reason to doubt that this sex 
difference characterizes most societies. Several 
writers have called attention to the sex role 
identification of males as especially pertinent 
to the development of delinquency in our 
society. It is assumed that the very young boy 
tends to identify with his mother rather than 
his father because of his almost exclusive 
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contact with his mother. Later in his develop- 
ment he becomes aware of expectations that 
he behave in a masculine way and as a result 
his behavior tends to be marked by a com- 
pulsive masculinity which is really a defense 
against feminine identification. Parsons (1954, 
pp. 304-305) notes further that the mother 
is the principal agent of socialization as well 
as an object of love and identification. There- 
fore, when the boy revolts he unconsciously 
identifies “goodness” with femininity and 
hence accepts the role of “bad boy” as a 
positive goal. 

Miller (1958) has made a study of lower- 
class culture and delinquency which is also 
pertinent in this connection. He points out 
that some delinquent behavior may result 
from an attempt to live up to attitudes and 
values characteristic of lower-class culture. 
He also notes that many lower-class males are 
reared in predominantly female households 
lacking a consistently present male with whom 
to identify. He feels that what he calls an 
almost obsessive lower-class concern with 
masculinity results from the feminine identi- 
fication in preadolescent years. 

Whiting, Kluckhohn, and Anthony (1958), 
in a cross-cultural study of male initiation 
rites at puberty, found these rites tended to 
occur in societies with prolonged, exclusive 
mother-son sleeping arrangements. Their in- 
terpretation of this relationship is that the 
early mother-infant sleeping arrangement pro- 
duces an initial feminine identification, and 
later control by men leads to a secondary 
masculine identification. The function of the 
initiation ceremony is to resolve this conflict 
of sexual identification in favor of the mascu- 
line identification. The authors further pre- 
dict that insofar as there has been an increase 
in juvenile delinquency in our society, “it 
probably has been accompanied by an increase 
in the exclusiveness of mother-child relation- 
ships and/or a decrease in the authority of 
the father.” 

The hypothesis that crime is in part a 
defense against initial feminine identification 
would lead to the expectation that all factors 
which tend to produce strong identification 
with the mother and failure of early identi- 
fication with the father would be positively 
correlated with the frequency of crime in the 
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adult population. The factor that is easiest 
to study is the presence of the father. It seems 
reasonable to suppose that successful identi- 
fication with the father is dependent on his 
presence. Therefore, societies which differ in 
the degree to which the father is present 
during the child’s first few years should differ 
correspondingly in the degree to which the boy 
typically forms a masculine identification.⁴ 

Whiting (1959) has made use of Murdock’s 
(1957) classification of household structure 
and family composition to distinguish among 
four types of households which provide a 
range from maximal to minimal degree of 
presence of the father. They are as follows: 

Monogamous Nuclear. This household is 
the usual one in our society. The father, 
mother, and children eat, sleep, and entertain 
under one roof. Grandparents, siblings of the 
parents, and other relatives live elsewhere. 
The effective presence of the father in the 
child’s environment is thus at a maximum. 

Monogamous Extended. Here two or more 
nuclear families live together under one roof. 
A typical extended family consists of an aged 
couple together with their married sons and 
daughters and their respective families. In 
such a household, the child’s interaction with 
his father is likely to be somewhat less than 
in the single nuclear household. 

Polygynous Polygynous. The polygynous 
household consists of a man living with his 
wives and their various children. Here the 
child is likely to have even less opportunity 
to interact with his father. 

Polygynous Mother-Child. This type of 
household occurs in those polygynous societies 
where each wife has a separate establishment 
and lives in it with her children. In these 
societies the father either sleeps in a men’s 
club, has a hut of his own, or divides his 
time among the houses of his various wives. 
The husband usually does not sleep in the 
house of any wife during the 2 to 3 years 
when she is nursing each infant. Thus the 
mother may become the almost exclusive 


4 The whole problem of the mechanism whereby 
identification occurs has been omitted from this 
study. In all theories it would appear that identification 
with the father would be in some degree a 
function of the frequency of the presence of the 
father. 
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Ratings. We have included two types of crime in 
our study: theft and personal crime. These two were 
chosen because they are relatively easy to identify 
and almost universal in occurrence. Also, they repre- 
sent two quite different types of behavior. Thus we 
are able to clarify antecedents common to both types 
of crime and those characteristic of only one. Judg- 
ments were always made in relation to the norms of 
the culture under consideration. Theft was defined 
as stealing the personal property of others. Property 
included anything on which the society placed value, 
whether it was a whale’s tooth or a song. Personal 
crime was defined by intent to injure or kill a 
person; assault, rape, suicide, sorcery intended to 
make another ill, murder, making false accusations, 
etc., were all included. 

The method of comparative ratings was used to 
obtain measures of frequency. Three raters independ- 
ently analyzed the ethnographic material on each 
society and made ratings on a seven-point scale as 
to the relative frequency of the type of crime under 
consideration. Thus a rating of 4 on theft would 
mean that the frequency of theft in a given society 
appeared to be about average for the sample of 
societies. Ratings of 5, 6, and 7 represented high 
frequencies and those of 3, 2, and 1 were low. 
Societies in which the behavior did not occur were 
rated as 0. Each rating was classified as confident 
or doubtful at the time that it was made. No rating 
was made if the analyst judged the information to 
be insufficient. We have included all societies on 
which all three analysts made a rating, whether it 
was confident or doubtful, and we have used the 
pooled ratings of all three analysts. The reliability 
of these pooled ratings is estimated as +.67 for 
Theft and +.57 for Personal Crime. These estimates 
were obtained by averaging (using a z transforma- 
tion) the separate interrater reliabilities, and entering 


Andamanese, Baiga, Chenchu, Chukchee, Lepcha, 
Muria, Tanala, Yakut; North America—Cheyenne, 
Comanche, Flatheads, Hopi, Jamaicans (Rocky 
Roaders), Kaska, Kwakiutl, Navaho, Papago, 
Tepoztlan, Western Apache; South America—Aymara, 
Cuna, Jivaro, Siriono, Yagua; Oceania—Arapesh, 
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TABLE 1 


FREQUENCY OF THEFT OR PERSONAL CRIME IN RELATION 
TO FAMILY STRUCTURE AND HOUSEHOLD 

                                     Frequency of     Frequency of 
                                        Theft        Personal Crime 
Family Structure and Householdᵃ       Low   High      Low   High 

Monogamous Nuclear                      7     2         5     4 
Monogamous Extended                     7     3         6     3 
Polygynous Polygynous                   7     6         3     7 
Polygynous Mother-Child                 1    11         3     9 

Note.—Each entry in the table gives the number of societies 
in our sample which have the particular combination of 
characteristics indicated for that row and column. 

The total number of cases in the left-hand and right-hand 
parts of this table and in the various divisions of succeeding 
tables varies because lack of information prevented rating some 
societies on some variables. In testing each relationship we 
have of course been able to use only those societies for which 
the relevant ratings are available. The division into “low” and 
“high” was made as near the median as possible. 

ᵃ See Murdock (1957). 


object of identification for the first few years 
of life. 

Table 1 shows the number of societies with 
low and high frequency of Theft and Personal 
Crime within each of the four categories of 
household type. As the opportunity for con- 
tact with the father decreases, the frequency 
of both Theft and Personal Crime increases. 
This result agrees with our hypothesis. If the 
family structure and household is treated as 
a four-point scale, it yields a correlation of 
+.58 with frequency of Theft and of +.44 
with frequency of Personal Crime; both cor- 
relations are statistically significant (p < .01). 
If we compare the extremes of the distri- 
bution—contrasting Monogamous Nuclear 
households (which provide the maximum 
opportunity for identification with the father) 
with Polygynous Mother-Child households 
(which provide the minimum opportunity for 
identification with the father)—this relationship 
is clearly demonstrated; 18 of the 21 
societies fall in the predicted quadrants for 
Theft, and 14 out of 21 for Personal Crime. 
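
The comparison of the two extreme household types can be checked directly against the counts in Table 1. The short sketch below is an illustration only; the dictionary layout and the function name are introduced for this example. The predicted quadrants are Monogamous Nuclear societies with low crime and Polygynous Mother-Child societies with high crime.

```python
# (low, high) counts taken from Table 1 for the two extreme household types.
table1_extremes = {
    "Theft": {
        "Monogamous Nuclear": (7, 2),
        "Polygynous Mother-Child": (1, 11),
    },
    "Personal Crime": {
        "Monogamous Nuclear": (5, 4),
        "Polygynous Mother-Child": (3, 9),
    },
}


def predicted_quadrants(counts):
    """Return (societies in predicted quadrants, total societies)."""
    nuclear_low, nuclear_high = counts["Monogamous Nuclear"]
    mother_child_low, mother_child_high = counts["Polygynous Mother-Child"]
    hits = nuclear_low + mother_child_high
    total = nuclear_low + nuclear_high + mother_child_low + mother_child_high
    return hits, total


for crime, counts in table1_extremes.items():
    hits, total = predicted_quadrants(counts)
    print(f"{crime}: {hits} of {total} societies in the predicted quadrants")
```

The counts reproduce the figures given in the text: 18 of 21 for Theft and 14 of 21 for Personal Crime.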

Several results of empirical studies in our 
society appear consistent with this finding. 
One is the frequently reported relationship 
between broken homes and delinquency, since 
in the majority of cases broken homes are 
probably mother-child households. Robins and 
O'Neal (1958), for example, in a follow-up 
study of problem children after 30 years, refer 
to the high incidence of fatherless families. 
Glueck and Glueck (1950) report that 41.2% 
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of their delinquent group were not living with 
their own fathers, as compared with 24.8% 
of a matched nondelinquent group. These 
data suggest that a relatively high proportion 
of the delinquents came from what were 
essentially “mother-child” households. 

A recent book by Rohrer and Edmonson 
(1960) is also relevant. Their study is a 
follow-up after 20 years of the individuals 
described in Children of Bondage by Davis 
and Dollard (1941). The importance of the 
matriarchal household typical in a Southern 
Negro lower-class group, and its effect on the 
emotional development of the young boy and 
his eventual attitudes as an adult, are stressed 
throughout. The following passage summarizes, 
in its application to their (Rohrer & 
Edmonson) particular data, an interpretation 
consistent with those we have cited in 
introducing this hypothesis. 


Gang life begins early, more or less contemporane- 
ously with the first years of schooling, and for many 
men lasts until death. . . . Although each gang is 
a somewhat distinct group, all of them appear to 
have a common structure expressing and reinforcing 
the gang ideology. Thus an organizational form that 
springs from the little boy’s search for a masculinity 
he cannot find at home becomes first a protest 
against femininity and then an assertion of hyper- 
virility. On the way it acquires a structuring in which 
the aspirations and goals of the matriarchy or the 
middle class are seen as soft, effeminate, and 
despicable. The gang ideology of masculine independ- 
ence is formed from these perceptions, and the gang 
then sees its common enemy not as a class, nor 
even perhaps as a sex, but as the “feminine principle” 
in society. The gang member rejects this femininity 
in every form, and he sees it in women and in effemi- 
nate men, in laws and morals and religion, in schools 
and occupational striving (pp. 162-163). 


Correlates of Theft 


Although we shall consider correlates of 
Theft in this section and correlates of Personal 
Crime in the next section, each table 
will show in parallel columns the relation of 
a set of variables both to Theft and to Per- 
sonal Crime. This will facilitate comparison 
and avoid repetition. How each of these vari- 
ables was measured will be described in the 
section to which it is most pertinent. 

The first variables to be considered are 
concerned with child training practices. Most 
of the child training variables have been 
developed in our research and described in 
an earlier paper (Barry, Bacon, & Child, 
1957). These variables may be briefly described 
as follows: 

Overall childhood indulgence. The period 
of childhood was defined roughly as covering 
the age period from 5 to 12 years, or to the 
beginning of any pubertal or prepubertal 
status change. In making ratings of childhood 
indulgence, factors relevant to indulgence in 
infancy—such as immediacy and degree of 
drive reduction, display of affection by 
parents, etc.—if operative at this later age, 
were taken into account. In addition, the 
raters also considered the degree of socializa- 
tion expected in childhood and the severity 
of the methods used to obtain the expected 
behavior. 

Anxiety associated with socialization dur- 
ing the same period of childhood. This was 
rated separately for each of five systems 
of behavior: Responsibility or dutifulness 
training; Nurturance training, i.e., training 
the child to be nurturant or helpful toward 
younger siblings and other dependent people; 
Obedience training; Self-reliance training; 
Achievement training, i.e., training the child 
to orient his behavior toward standards of 
excellence in performance and to seek to 
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achieve as excellent a performance as possible. 

In rating the training in these areas, an 
attempt was first made to estimate the Total 
Pressure exerted by the adults in each society 
toward making the children behave in each of 
these specified ways (Responsible, Nurturant, 
Obedient, Self-Reliant, and Achieving). The 
socialization anxiety measures were based on 
an estimate of the amount of anxiety aroused 
in the child by failing to behave in a respon- 
sible, self-reliant, etc. way, and they reflect 
primarily the extent of punishment for failure 
to show each particular form of behavior. The 
measures of Total Pressure reflect both this 
and the extent of reward and encouragement. 

Wherever boys and girls were rated dif- 
ferently on any of the above variables of 
socialization, we used the ratings for boys. 

The relation of the crime ratings to these 
and other variables of child training is pre- 
sented in Table 2. It is clear that Theft is 
significantly related to several variables of 
child training. 

First, Theft is negatively correlated with 
Childhood Indulgence, i.e., societies with a 
high rating of Childhood Indulgence tend to 
have a low frequency of Theft in the adult 
population; and, conversely, societies with a 


TABLE 2 
CHILD TRAINING FACTORS ASSOCIATED WITH THEFT OR PERSONAL CRIME 

                                                    Theft              Personal Crime 
Factor                                           N        r            N          r 

 1. Childhood Indulgenceᵃ                       45    [illeg.]     [illeg.]   [illeg.] 
 2. Responsibility Socialization Anxietyᵃ       43    [illeg.]     [illeg.]   [illeg.] 
 3. Self-Reliance Socialization Anxietyᵃ        43    [illeg.]     [illeg.]   [illeg.] 
 4. Achievement Socialization Anxietyᵃ          36    [illeg.]     [illeg.]   [illeg.] 
 5. Obedience Socialization Anxietyᵃ         [illeg.] [illeg.]     [illeg.]   [illeg.] 
 6. Dependence Socialization Anxietyᵇ        [illeg.] [illeg.]     [illeg.]   [illeg.] 
 7. Mother-Child Sleepingᶜ                   [illeg.] [illeg.]     [illeg.]   [illeg.] 
 8. Infant Indulgenceᵃ 
 9. Age of Weaningᵃ 
10. Oral Socialization Anxietyᵇ 
11. Anal Socialization Anxietyᵇ 
12. Sex Socialization Anxietyᵇ 
13. Aggression Socialization Anxietyᵇ 
14. Nurturance Socialization Anxietyᵃ 
15. Total Pressures toward Responsibility, Nurturance, 
    Self-Reliance, Achievement, and Obedienceᵃ 

Note.—In this and the following tables the correlations are Pearson coefficients, thus 
reflecting all available degrees of gradation in score rather than simply classifying 
societies as high and low. 

Factors 8-15 showed no significant relationship with either Theft or Personal Crime. 

ᵃ See Barry, Bacon, and Child (1957). 
ᵇ See Whiting and Child (1953). 
ᶜ See Whiting, Kluckhohn, and Anthony (1958). 
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TABLE 3 
SOCIOECONOMIC FACTORS ASSOCIATED WITH THEFT OR PERSONAL CRIME 

                                                        Theft              Personal Crime 
Factor                                               N        r            N          r 

 1. Social Stratificationᵃ                          44      +.36       [illeg.]   [illeg.] 
 2. Level of Political Integrationᵃ                 43    [illeg.]     [illeg.]   [illeg.] 
 3. Degree of Elaboration of Social Controlᵇ        43      +.46          40       +.04 
 4. Accumulation of Foodᶜ 
 5. Settlement Patternᵃ 
 6. Division of Labor by Sexᵃ 
 7. Rule of Residence (Patrilocal, Matrilocal, etc.)ᵃ 
 8. Extent of Storingᵇ 
 9. Irrationality of Storingᵇ 
10. Severity of Punishment for Property Crimeᵇ 
11. Severity of Punishment for Personal Crimeᵇ 

Note.—[Partly illegible; factor numbers lost.] Some ratings were made in connection 
with the analysis of crime by two of the three raters, and others in connection with 
an analysis of food and economy. 

Factors 4-11 showed no significant relationship with either Theft or Personal Crime. 

ᵃ See Murdock (1957). 
ᵇ Bacon, Child, and Barry (unpublished). 
ᶜ See Barry, Child, and Bacon (1959). 
*p < .05. 
**p < .01. 


low rating of Childhood Indulgence show a 
high frequency of Theft. 

Frequency of Theft is also positively cor- 
related with socialization anxiety during the 
period of childhood with respect to the follow- 
ing areas of training: Responsibility, Self- 
Reliance, Achievement, and Obedience. It 
should be emphasized that Total Pressures 
toward those four areas of socialization are 
not significantly correlated with Theft. There- 
fore it is apparently not the area or level of 
socialization required which is significant, but 
rather the punitive and anxiety provoking 
methods of socialization employed. 

These findings on child training in relation 
to Theft may be summarized and interpreted 
by the hypothesis that theft is in part moti- 
vated by feelings of deprivation of love. Our 
data indicate that one source of such feelings 
is punitive and anxiety provoking treatment 
during childhood. Such treatment during in- 
fancy may tend to have a similar effect, as 
suggested by a correlation of —.25 between 
frequency of Theft and Infant Indulgence. 
This correlation falls slightly short of significance 
at the 5% level. It is of special interest 
that substantial correlations with socializa- 
tion anxiety in childhood tended to occur 
in the areas of training in Responsibility, 
Achievement, and Self-Reliance. These all 
involve demands for behavior far removed 


from the dependent behavior of infancy and 
early childhood and close to the independent 
behavior expected of adults. If we assume 
that lack of adequate indulgence in childhood 
leads to a desire to return to earlier means 
of gratification and behavior symbolic of this 
need, then we would expect that pressures 
toward more adult behavior might intensify 
this need and the frequency of the symbolic 
behavior. Theft, from this point of view, 
would be seen as rewarded partly by its 
value as symbolic gratification of an infan- 
tile demand for unconditional indulgence ir- 
respective of other people’s rights or interests. 

The results of the early study by Healy 
and Bronner (1936) seem directly pertinent 
to our findings and interpretation. They 
found that a group of delinquents differed 
from their nondelinquent siblings primarily in 
their relationships with their parents; the 
delinquent child was much more likely to give 
evidence of feeling thwarted and rejected. It 
seems reasonable to assume that such feelings 
would often, though not always, indicate a 
real deprivation of parental love. Glueck 
and Glueck (1950) also found that their 
delinquents, compared with matched nondelin- 
quents, had received less affection from their 
parents and siblings and had a greater tend- 
ency to feel that their parents were not con- 
cerned with their welfare. It was also noted 


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that fathers of the delinquents had a much 
greater tendency to resort to physical punish- 
ment as a means of discipline than fathers 
of the nondelinquents. This agrees with our 
observation that more punitive methods of 
socialization are associated with an increased 
frequency of theft. 

Compulsive stealing (kleptomania) has 
been interpreted by psychoanalysts (see 
Fenichel, 1945, pp. 370-371) as an attempt 
to seize symbols of security and affection. 
Thus this form of mental illness, in common 
with more rational forms of stealing, may be 
regarded as being motivated by feelings of 
deprivation of love. 

Table 3 summarizes the relationship be- 
tween our two measures of crime and a 
number of aspects of economy and social 
organization on which we were able to obtain 
ratings. Theft shows a significant relationship 
with only three of these measures: Social 
Stratification, Level of Political Integration, 
and Degree of Elaboration of Social Control. 
Social Stratification was treated as a five- 
point scale ranging from complex stratifica- 
tion, i.e., three or more definite social classes 
or castes exclusive of slaves, to egalitarian, 
i.e., absence of significant status differentiation 
other than recognition of political 
statuses and of individual skill, prowess, piety, 
etc. Level of Political Integration was also 
treated as a five-point scale ranging from 
complex state, e.g., confederation of tribes or 
conquest state with a king, differentiated of- 
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ficials, and a hierarchical administrative 
organization to no political integration, even 
at the community level.⁵ Elaboration of Social 
Control is concerned with the degree to which 
a society has law making, law enforcing, and 
punishing agencies. 

Our findings indicate that theft is positively 
correlated with each of these three measures. 
In other words, with an increased Level of 
Political Integration, Social Stratification, and 
Elaboration of Social Control there is an 
increase in the frequency of Theft. These 
variables show no significant relationship with 
frequency of Personal Crime. Each of these 
institutional conditions seems capable of 
arousing feelings of insecurity and resentment, 
and hence may be similar in this respect to 
parental deprivation. Therefore the correlation 
of these institutional conditions with Theft 
might be tentatively interpreted as consistent 
with our hypothesis about motivational influ- 
ences on Theft. It is obvious that other inter- 
pretations might be made from the same data. 
For example, a high frequency of crime may 
give rise to increased elaboration of social 
control. 

Table 4 presents the relation of both Theft 
and Personal Crime to certain adult attitudes 
on which we were able to obtain ratings. 
Frequency of Theft is positively related to 
Sense of Property and negatively related to 

5 Both variables are taken from Murdock (1957). 
Our manner of treating his data is described in 
Barry, Child, and Bacon (1959). 


TABLE 4
ADULT ATTITUDES ASSOCIATED WITH THEFT OR PERSONAL CRIME

                                                Theft        Personal Crime
Attitude                                        N     r      N     r
1. Sense of Property                            43
2. Trust about Property                         43
3. General Trustfulness
4. Environmental Kindness in Folk Tales         23
5. Environmental Hostility in Folk Tales
6. Communality of Property
7. Competition in the Acquisition of Wealth
8. Generosity
9. n Achievement in Folk Tales

[Remaining table entries are illegible in the source.]

Note. Attitudes 6-9 showed no significant relationship with either Theft or Personal Crime.
1, 2, 6, 7, and 8 were rated by another of the three raters (H.
... Veroff, and Storm (1958).
p < .01.

Trust about Property. This may indicate
merely that the greater the importance of 
property, the greater the variety of acts which 
will be classified as Theft, or that a high 
frequency of Theft gives rise to an emphasis 
on property. But it may also mean that the 
greater the importance of property, the more 
effectively does Theft serve the personal needs 
to which it seems to be related. 

Frequency of Theft is also negatively cor- 
related with Environmental Kindness in Folk 
Tales. This folk tale measure requires some 
explanation. It was taken from an analysis 
of folk tales made by one of the authors 
(MKB) without knowledge of the societies 
from which the sample of folk tales was taken. 
In making the analysis, each folk tale was 
divided into units of action or events as they 
related to the principal character or the 
character with whom the listener would be 
expected to identify. Each unit was then 
classified in one of a number of different cate- 
gories including that of environmental kind- 
ness. Classification in this category means 
that the particular unit involved action or 
state of affairs definitely friendly or nurturant 
to the principal character. Thus our results 
show that societies high in frequency of Theft 
tend to have folk tales which do not represent
the environment as kind. Thinking of the 
environment as lacking in friendly nurturance 
seems entirely consistent with the relative 
absence of parental nurturance which we have 
already found to be correlated with frequency 
of theft. 


Correlates of Personal Crime 


Inspection of Tables 2, 3, and 4 reveals 
that the significant correlates of Personal 
Crime are different from those for theft. In 
no instance does a variable in these tables 
show a significant correlation with both Theft 
and Personal Crime. 

Frequency of Personal Crime shows a significant
positive correlation with Dependence
Socialization Anxiety, a rating taken from 
Whiting and Child (1953). In making this 
rating, an estimate was made of the amount 
of anxiety aroused in the children of a given 
society by the methods of independence train- 
ing typically employed. This estimate was 
based on the following factors: abruptness of
the transition required, severity and frequency 
of punishment, and evidence of emotional 
disturbance in the child.

Ratings on mother-child sleeping are taken 
from Whiting et al. (1958). In this study 
societies were placed into two categories:
those in which the mother and baby shared
the same bed for at least a year to the exclusion
of the father, and those in which the baby
slept alone or with both the mother and
father. According to our results there is a 
high positive relationship between prolonged, 
exclusive mother-child sleeping arrangements 
and frequency of Personal Crime.⁶

Inspection of the child training factors 
associated with frequency of Personal Crime 
suggests that the conditions in childhood leading
to a high frequency of personal crime
among adults are as follows: a mother-child
household with inadequate opportunity in 
early life for identification with the father, 
mother-child sleeping arrangements which 
tend to foster a strong dependent relation- 
ship between the child and the mother, 
subsequent socialization with respect to 
independence training which tends to be 
abrupt, punitive, and productive of emotional
disturbance in the child. 

We would predict that this pattern of child 
training factors would tend to produce in the
child persistent attitudes of rivalry, distrust, 
and hostility, which would probably continue 
into adult life. The results obtained with
ratings of adult attitudes (Table 4) support 
this view. Frequency of Personal Crime is 
negatively correlated with General Trustful- 
ness. Frequency of Personal Crime is also 
positively correlated with Environmental 
Hostility in Folk Tales. Classification of a
folk tale unit in this category means that the
particular unit involved definite deception,
aggression, or rejection in relation to the
principal character. This variable was not
highly related to that of environmental kindness,
although the results obtained with the


6 The variable of mother-child sleeping might be
considered to favor feminine identification. In that
event, the fact that it shows correlations in the
positive direction with both types of crime tends
toward confirmation of the findings in our earlier 
section on Correlates of Crime in General. 




two are consistent with each other. The cor- 
relation between them was only —.34, most 
folk tale units not falling in either of these 
categories. Our results indicate that societies 
which are rated as relatively high in the fre- 
quency of Personal Crime have folk tales 
with a high proportion of events representing 
the environment as hostile. If we may infer 
that the content of folk tales reflects the 
underlying attitudes of the people who tell 
them, then this finding, as well as those with 
our other measures of adult attitudes, sup- 
ports the view that personal crime is corre- 
lated with a suspicious or distrustful attitude 
toward the environment. 

An analysis by Whiting (1959) of the 
socialization factors correlated with a belief 
in sorcery is relevant to this aspect of our 
results. He points out that a belief in sorcery 
is consistent with a paranoid attitude. Ac- 
cording to Freudian interpretation, paranoia 
represents a defense against sexual anxiety. 
Whiting presents cross-cultural data in sup- 
port of a hypothesis, based on Freud’s theory 
of paranoia, that a belief in sorcery is related 
to a prolonged and intense contact with the 
mother in infancy followed by a severe sex
socialization. The same hypothesis might be
applied to frequency of Personal Crime, since 
we have evidence that Personal Crime is cor- 
related with a suspicious, paranoid attitude 
in adult life, and sorcery is after all one form 
of Personal Crime. Our results for Personal
Crime, in common with Whiting’s for sorcery, 
show a correlation with mother-child house- 
hold and prolonged mother-child sleeping. 
However, we found no significant correlation 
with severe sex socialization but rather with 
severe dependence socialization. We do not 
feel that these findings negate the Freudian 
interpretation, because dependence socializa- 
tion, bearing as it does on the child’s intimate 
relation with his mother, necessarily is concerned
with the child’s sexual feelings in a
broad sense. 


GENERAL DISCUSSION 


We would like to emphasize the value of 
the cross-cultural method for exploring the 
possible determinants of crime. When each
society is used as a single case, and is classified
according to crime and other variables for the
entire society over a period of years, the 
measures are likely to be reliable; comparison 
among societies provides great diversity in 
frequency of crime and in the other variables 
to be related with it. 

The cross-cultural method may help us 
to identify variables with a causal relation- 
ship to crime. For example, our cross- 
cultural data suggest that high differentiation 
of status within a society is a favorable condi- 
tion for a high frequency of Theft, and that 
a high frequency of Personal Crime is associ- 
ated with a generalized attitude of distrust. 
These relationships should be subjected to 
more systematic and intensive tests within our 
own society than has hitherto been done. 

Variables which have been suggested, 
whether in empirical studies or theoretical 
discussions, as possible causes of crime within 
our society may be tested for broader signifi- 
cance by the cross-cultural method. It has 
been argued, for example, that within our 
society delinquent or criminal behavior is 
likely to develop if the boy has been raised 
without adequate opportunity to identify with 
the father. These suggestions have often been 
made in connection with family patterns that 
are said to characterize certain classes or 
groups within our society; the cross-cultural 
findings indicate that a high frequency of 
both Theft and Personal Crime tends to 
occur in societies where the typical family for 
the society as a whole creates lack or limita- 
tion of opportunity for the young boy to form 
an identification with his father. Therefore 
the cross-cultural method supports the theory 
that lack of opportunity for the young boy 
to form a masculine identification is in itself 
an important antecedent of crime. 

Another instance of such confirmation in 
a broader sense is the following: In our 
society delinquents have been reported to 
express feelings of alienation from their 
parents. It is unclear, however, whether this
reflects their parents’ actual treatment of 
them, or merely their own subjectively deter- 
mined perceptions. Our cross-cultural data (in 
common with some of the findings within our 
own society) indicate that a high frequency 
of Theft is correlated with an actual low 
degree of indulgence during childhood. 

Other theories about the antecedents of
crime, when tested with the cross-cultural 
method, have not been confirmed in this 
broader framework. For example, pressures 
toward achievement were not significantly 
related to frequency of crime, although such 
a relationship is implied by theories of delin- 
quency which emphasize the discrepancy be- 
tween culturally induced aspirations and the 
possibility of achieving them. This negative 
result in our sample of societies does not 
deny the existence of such a relationship 
within our society, but it does indicate a 
limitation on its generality. 


EFFECT OF FANTASY AROUSAL OF AFFILIATION
DRIVE ON DOLL PLAY AGGRESSION


JESSE E. GORDON AND FAYE COHN ¹


University of Michigan 


Amount of aggression shown by 2 groups of nursery school children was 
measured in doll interviews before and after a story was read to them. In
one group the story was designed to arouse affiliation drive; the story told
the other group was neutral. While the controls increased in aggression on
the 2nd doll play session, the experimental group remained about the same. 
Initially high and low aggressive Ss were equally affected by the affiliation 
arousal, as were boys and girls. The results indicate that the arousal in 
fantasy of affiliation drive significantly lowers the expression of aggression. 


There has been a considerable amount of 
experimental work on aggression in the past 
few years (Berkowitz, 1958; McNeil, 1959), 
much of it in the framework of Miller's 
(1955) conflict model. Several of the studies 
reported have used some form of thematic 
material for arousing aggressive drive in 
order to test predictions from the Miller 
model. Hokanson and Gordon (1958) read 
fictitious case studies of criminals to their 
subjects in order to produce high and low
arousal of aggression. More recently, Mussen 
and Rutherford (1961) showed children a 
cartoon movie containing the aggressive 
themes usually present in animated cartoons, 
and Siegel (1956) found some nonsignificant 
increase in aggression in children who had 
seen an aggressive movie. In these studies 
the conflict consists of the opposition of 
aggression guilt to the aggression arousal. 
The study reported here involves a somewhat 
different conflict, somewhat closer to the 
concept of ambivalence; it is a conflict be- 
tween mutually exclusive approach drives, 
rather than a drive and its inhibition. 

One problem in the use of experimental 
drive arousal procedures for testing hypotheses
about aggressive drive is that the
experimental manipulations usually involve
an experimenter controlled display of aggression
not unlike the behavior to be
measured as the dependent variable. When
an experimenter shows pictures of an ag- 
gressive interaction, or tells a story about 


1 The authors are indebted to Warren T. Norman
for his aid in the statistical analysis.


aggressive behavior, or aggressively insults the 
subject, and then measures the amount of 
aggression shown in the subject's behavior, 
the independent and dependent variables be- 
come contaminated. Such experimental ma- 
nipulations may not in fact arouse aggressive 
drive; increase in the display of aggressive 
behavior or fantasy following such experi- 
mental manipulations could be a response to 
an implicit communication of the kind of 
behavior expected by the experimenter and 
considered to be appropriate to an experi- 
mental situation which is otherwise quite 
unstructured for the subject. The design of 
the experiment reported here is an attempt 
to minimize this problem by using an arouser 
(affiliation) whose content is different from 
the dependent variable (doll play aggression). 

The possible contamination described 
above calls into doubt those experiments in 
which aggressive fantasy was used as the 
experimental arouser of a motivational state. 
The question of the arousing capacity of 
fantasy is important not only for the interpretation
of the findings of the experiments
mentioned above; it also has relevance to
the psychoanalytic catharsis hypothesis. Ac- 
cording to psychoanalytic theory, instinctual 
energies may be discharged either motorically 
or in fantasy, as alternative discharge sys- 
tems. Psychoanalytic theory would predict, 
then, that the presence of a fantasy discharge 
in the subject should reduce the overt ex- 
pression of the drives that are expressed by 
the fantasy.
Discharge of aggressive energies in fantasy 
should decrease overt motoric expressions of 
aggression. Similarly, a fantasy in which affiliation
needs are reduced should drain off
the energies involved in affiliative behavior 
so as to reduce overt affiliation responses. 
Such a prediction conflicts directly with the 
use of an experimenter induced fantasy as 
an arouser of a motivational state having the 
same content as the fantasy. As indicated 
above, the research which appears to demon- 
strate such arousal may instead be demon- 
strating an alteration in cognitions concerning 
the unstructured experimental situation in 
which the dependent behavior was measured. 
The experiment reported in this paper was 
designed to test the motivation arousing 
effect of exposure to fantasy (a fictional 
story told by the experimenter to the subject) 
on the subject’s subsequent behavior. The 
procedures use an hypothesized arouser (af- 
filiation drive) which is different from the 
dependent variable (aggression) in order to 
minimize the problems involved in the use 
of similar dependent and independent 
variables. 

The present experiment makes explicit use 
of the notion of conflict. It is expected that 
affiliation drive conflicts with aggression such 
that if a story theme about affiliation does 
elicit n Affiliation (n Aff) in the subject, 
then there should be a reduction in the 
subject's aggressive responses in a doll inter- 
view following the story. The experiment 
tests the hypothesis that exposure to a fan- 
tasy stimulant arouses motivation as shown 
through an inhibition of behavior which con- 
flicts with the aroused drive. 

The general plan of this research involves 
the treatment of two groups of children, with 
each group tested before and after an ex- 
perimental manipulation. The manipulation 
of one of the groups was designed to invoke 
affiliation drive, with the other group having 
a neutral or nonaffiliative arousal to serve 
as a control. Because successive testings of
children in doll play and similar situations
have been found to produce increases in aggressive
display, the basic hypothesis that
an affiliation fantasy will produce a decrease 
in aggressive behavior must be modified such 
that rather than specifying an absolute decre- 
ment in aggression following affiliation 
arousal, it is necessary to speak in terms of 
a slower rate of increase in aggression following
affiliation arousal. 

In order to explore the characteristics of 
affiliation-aggression conflict, subjects were 
divided into a group initially high in aggression
(HA) and a group initially low (LA).
This allows a test of the assumption that 
affiliation and aggression drives are independ- 
ent of each other. If the affiliation arousal 
produces the same change in the aggression 
scores of HA and LA subjects, then we could 
draw the conclusion that affiliation arousal 
is independent of aggressive potential. 

Thus the intent of this experiment is to 
discover if a story with an affiliation theme
produces a greater decrease (or less of an
increase) in aggression than occurs when
affiliation is not invoked, and to discover if 
its effects are greater with HA subjects than 
with LA subjects. 


METHOD 


The doll play technique with young children 
has been used by several investigators in research 
on aggression. This experiment utilizes a modifica- 
tion of this technique. The doll interview is a 
structured projective play technique in which a 
doll is asked questions, and the child answers for 
the doll. It is a modification of the less-structured 
doll play technique utilized primarily by Ammons 
(1950; Ammons & Ammons 1952, 1953) in several
studies of young children. In this modification, the
doll is asked questions by “another doll” rather
than directly by the experimenter. 

Aggression is treated in this experiment in terms 
of the Dollard, Doob, Miller, and Mowrer (1939) 
interpretation; i.e., as a response to frustration.
The definition of an aggressive act used in the 
experiment, growing out of the Dollard et al. 
definition and used in most doll play studies 
(Cohn, in press) is that aggression is an act having 
an intent to injure, punish, or generally disparage 
and deprecate. 


Subjects 


The subjects for this experiment were 19 boys and 
17 girls, 4- and 5-year-olds, attending a publicly 
supported nursery school for children of working 
mothers in Ann Arbor, Michigan. This sample is 
typical of such nursery schools in that only 8 of 
the children came from intact homes, while 26 
came from homes broken by divorce or separation, 
and 2 children came from homes in which one 
parent had died.²


2 Of the original group of 40 subjects, 4 were
excluded: 3 because they were non-English-speaking,
and 1 because of extreme immaturity. Thirty of 


Procedure


All of the children were tested at their nursery
school in a large room, usually used as a classroom,
which was reasonably free from distractions.
The testing sessions always took place between
3 and 5:30 P.M., shortly after the children's 2-hour
naps.

The materials used were a tape recorder, placed 
as inconspicuously as possible in the testing room, 
three baby dolls which could be considered as 
either males or females, a red balloon, a picture of 
a large and colorful ball, and a picture of a 
group of dogs playing together. 

The experiment consisted of three steps: an 
initial measure of expression of fantasy aggression, 
arousal of n Aff, and a final measure of aggression 
after the arousal. To measure the expression of 
fantasy aggression before and after arousal of n 
Aff, two doll interviews, each designed to arouse 
aggression, were conducted. 

The doll interviews consisted of a number of 
questions in the context of a theme concerning the 
activities of two dolls at a playground. The questions
were structured by the experimenter so that,
in general, one doll did something to inhibit or 
frustrate the other doll. The answers given for 
the other doll by the subject formed the basis for 
the measurement of aggression. 

The actual procedure was as follows: the experi- 
menter approached the subject in the nursery 
room and asked if he would like to play with her. 
If the subject agreed, he was led to the testing 
room in which two dolls were lying on the floor. 
The experimenter then explained that they were 
going to play a game with the dolls, but that the 
dolls needed names before they might begin the 
game. The subject was asked to name both dolls 
and “choose one.” This was done quite readily by 
most subjects. It was hoped that this procedure
would aid in the involvement of the child with
the dolls. Both doll interviews were begun in this
way.

If the experimenter was questioned about the tape
recorder, located in a corner of the room, she said:
“It is just a machine that someone has left here.
I don’t know what it is.” By answering the question 
in an uninteresting manner, the experimenter was 
usually able to take the subject’s attention away 
from the tape recorder. 

The experimenter did not handle the dolls at all
during the interview except to have them enter or 
exit from the situation. Each subject was allowed 
to manipulate the dolls as he wished and was not 
given any instructions or suggestions regarding this 
matter. The experimenter read each question in the
interview from a paper lying on the floor.

The first testing session took approximately 15 
minutes, and the second, which included the n Aff 
arousal or control procedures, took 35 minutes. 


the 36 children were judged by the experimenter
as being from middle-class families, and 6 as being
from lower-class families.


Scoring


Each physical and verbal expression of aggression
was scored; however, no more than one aggressive
response was scored for one question.
The total number of aggressive responses in each 
interview was used as the subject’s score for that 
session. 
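
The scoring rule just described can be sketched in a few lines. This is only an illustration: the function name and the data layout (one list of coded aggressive responses per interview question) are hypothetical and do not come from the original study.

```python
def interview_score(responses_per_question):
    """Return the session score under the stated rule.

    Each element of `responses_per_question` is a (hypothetical) list of the
    aggressive responses a judge coded for one question. A question counts
    at most once, however many aggressive responses it drew.
    """
    return sum(1 for responses in responses_per_question if len(responses) > 0)


# Hypothetical interview with five questions; question 3 drew two
# aggressive responses but still contributes only one point.
example_interview = [["hits"], [], ["calls names", "pushes"], [], ["kicks"]]
example_score = interview_score(example_interview)
print(example_score)
```

Capping each question at one point keeps the score bounded by the number of questions, so a single very talkative answer cannot dominate a child's total.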

The rating of aggressive responses was done 
independently by two judges, and the reliability 
of these ratings, as measured by the product- 
moment correlation, was .96. 
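
As an illustration of how such a reliability figure is obtained, the sketch below computes a product-moment correlation between two judges' session scores. The scores are invented for the example (they are not the study's data), and the function name is an assumption.

```python
import math


def product_moment_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares of deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical aggression scores assigned by two judges to six interviews.
judge_a = [3, 5, 2, 6, 1, 4]
judge_b = [3, 4, 2, 6, 1, 5]
reliability = product_moment_r(judge_a, judge_b)
print(round(reliability, 2))
```

A value near 1 means the two judges ordered and spaced the interviews almost identically; it does not by itself show that they agreed on absolute level.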


Affiliation Arousal 


The children were randomly assigned to the 
experimental and control groups. The n Aff arousal 
procedure for the experimental group involved the 
telling of a story to each subject in which a dog 
seeks playmates. This was done anywhere from 2 
days to 2 weeks after the first doll interview (DP1).

In order to control for the effects produced by 
telling a story to a young child, the control group 
heard a story almost identical in content and word- 
ing, except that the theme of the control story 
concerned the search by a young dog for a desired 
ball instead of a search for friends, and thus the 
word “ball” was inserted in the control story when- 
ever a reference to friends was made in the arousal 
story. Animals were used in the stories in order to 
avoid carry-over of the story theme into the doll 
interview. 

To check on the children’s understanding of the 
story theme, each subject was asked at the conclu- 
sion of the story whether the dog hero would choose 
to play with a ball or a group of dogs, while a 
picture of a ball and one of a group of dogs was 
shown to the subject. If the story theme was 
communicated effectively, then the subjects in the 
experimental group should choose the group of 
dogs in preference to the ball; this preference 
should not be evidenced in subjects given the 
control story. 

Immediately after the arousal or control story, 
the second doll interview (DP2) was conducted.


High and Low Aggression 


On the basis of aggression scores in DP1, subjects
were categorized as either HA or LA, based on a 
mean split. The arousal group contained 11 males 
and 10 females, with 6 of the males and 3 of the 
females in the HA category. The control group
contained 8 males and 7 females, with 4 of the 
males and 3 of the females in the HA group. 

Thus four experimental groups were created as
follows: HA-arousal (n = 9), LA-arousal (n = 12),
HA-control (n = 7), and LA-control (n = 8).
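
A mean split of this kind can be sketched as follows. The subject labels and scores are hypothetical, and the handling of a score exactly equal to the mean (placed in LA here) is an assumption, since the text does not say how ties were treated. The cell sizes at the end are the ones reported above.

```python
def mean_split(scores):
    """Label each subject HA (above the group mean) or LA (at or below it).

    `scores` maps a subject label to an initial aggression score. Assigning
    ties to LA is an assumption made only for this illustration.
    """
    mean = sum(scores.values()) / len(scores)
    return {subject: ("HA" if value > mean else "LA")
            for subject, value in scores.items()}


# Hypothetical first-interview aggression scores for six children.
initial_scores = {"s1": 6, "s2": 1, "s3": 4, "s4": 2, "s5": 3, "s6": 5}
labels = mean_split(initial_scores)
print(labels)

# Cell sizes as reported in the text; they should account for all 36 subjects.
cell_sizes = {"HA-arousal": 9, "LA-arousal": 12, "HA-control": 7, "LA-control": 8}
print(sum(cell_sizes.values()))
```

Because the split is made at the mean rather than the median, the HA and LA groups need not be equal in size, which is consistent with the unequal cells reported.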


Sex Differences 


Because it is known from past doll play and doll 
interview studies (Cohn, in press) that boys are 


3 The question and picture order presentation were
alternated to avoid errors due to order effects.
more aggressive in doll play than girls, the data
from boys and girls were treated independently in
the analysis.

Thus the overall design was a 2 × 2 × 2 analysis
of variance, with subjects in two experimental conditions
(arousal and control), two sexes, and two
levels of initial aggression.


TABLE 1
AGGRESSION PRIOR TO AROUSAL

        Arousal    Control    t ratio
M       3.19       3.13
N       21         15

* p > .05.


TABLE 3
MEANS OF AGGRESSIVE RESPONSES FOR ALL GROUPS

                  DP1                  DP2
Group         Males   Females     Males   Females
Arousal
  HA          5.50    3.00        3.83    2.67
  LA          2.80    1.57        4.00    1.43
Control
  HA          5.75    4.00        6.75    5.33
  LA          2.25    0.75        5.50    3.50


RESULTS 


In order to be sure that the assignment 
of children to the arousal and control groups 
was truly random with respect to initial 
aggression, the differences between the two 
groups in aggression in DP1 were examined
(see Table 1). The results indicated that the 
two groups were not significantly different 
with respect to aggression prior to arousal. 

In order to test for the effectiveness of
the arousal procedure, the preferences for 
the dog or the ball were compared in both 
groups. In the arousal group, 71.4% of the 
subjects chose to play with the dogs as com- 
pared with 33.3% of the control group. This 
difference yields χ² = 6.89, which is statistically
significant (p < .01) (see Table 2).
Thus the two stories apparently produced 
the desired communication to the subjects. 

Table 3 reports the mean doll play ag- 
gression scores of the group. Because the 
main interest was in differences in rates of 
change in aggression as a function of the
experimental arousals rather than absolute
level, the analysis is in terms of differences in
aggression between DP1 and DP2. Table 4
reports this analysis of difference scores. 
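
The difference scores can be illustrated from the cell means in Table 3. The sketch below is not the authors' computation. It assumes that the four numeric columns of Table 3 are, in order, DP1 males, DP1 females, DP2 males, and DP2 females, and it averages cells without weighting by cell size; the names used are hypothetical.

```python
# Cell means from Table 3 as (DP1, DP2), keyed by (condition, level, sex).
# The column reading is an assumption, as stated above.
TABLE3 = {
    ("arousal", "HA", "M"): (5.50, 3.83),
    ("arousal", "HA", "F"): (3.00, 2.67),
    ("arousal", "LA", "M"): (2.80, 4.00),
    ("arousal", "LA", "F"): (1.57, 1.43),
    ("control", "HA", "M"): (5.75, 6.75),
    ("control", "HA", "F"): (4.00, 5.33),
    ("control", "LA", "M"): (2.25, 5.50),
    ("control", "LA", "F"): (0.75, 3.50),
}


def mean_change(condition, level=None):
    """Unweighted mean of DP2 - DP1 over the cells matching the arguments."""
    diffs = [dp2 - dp1
             for (cond, lev, _sex), (dp1, dp2) in TABLE3.items()
             if cond == condition and (level is None or lev == level)]
    return sum(diffs) / len(diffs)


for cond in ("arousal", "control"):
    print(cond, round(mean_change(cond), 2))
```

Under this reading the control cells rise on average while the arousal cells stay about level, and the LA and HA control changes come out at 3.00 and about 1.16 to 1.17, in line with the figures quoted in the Discussion.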


TABLE 2
CHOICE OF BALL VERSUS DOGS BY CONTROL
AND AROUSAL GROUPS

Choice    Arousal    Control
Ball      28.6       66.7
Dogs      71.4       33.3

Note.—In percentages.



This analysis indicates that the arousal of 
affiliation drive produced a clear decrease in 
aggressive behavior, relative to the control 
group. Rate of change was apparently no 
different in boys and girls. 
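
A worked check of the main effects is possible from the published cell means. The sketch below is an inference from the numbers, not a procedure the text states: it assumes the Table 3 columns are DP1 males, DP1 females, DP2 males, DP2 females, treats each effect as a single-degree-of-freedom contrast on the eight cell-mean changes, and takes the error mean square of 1.03 from Table 4 as already expressed on the cell-mean scale. All names are hypothetical.

```python
# Cell means from Table 3 as (DP1, DP2), keyed by (condition, sex, level).
CELLS = {
    ("arousal", "M", "HA"): (5.50, 3.83),
    ("arousal", "F", "HA"): (3.00, 2.67),
    ("arousal", "M", "LA"): (2.80, 4.00),
    ("arousal", "F", "LA"): (1.57, 1.43),
    ("control", "M", "HA"): (5.75, 6.75),
    ("control", "F", "HA"): (4.00, 5.33),
    ("control", "M", "LA"): (2.25, 5.50),
    ("control", "F", "LA"): (0.75, 3.50),
}
CHANGES = {cell: dp2 - dp1 for cell, (dp1, dp2) in CELLS.items()}

# +1 / -1 coding for the two levels of each factor.
CODES = {"arousal": 1, "control": -1, "M": 1, "F": -1, "HA": 1, "LA": -1}

MS_ERROR = 1.03  # error mean square as printed in Table 4


def effect_ss(factors):
    """Sum of squares for a contrast on the cell-mean changes.

    `factors` lists key positions (0 = condition A, 1 = sex B, 2 = level C);
    the contrast weight of a cell is the product of its codes on those
    factors, and SS = (weighted sum)^2 / (number of cells).
    """
    total = 0.0
    for cell, change in CHANGES.items():
        weight = 1
        for position in factors:
            weight *= CODES[cell[position]]
        total += weight * change
    return total ** 2 / len(CHANGES)


ss_a = effect_ss((0,))      # arousal versus control
ss_c = effect_ss((2,))      # HA versus LA
ss_bc = effect_ss((1, 2))   # sex by initial level
print(round(ss_a, 2), round(ss_c, 2), round(ss_bc, 2))
print(round(ss_a / MS_ERROR, 2), round(ss_c / MS_ERROR, 2))
```

On these assumptions the contrasts land on the tabled sums of squares for A, C, and B X C, and dividing by 1.03 gives the tabled F ratios for A and C, which supports the column reading of Table 3 used here.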

The analysis shows a significant, although 
not highly reliable, difference between initially 
high and initially low aggressive subjects, 
suggesting that these groups changed in aggressiveness
from DP1 to DP2 at different
rates. However, their rate of change shows 
no significant interaction with the experi- 
mental conditions, indicating that there was 
no significantly greater effect of arousal on 
initially high aggressive subjects than on 
initially low aggressive subjects; i.e., the dif- 
ference between the rates of change of HA 
and LA subjects was not altered by the 
experimental arousal. Figures 1a and 1b
describe these data. 

Because of the absence of significant sex 
differences in rate of change, Figures 1a 


TABLE 4
ANALYSIS OF VARIANCE OF CHANGE SCORES

Source                        df    SS      MS      F
Arousal versus Control (A)    1     10.74   10.74   10.43**
Males versus Females (B)      1     .001    .001    .001
HA versus LA (C)              1     5.66    5.66    5.50*
A X B                         1
A X C                         1     .06     .06     .06
B X C                         1     1.54    1.54    1.50
A X B X C                     1     .41     .41     .40
Error                         28    1.99    1.03
Total                         35    18.42

Note.—Error was computed from pooled within-cells
variance. Analysis is by method of unweighted means.
** p = .005.


and 1b report adjusted means with sex dif- 
ferences partialed out. These figures indicate 
that the increase in aggression from DP1
to DP2 usually found in research on children's
doll play behavior, and observed here in the 
control group, was considerably retarded for 
both HA and LA subjects. This retardation 
was so great that the HA group, whose rate 
of change from DP1 to DP2 is slower than
for the LA group, actually reduced in ag- 
gressive fantasy after the arousal, while the 
LA group increased slightly or remained 
about the same. 


DISCUSSION 


There has been a fair amount of research 
on the conditions which modify aggression 
(Berkowitz, 1958; Buss, 1961; McNeil, 
1959) but most of this has been in the 
direction of demonstrating that aggression 
may be increased or evoked under certain 
circumstances. Relatively little research has 
been directed at the reduction of aggression. 
One of the few studies in this area (Hollen- 
berg & Sperry, 1951) has shown that non- 
permissiveness for aggression in a nursery 
group (“We don’t play that way here.”)
produces a decrease in aggressive behavior 
relative to that shown by children permitted 
to behave aggressively. The experiment re- 
ported here points up an alternative means of 
reducing aggressive behavior, through the 
evocation of a conflicting motive. It is of 
interest to note that as simple a procedure 
as telling a story about a dog searching for 
friends was able to produce a quite marked 
reduction in aggression. This finding suggests 
that aggressive drive may not be as imperious 
and prepotent a determinant of behavior as 
is commonly thought.

The data suggest that exposure to a 
fantasy theme was effective in arousing a 
motivational state. The story made no direct
reference to aggression, so that its effects
cannot be ascribed to implicit structuring
factors. Even if the story suggested to the
children that affiliative behavior was appropriate
in the experimental situation, the
measure of aggression provided little opportunity
for affiliative responses. The subject’s
responses to the “frustrating” doll were


[Figure 1. Panel 1a: Control group. Panel 1b: Arousal group. Vertical axis: adjusted mean aggression scores; horizontal axis: DP1, DP2.]


FIG. 1. Adjusted mean aggression scores.


either to become aggressive, or to ignore the 
frustration, and there was nothing in the 
story that would specifically suggest to the 
subject that ignoring frustrations was ap- 
propriate or expected by the experimenter. 
Further, the aggressively frustrating behavior 
of the experimenter controlled doll, to which 
the subject responded for his doll, should 
have created an implicit norm for the sub- 
ject that was opposite and counteractive to 
any affiliative norms communicated by the 
story theme. Finally, the responses of the 
subjects to the frustrating doll were adaptive 
in the sense of being varied to suit the 
nature of the frustrating doll's behavior.
While motivational responses are generally 
adaptive in this sense, responses which are 
the product of set, suggestion, and such 
implicit structuring factors tend to be stereo- 
typed and imitative rather than adaptive. It 
might be added that while it is easy to see 
how an experimenter’s display of generally 
disapproved behavior such as aggression 
could produce a structuring that would 
reduce inhibitions against similar behavior 
in the subject (rather than actually arousing 
the subject's aggression) it does not seem 
reasonable that an experimenter's display of 
socially approved affiliative behavior would 
reduce inhibitions against affiliative behavior 
in the subject. This does not seem to be a 
drive which is commonly inhibited. Increases 
in affiliative effects, as manifested through a 
reduction in aggressive display, would there- 
fore not be the result of disinhibition. Thus 
several considerations lead to the conclusion 
that the effects of the fantasy theme were 
motivationally arousing, rather than a func- 
tion of set, suggestion, or implicit norms 
defining acceptable behavior. This study 
therefore supports the interpretations by
Mussen and Rutherford (1961), Hokanson 
and Gordon (1958), and Siegel (1956) that 
their thematic arousers actually influenced 
their subjects’ motivational states. 

The data argue against the catharsis hy- 
pothesis. Rather than the affiliation fantasy 
providing a satisfactory outlet for affiliative 
needs so that the overt expression of such 
needs was reduced, our subjects responded 
with clear evidence for the opposite; the 
fantasy affiliation maximized the operation 
of affiliation drive, as shown by the reduction 
in aggression. 

The conflicting drives involved in the 
experiment were apparently independent of 
each other, since the A × C interaction was
nonsignificant. The arousal of n Aff did not 
affect the HA group to any greater extent 
than it did the LA group. This can be seen 
by a close examination of Figure 1. The 
LA controls showed a mean increase of 3.00 
on the aggression scale in DP2 compared
to DP1; however, the HA controls did not
increase this much. They increased by only 
1.16 on the aggression scale. This difference 
in rate of increase between the HA and LA 
subjects may indicate the presence of some 
avoidance or inhibitory factor operating more 
strongly in the HAs than in the LAs. Assum- 
ing this to be the case, the avoidance or 
inhibition is apparently equal to 3.00 − 1.16,
or 1.84 units of the aggression scale, i.e., the
HAs disinhibit aggression by 1.84 units less
than do the LA controls.⁴ We may call this
an "initial inhibitory potential" in the HA 
subjects. 
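The arithmetic behind the "initial inhibitory potential" can be written out as a minimal sketch. The two increases are the control-group values quoted above; the function and variable names are illustrative assumptions, not the authors' notation.

```python
# Mean DP1-to-DP2 increases in aggression for the two control groups,
# as quoted in the text (aggression-scale units).
LA_CONTROL_INCREASE = 3.00
HA_CONTROL_INCREASE = 1.16


def initial_inhibitory_potential(la_increase: float, ha_increase: float) -> float:
    """Shortfall of the HA increase relative to the LA increase.

    The name is taken from the text; treating the shortfall as a simple
    difference follows the authors' own subtraction.
    """
    return round(la_increase - ha_increase, 2)


if __name__ == "__main__":
    print(initial_inhibitory_potential(LA_CONTROL_INCREASE, HA_CONTROL_INCREASE))
```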

When the effects of the affiliation arousal 
are superimposed on these changes in aggres- 
sion which occur as a function of successive 
doll play sessions, it may be seen that they 
involve the subtraction of about the same 
amount for both the HA and LA groups, i.e., 
the affiliation arousal produced the same 
decrease in both groups, but because the HA 
group does not show as strong a practice 
effect as the LA group, the HA experimental 
subjects show a greater net reduction in ag-
gression than do the LA subjects.

⁴ This extrapolation from the LA control group
performance to the HA control group involves the
assumption that the two groups differ in the height
of approach gradients but not in their slopes.

In conclusion, it seems that affiliation 
arousal operates to conflict with aggression.
The effect of affiliation on aggression is the 
same for high and low aggressive children, and 
for both sexes. However, the high aggressive 
children appear to have an initial inhibitory 
potential which is not as great in low ag- 
gressive children. The two inhibitory factors 
(initial inhibition and the affiliation pro- 
duced drive) seem to be additive, so that 
highly aggressive children actually reduce 
in aggressive behavior when affiliation is 
aroused, while low aggressive children remain 
about the same. These differences are as-
sociated with being raised in a mother
dominated home,⁵ producing the paradoxical
effect that father deprived children are
initially more aggressive than nondeprived
children but also have a greater
potential for inhibition of aggression.

Finally, it is of some interest to note that 
the arousal of affiliation seems to have had 
the same relative effect on young girls as on 
young boys, despite the common idea that 
girls are more responsive to such stimulation 
(McClelland, Atkinson, Clark, & Lowell, 
1953). However, it is possible that sex dif- 
ferences may appear at later ages. In this 
experiment boys were consistently more ag- 
gressive than girls, and the difference was not 
modified by the experimental manipulations 
and did not interact with initial aggressive 
tendency. 

While the results of this experiment offer 
some promising leads, there is need for 
further research in this area. Such variables 
as child training practices, age of child, 
relationship between doll play behavior and 
behavior in the child’s natural environment, 
and initial affiliation drive level, should be 
given explicit attention in subsequent work. 

This research has some incidental signifi- 
cance for the study of propaganda, mass 
communications, and esthetics. The demon- 
strated effectiveness of a story theme in 
arousing motivation can be utilized for study 
of the ways in which literary and dramatic
stimuli, radio, television, and movie programs,
etc., modify behavior. Individual differences,
demographic and sociological variables, and
the effects of socialization practices can be
studied for their influence in determining
responsiveness to such material.

⁵ The HA subjects more frequently came from
homes in which the father was absent than did
the LA subjects (χ² = 4.67, p < .05).

EFFECTS OF LEVEL OF COOPERATIVE CHOICE BY THE
OTHER PLAYER ON CHOICES IN A PRISONER’S
DILEMMA GAME. PART I¹


V. EDWIN BIXENSTINE, HERBERT M. POTASH,² AND KELLOGG V. WILSON³


Kent State University 


In a Prisoner’s Dilemma type game, Ss (male and female students) were
asked to make repeated choices between 2 alternatives, R and B, where R is
presumably the cooperative and B the competitive choice. While believing
that they were playing persons of the same sex, Ss actually played a simulated
partner who, for ½ of the group, chose 83% random R and, for the other
½, 83% random B for 30 trials. Thereafter for 60 trials, the simulated partner
chose according to a strategy of 83% matching, 17% mismatching the choices
of S. The purpose of this procedure was to determine the effects of initial
level of simulated cooperation on S’s own level of cooperation. The results
failed to demonstrate that the level of simulated cooperation employed
functioned to determine choice by S. Ss had been stratified according to a
test of character which failed to predict choice. However, a 2nd scoring
formula employed on this test did significantly distinguish the amount of
R choice emitted by S and deserves further examination.


The purpose of this experiment is to de- 
termine the effects of initial level of coopera- 
tive behavior displayed by a simulated 
“other” on the choices of subjects playing a 
Prisoner’s Dilemma game.⁴ We are primarily


¹ The research reported in this paper was sup-
ported by Grants M-3291 and M-3291C1 from
the National Institute of Mental Health, United
States Public Health Service.

² Now at Michigan State University.

³ Now at Nebraska Psychiatric Institute.

⁴ For those unfamiliar with the properties of
this game, the following observations by Luce and 
Raiffa (1957) may prove useful: 

Two suspects are taken into custody and 
separated. The district attorney is certain they 
are guilty of a specific crime, but he does not 
have adequate evidence to convict them at a 
trial. He points out to each prisoner that each 
has two alternatives: to confess to the crime 
the police are sure they have done or not to 
confess. If they do not confess then the district
attorney states that he will book them on some 
very minor trumped up charge . . . ; if they both 
confess, they will be prosecuted, but he will 
recommend less than the most severe sentence; 
but if one confesses and the other does not, then 
the confessor will receive lenient treatment for 
turning states’ evidence whereas the latter will 
get the “book” slapped at him (p. 95). 

The dilemma which the prisoners face resides in
their isolation and resultant lack of mutual under-
standing and trust. The one prisoner thinks,

I would like to cooperate and give my partner
(and myself) the break and not confess, but I
have to trust that he would not confess too. At 


interested in testing the hypothesis that a 
predominantly cooperative strategy on the 
part of the other player will tend to elicit 
similar behavior in return. The work of 
Deutsch (1958) and Solomon (1960) in- 
dicates that trust of the other person plays 
a critical role in determining choices made in 
Prisoner’s Dilemma games, and we are at- 
tempting to determine if such trust can be 
communicated by the initial behavior of the 
other participant. 

Prisoner’s Dilemma games may be re-
garded as mixed motive games (Schelling,
1960), since the player has to choose be- 
tween making a choice which increases the 
total gains of both participants (if the other 
reciprocates) or a choice which increases his 
immediate gains regardless of the choice of 
the other player. While immediate self-in- 
terest dictates the noncooperative choice, it 


the same time, I’m tempted to try for the very 
least penalty which would be mine if he does 
not confess while I do, and I know he will be 
tempted by this also. Since I suspect he is
tempted to confess, it would be foolish for me
not to confess thus getting the maximum penalty
while he gets off easy. (And if I think about it
I would reason he has arrived at the same con-
clusion.) Thus, since I expect him to confess
(which he does partly because he reasons as I
do), the only course for me is to confess also.
This means we both will probably receive fairly
severe penalties, but what else can we do?
CHOICES IN PRISONER’S DILEMMA GAME


is apparent that long-term self-interest and 
regard for the other person dictates the 
cooperative choice over a series of trials.⁵
However, the experimental work of Scodel, 
Minas, Ratoosh, and Lipetz (1959) and 
Minas, Scodel, Marlowe, and Rawson (1960) 
indicates that the level of cooperative choice 
in repeated Prisoner’s Dilemma games is low 
and often tends to be lower for the later 
trials. The results were interpreted as in- 
dicating a tendency to maximize the differ- 
ence between personal gain and that of the 
other, which, in effect, makes the game a 
purely competitive one. Since an inability of 
the subjects to respond to long-term best 
interests in social situations may be re- 
sponsible for such preferences, a second 
purpose of our experiment is to determine 
the effect of the level of response on a test 
designed to measure sensitivity to the ethical 
aspects of social decision making. 

Since we sampled subjects in pairs of the 
same sex, we can also determine whether sex 
differences exist or interact with the variables 
mentioned above. 


METHOD 
Subjects 


Subjects were 24 male and 24 female students 
enrolled in general psychology and sociology classes 
at Kent State University. 


Apparatus 


Each subject is seated in one of two soundproof 
booths measuring 84 inches X 48 inches X 40 inches, 
containing an overhead 100-watt lamp, a ventilating 
fan, a secretarial stool, and a window (shuttered 
for this experiment). Each booth contained a 
$534-inch X 1614-inch panel making approximately 
a 20-degree angle from the back walls and situated 
on a 3534-inch X 21!4-inch X 28-inch table rest. 
The panel contained a matrix of four small amber 
lights forming a 4-inch square. Through the two 
lights on the left column ran a %-inch red line 
intersecting a small button at the base of the
panel; a 14-inch blue line connected the right
column with the right-hand button. A 34-inch red
line also intersected the top row of two lights,
while a 34-inch blue line joined the bottom row.
When the left (R) button is pushed, one of the
two bulbs on the left intersected by the red line
will light, depending on what the other person (or
experimenter) chooses to do. The word “you” ap-
pears over the red and blue columns and “other”
appears adjacent to the red and blue rows.

⁵ A similar argument is presented by Luce and
Raiffa (1957).

                       Person II
                   Red (R)   Blue (B)
Person I  Red (R)   3, 3      0, 5
          Blue (B)  5, 0      1, 1

Fig. 1. This payoff matrix shows the return to
Person I (the first number) and Person II (the sec-
ond number) for each of four combinations of choice
(RR, RB, BR, and BB) possibilities. (Entries
represent money in cents.)

Furthermore, just over each light is written the payoff
(see Figure 1), while to the left of each light 
appears the payoff to the other person. In this 
way, the subject has a visual record of the results 
of his choice and that of his partner (i.e., of the
experimenter). 

A small white light is situated at the top of 
the panel serving to indicate the beginning and 
ending of the trial, which lasted 11.25 seconds. A 
counter to the left of the matrix serves to indicate 
the trial number. Each subject is provided with a 
Tally Register hand-counter which allowed him to 
keep a cumulative record of his winnings.

A control panel, unseen by the subject, allowed 
the experimenter to switch subjects from a circuit 
designed for interpersonal games to one where each 
subject actually plays with the experimenter. A 
matrix of counters registers the payoff for each 
subject and also counts the trials. 


Procedure 


In Figure 1, the payoff matrix which was em- 
ployed in this study is presented. Cooperative and 
competitive choices are defined in terms of this 
matrix. For Person I (or II) to choose R implies a
measure of trust that the other person will not 
thereby choose B, rendering no return for I, and 
it also implies a willingness to forego the highest 
gain (5¢) in order to insure a maximum joint 
return (3¢ and 3¢). R, therefore, will represent the 
cooperative choice; B the competitive choice. 
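The definition of cooperative and competitive choice can be illustrated with a small lookup over the payoff matrix. This is only a sketch: the cell values follow Figure 1 and the "3¢ and 3¢" joint return mentioned above, while the names `PAYOFFS`, `payoff`, and `joint_return` are hypothetical and introduced here for illustration.

```python
# Payoffs in cents, keyed by (Person I choice, Person II choice).
# Each value is (return to Person I, return to Person II).
PAYOFFS = {
    ("R", "R"): (3, 3),
    ("R", "B"): (0, 5),
    ("B", "R"): (5, 0),
    ("B", "B"): (1, 1),
}


def payoff(choice_i: str, choice_ii: str) -> tuple:
    """Return the pair of payoffs for one trial."""
    return PAYOFFS[(choice_i, choice_ii)]


def joint_return(choice_i: str, choice_ii: str) -> int:
    """Sum of both players' payoffs for one trial."""
    return sum(payoff(choice_i, choice_ii))


if __name__ == "__main__":
    for cell in PAYOFFS:
        print(cell, payoff(*cell), "joint:", joint_return(*cell))
```

Mutual R gives the largest joint return, whereas B against R gives the largest individual return, which is the sense in which R is called cooperative and B competitive.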

The character measure (see section immediately
following) was administered to whole classes at
a time, scored for (N + F)/(D + O), and divided
by thirds into those scoring a high, middle, and
low ethicality ratio. Half of the subjects at each
level were randomly assigned to the high
simulated R group and half to the high simulated B group.
In the first group, pairs of like-sexed persons, while
believing they were playing each other, were actually
playing the experimenter, who played a random R
83% of the time; in the second group, the experi-
menter played a random B 83% of the time.

Subjects were seated in their separate booths and
instructed about the apparatus, the payoff
matrix (see Figure 1), the hand-counter, and
other relevant aspects of the procedure. Every
care was taken to make clear the nature of the
payoff (including a warm-up period of 10 “free”
trials when subjects were playing each other), yet
care was also taken to avoid prejudicing choice,
suggesting a competitive or cooperative set, or even
employing the term “game,” “winning,” etc. The 
doors to the booths were then closed, and the 
trials begun. After 30 trials the experimenter 
altered his strategy to an 83% match (17% random 
mismatch) which continued for 60 trials for all 
subjects. Total length of time elapsing from the 
moment of introducing the subject to the booth to 
termination was 40 minutes. 
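The two-phase strategy of the simulated partner can be sketched as follows. This is an illustration under stated assumptions rather than the experimenters' actual procedure: the function name, the seeded random generator, and the choice to pass the subject's choice as an argument are all hypothetical, and the text does not settle here whether matching used the subject's current or previous choice.

```python
import random

OPPOSITE = {"R": "B", "B": "R"}


def simulated_choice(trial, subject_choice, condition, rng, p=0.83):
    """Choice of the simulated partner on a 1-based trial number.

    Trials 1-30 (treatment): play the condition's letter ("R" or "B")
    with probability p, otherwise the opposite letter.
    Trials 31-90 (test): match subject_choice with probability p,
    otherwise mismatch it.
    """
    if trial <= 30:
        target = condition
    else:
        target = subject_choice
    return target if rng.random() < p else OPPOSITE[target]


if __name__ == "__main__":
    rng = random.Random(1963)  # arbitrary seed, for reproducibility only
    treatment = [simulated_choice(t, "B", "R", rng) for t in range(1, 31)]
    print("R choices in 30 treatment trials:", treatment.count("R"))
```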


Measure of Character 


A previous effort (Wilson & Bixenstine, 1961) 
to survey experimentally the value of a number of 
personality measures in predicting game behavior 
had proven largely unilluminating. The measure 
employed herein was chosen entirely for its face 
validity and suggestive logical relationship to our 
analysis of the Prisoner’s Dilemma game. This
measure was designed to distinguish four different 
approaches to intrapersonal conflict situations. The 
situations, portrayed in brief story dramas, are 
similar to the dynamics of our games in that the 
hero has to consider his own interests and those 
of others and must resolve this conflict either 
ethically (regarding the other person’s welfare or 
gain) or selfishly (maximizing his own immediate 
gain). The test has 20 such stories, half with fe- 
male heroines, half with male heroes; furthermore, 
half, balanced by sex of hero, are resolved ethically, 
half are resolved selfishly. Subjects are to read the 
stories and the four comments after each; they are 
then to judge the comments for agreement along a 
seven-point scale with their own reaction to the 
hero’s decision. They, thus, make 80 separate 
judgments over the 20 stories.

The four comments are each drawn from a par- 
ticular class. Briefly, these are: N (Normal), an 
approval of the ethical hero (disapproval of the 
selfish hero) based upon what is reasonable, 
workable and equitable; F (Freudian neurotic), 
an approval of the ethical hero (disapproval of 
the selfish hero) but with special emphasis upon 
rigid rules and unreasoned obedience; D (Dis- 
sociative), an approval of the selfish hero (dis- 
approval of the ethical hero) based upon an 
essential denial of the conflict; and O (Obsociative), 
an approval of the selfish hero (disapproval of 
the ethical hero) buttressed by an assertion that 
the selfish choice is really ethical (and the ethical 
one, really selfish).⁶

Since N and F are both approving reactions to
the ethical resolution, and D and O are both dis-
approving thereof, the ratio (N + F)/(D + O)
would appear to be a gross measure of the degree
to which a person tends to emphasize general
regard for others over regard for self (granting
the basic assumptions that the stories elicit identifi-
cation with the hero and that one defends or
attacks the hero in the degree to which one's identity
therewith is preserved or destroyed by the hero's
actions).

⁶ Persons desiring a copy of this measure may
write V. Edwin Bixenstine, Kent State University,
Kent, Ohio. A recent reference wherein a rather
similar categorization of ethical behavior is pre-
sented is Peck and Havighurst (1960), and another
which develops a very similar measurement technique
is Miller and Swanson (1960).

Another scoring formula which seemed of logical
value was (N − F), which renders the degree to
which agreement (or disagreement) with hero is
based on rational-egalitarian reasons versus ir-
rational-authoritarian reasons.⁷ The F score would
reasonably have a measure of judgmental hyper- 
criticality with an “eye-for-an-eye” theme pre- 
dominating. With reference to our game situation, 
such an attitude would little tolerate a departure 
from “purity” by one’s partner so that any B 
choice would invite a prompt retaliatory B choice 
in return. 

Obviously, (N + F)/(D + O) as a general measure
of “ethicality” might not effectively predict game
behavior if our reasoning about the nature of F
prevails. Nonetheless, this ratio appeared to be a
reasonable first step. No reliability estimate exists
for (N + F)/(D + O), as such; however, the test-
retest r’s on N, F, D, and O separately range from
.78 to .88 for men and .62 to .79 for women.
Furthermore, r’s between N and F were .60 and
.66 for men and women, respectively, while r’s
between D and O were .69 and .50 for men and
women, respectively, which indicated that these
pairs of measures are nearly as closely related as
the reliabilities will permit us to estimate. Further-
more, r’s between N and F, on the one hand, and
D and O on the other, are either negative (−.33
to −.56 for men) or zero (−.07 to .01 for women),
which is also compatible with the (N + F)/(D + O)
ratio as a gross scoring scheme.


RESULTS 


None of the variables studied (treatment,
personality, or sex) proved to have an effect on
the subject’s R choice, and the trend for
the character measure was contrary to hy- 
pothesis. Due to heterogeneity of variance, 
the chi square and Kruskal-Wallis tests 
(Senders, 1958) were employed, revealing no 
significant effects. 

Subjects were reclassified into high, middle, 
and low groups on the basis of our alternate 
character measure, (N — F). Table 1 reports 
mean proportions of R choice under this 
classification and indicates that high (N — F) 
individuals cooperate more than middle and 
low (N − F) persons.

⁷ This scoring formula and its logic were developed
by H. M. Potash.


TABLE 1

MEAN PROPORTION R CHOICE BY SEX, (N − F) GROUP, AND INITIAL SIMULATED R CHOICE
LEVEL FOR INITIAL 30 TRIALS

(N − F) group |     Females     |      Males      |  M
              | 83% R  | 17% R  | 83% R  | 17% R  |
High          | .433   | .300   | .367   | .417   | .379
Middle        | .367   | .233   | .233   | .189   | .256
Low           | .227   | .293   | .367   | .280   | .292
M             | .331   | .281   | .333   | .303   | .312

Note.—Ns for High, Middle, and Low groups are 16, 12, and 20, respectively.


The data for which means are reported
in Table 1 were subjected to an arc-sine trans-
formation to correct for heterogeneity of
variance (see Walker & Lev, 1953, pp. 423-424),
and separate analyses of variance were
made for the initial 30-trial block and the
final 60-trial block. These results are re-
ported in Table 2.
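An arc-sine transformation of proportions can be sketched as below. The exact variant applied is not stated in the text; this sketch assumes the form arcsin(√p) expressed in degrees, and the function name is hypothetical. The three proportions are cell means from Table 1, used purely as example inputs (the transformation would ordinarily be applied to each subject's proportion before the analysis).

```python
import math


def arcsine_transform(p: float) -> float:
    """Return arcsin(sqrt(p)) in degrees for a proportion p in [0, 1].

    Assumed variant; other texts use 2 * arcsin(sqrt(p)) in radians.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("proportion must lie between 0 and 1")
    return math.degrees(math.asin(math.sqrt(p)))


if __name__ == "__main__":
    example_proportions = {"High": 0.433, "Middle": 0.367, "Low": 0.227}
    for group, p in example_proportions.items():
        print(f"{group}: p = {p:.3f} -> {arcsine_transform(p):.2f} degrees")
```

The transformation stretches the scale near 0 and 1, so that the variance of the transformed score depends far less on the size of the proportion itself.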

Both analyses in Table 2 indicate that 
(N —F) level has an effect significant at 
the .05 level. The only other point of agree- 
ment is a small and nonsignificant effect at- 
tributable to the interaction of Sex, Initial 
R level, and (N — F) level. A final point of 
interest is the larger error term obtained for 
the final 60-trial block. Since the 83% match- 
ing counter strategy used in this block of 
trials tended to “feed back” to each subject 
a response pattern similar to his own, it is
likely that individual differences were ac- 
centuated. 
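The F ratios discussed above are each an effect mean square divided by the error mean square of the same trial block. A minimal sketch of that check follows, using mean squares read from Table 2; the dictionary layout and function name are assumptions made for illustration.

```python
# Mean squares as read from Table 2 (error df = 36 in both analyses).
ERROR_MS = {"first_30": 78.49, "last_60": 93.64}
EFFECT_MS = {
    "(N - F) level (C)": {"first_30": 269.63, "last_60": 306.25},
    "A x B x C": {"first_30": 140.70, "last_60": 157.27},
}


def f_ratio(effect_ms: float, error_ms: float) -> float:
    """F ratio for a fixed effect tested against the error mean square."""
    return effect_ms / error_ms


if __name__ == "__main__":
    for source, by_block in EFFECT_MS.items():
        for block, ms in by_block.items():
            print(f"{source:18s} {block}: F = {f_ratio(ms, ERROR_MS[block]):.3f}")
```

Ratios recomputed this way should agree with the tabled F values only to within rounding of the printed mean squares.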

TABLE 2

ANALYSES OF VARIANCE FOR DATA OF FIRST 30 AND LAST 60 TRIALS

                            First 30 trials      Last 60 trials
Source                df      MS        F          MS        F
Sex (A)                1      2.08     <1         27.75     <1
Initial R level (B)    1     58.48     <1         37.27     <1
(N − F) level (C)      2    269.63    3.432*     306.25    3.271*
A × B                  1      6.34     <1        228.38    2.439
A × C                  2    142.28    1.813       11.67     <1
B × C                  2     21.78     <1         82.11     <1
A × B × C              2    140.70    1.793      157.27    1.680
Error                 36     78.49                93.64

* p < .05.


According to Luce and Raiffa (1957), the
last or terminal trial often has its own
strategy and deserves to be analyzed sepa-
rately. The reasoning here is that subjects
are aware that no retaliatory consequences
are thereafter in store. Consequently, sub-
jects cooperating due to the external con-
trols inherent in the situation (game) would
revert to a competitive choice on the last
trial, while subjects cooperating out of basic
characterological disposition (i.e., internal
controls) would choose cooperatively even on
the last trial.

In Table 3 are presented the ratios of R
to B choice for the various (N − F) groups
on the last as compared with total (60) trials.
If we assume that (N − F) is a measure of
inner control and that high (N − F) would
cooperate out of characterological rather than
situational pressure, then we would expect
the high (N − F) to show no more R choice
on the last trial than before, while low
(N − F) would tend to alter in the direction
of higher B choice. The ratios in Table 3
support this inference. Basing expected fre-
quencies on the 60 test trial ratios, a chi
square of 9.93 (df = 3, p = .02) was found,
indicating that these last trial changes were a
significant departure from prior performance.


TABLE 3

RATIOS OF R TO B CHOICE FOR VARIOUS (N − F) GROUPS

              High     Middle    Low
All trials    1/2.4    1/3.3     1/3.3
Last trial    1/1      1/11      1/9


The performance of the subjects was also 
examined with regard to the N, F, D, and O 
scores alone without discovery of any evident 
correlations. In general, the subjects tended 
to rank their preferences in an N, F, D, O 
order which may well reflect "social desir- 
ability” (Edwards, 1957). The (N— F) 
score may well have reflected an inability 
to distinguish N and F reactions reliably, 
which would not be obscured by the "social 
desirability” effect. There are two other 
independent difference scores which could be 
computed. One of these, (D — O), is analog- 
ous to the (N—F) score but reflects in- 
ability to distinguish between disagreements 
with the “ethical” choice. The other, [(N +
F) − (D + O)], represents the net tendency
to discriminate the ethical from the un-
ethical choices and is analogous to the
(N + F)/(D + O) measure already men-
tioned. None of these alternative scoring
systems appeared to have any predictive
value.
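The four scoring formulas considered in this study can be collected in one short sketch. It assumes that N, F, D, and O are each available as a single total per subject (for instance, summed agreement ratings for the 20 comments of that class); that aggregation, the class name, and the example values are hypothetical and are not taken from the test manual.

```python
from dataclasses import dataclass


@dataclass
class CharacterScores:
    """Per-subject totals for the four comment classes (assumed aggregation)."""

    n: float
    f: float
    d: float
    o: float

    def ethicality_ratio(self) -> float:
        """(N + F)/(D + O): the initial measure of ethicality."""
        return (self.n + self.f) / (self.d + self.o)

    def n_minus_f(self) -> float:
        """(N - F): the alternate measure."""
        return self.n - self.f

    def d_minus_o(self) -> float:
        """(D - O): the analogous difference for the disapproving classes."""
        return self.d - self.o

    def net_discrimination(self) -> float:
        """(N + F) - (D + O): net tendency to favor the ethical choice."""
        return (self.n + self.f) - (self.d + self.o)


if __name__ == "__main__":
    subject = CharacterScores(n=100, f=80, d=50, o=40)  # invented totals
    print(subject.ethicality_ratio(), subject.n_minus_f(),
          subject.d_minus_o(), subject.net_discrimination())
```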

DISCUSSION


That no difference in amount of coopera- 
tive behavior (R choices) could be assigned 
to the level of cooperation evinced by the 
subject’s presumed partner is surprising and 
demands further exploration. Deutsch (1958), 
of course, reports that invariable cooperative 
or “benevolent” behavior by a stooge elicits 
no significantly greater cooperation from the 
subject than does competitive, “malevolent,” 
behavior by the stooge, while “conditional 
benevolence” (matching the subject’s choice) 
by the stooge elicited significantly higher 
cooperation by the subject. In our procedure 
the stooge is neither conditionally benevo- 
lent nor invariably benevolent/malevolent. 
We wanted to explore the avenues of inter- 
personal influence and communication open 
to a real player. An actual player could not 
employ a consistent conditional benevolent 
(matching) strategy without prior knowledge 
of the partner's choice. On the other hand, 
we did not employ invariable benevolence 
because exploratory work revealed subjects 
do not believe they are really playing an 
"honest" or real person. For one's partner 
to persist in “communicating” a cooperative
preference (i.e., choosing R) when one finds
noncooperation (i.e., choosing B) under these
conditions all the more lucrative (thus, giving
one 5¢, one's partner 0¢) apparently is too
self-denigrating to be believed.

None of our subjects evidenced any sus- 
picion that they were not playing the person 
in the other booth. Presuming on this basis 
that our 83% R (or B) strategy is a reason- 
able (believable) one, our results indicated 
that the Prisoner's Dilemma game is such 
that either no avenues of interpersonal com- 
munication exist or that those that do are 
enfeebled to a point of impotence. This con- 
clusion would represent a second and final 
negation of the Luce-Raiffa thesis where the 
first negating step was made by Scodel et al. 
(1959) and Minas et al. (1960). Luce and 
Raiffa (1957) predicted that when the
Prisoner's Dilemma is reiterated, avenues of
interpersonal communication should appear in
repeated choices (not current in just one
choice) and partners would ultimately rest in 
a profitable joint cooperative-strategy. Scodel 
et al. and Minas et al. found that this not 
only was in error, but that partners actually 
become more confirmed in a mutual com- 
petitive strategy as trials proceed. Their 
design employed true two-person play so 
that they leave unanswered the question 
whether a mutual-cooperative strategy could 
be reached by a persistent effort to com- 
municate a cooperative bent. Our results 
would seem to answer this last question also 
in the negative. The Prisoner’s Dilemma 
game, we may conclude at this point, is not 
merely resistive to a cooperative resolution 
with reiteration, it actually obstructs the 
formation of such a resolution. Insofar as this 
kind of game captures and crudely mirrors 
the dilemmas of any social enterprise, the 
evidence to date suggests that the social 
contract does not evolve merely from evi- 
dence of good will. In addition to or in place 
of this turning-the-other-cheek strategy, an 
essential ingredient would be recourse to 
instruction, sanction, and control discussed 
in another paper (Wilson &  Bixenstine, 
1962; see also Deutsch, 1958, and Solomon, 
1960). 

The reason for the “perversity” in the 
make-up of this game is not hard to find. 
When one's partner persists in R (refer to
Figure 1) on an 83% schedule, one makes an
average of 4.32¢ per trial choosing B, and
an average of 2.49¢ per trial choosing a
(responsive) R. True, in choosing B one is
not being thoughtful of one's partner who is
making merely .41¢ per trial when he could
make 3¢ or more if one were to choose a
responsive R; but apparently, this is a
minor consideration. Of course, when one’s 
partner chooses B 83% of the time, then B 
play in return is not only financially sensible; 
it satisfies whatever retaliatory urge one may 
have. 
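The per-trial averages for one's own return can be reproduced as expected values over the partner's 83%/17% schedule. The sketch below assumes the Figure 1 payoffs and a partner whose choice is independent of one's own on each trial; the names are illustrative only.

```python
# Return to oneself in cents, keyed by (own choice, partner's choice).
PAYOFF_TO_SELF = {
    ("R", "R"): 3,
    ("R", "B"): 0,
    ("B", "R"): 5,
    ("B", "B"): 1,
}


def expected_return(own_choice: str, partner_p_r: float) -> float:
    """Average return per trial when the partner plays R with probability partner_p_r."""
    return (partner_p_r * PAYOFF_TO_SELF[(own_choice, "R")]
            + (1.0 - partner_p_r) * PAYOFF_TO_SELF[(own_choice, "B")])


if __name__ == "__main__":
    for partner_p_r in (0.83, 0.17):
        for own in ("B", "R"):
            value = expected_return(own, partner_p_r)
            print(f"partner R = {partner_p_r:.2f}, own {own}: {value:.2f} cents")
```

Under either schedule B yields the higher average return to oneself, which is the financial incentive the paragraph above describes.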

The reasoning presented above appears to 
us to be sound, but is nonetheless subject to 
a few reservations. First of all, we are as- 
suming that essentially all that can be done 
to effect communication in a simultaneous 
game such as played herein would be to 
maintain a persistent preference for R 
or B. However, a pattern or contingent
strategy might prevail. Exactly matching the 
other person’s choice is an example of a 
contingent strategy, but one not possible 
under choice simultaneity. Matching the 
other person’s previous choice is another 
contingent strategy and one which is possible. 
Actually, the reader will recall that the test 
trials (60 of the 90) were on a matching, 
contingent strategy (83% match, 17% mis- 
match under the cooperative treatment). No 
difference was found between the group of 
30 treatment and group of 60 test trials. 
This finding and results in other work in 
progress using the matching strategies have 
suggested that it is no more useful in com- 
municating with and controlling the other 
person than is the persistent preference 
strategy. Subjects change to R in order (in 
their words) to “get the other to choose R, 
so I can choose B and make more.” How- 
ever, a formal and controlled comparison of 
“contingent” with “persistent preference” 
strategies would be useful. 

Another reservation is that 83% R choice 
might not really be as clearly a commitment 
to a cooperative strategy viewed from the 
vantage of a naive subject as when viewed 
from our more global and sophisticated 
vantage. In addition, we feel that it takes, in 
effect, only a little bit of malevolent and 
competitive B to elicit B in return, while 
proving one’s altruism is much more difficult.
We plan to test this reservation experi- 
mentally by increasing the degree of co- 
operative/competitive choice ratio as far as 
the credulity of our subjects will permit. 

Before closing our discussion, we will treat
briefly the findings regarding our character
measure. The initial measure of ethicality,
the ratio (N + F)/(D + O), was uncor-
related with game behavior. In view of the
fact that the second measure, (N − F), was
employed after having eliminated (N + F)/
(D + O), the independence of and, thus, real
significance of (N − F) is somewhat ques-
tionable. The fact, however, that this measure
distinguished game behavior over both “treat-
ment” and “test” trials and that the last
trial strategies were consonant with predic-
tions based on the measure tends to strengthen
confidence in it.

While the general measurement approach
is quite suggestive and has high face validity, 
the (N — F) score derived therefrom is at 
this point factorially obscure. As noted 
earlier, (N — F) may be related to "social 
desirability." Deutsch (as reported in Joseph, 
1960) found a relationship between the F 
Scale and "trustworthy" (cooperative) choice 
in a Prisoner's Dilemma game. Of course, the 
original thinking which gave rise to the 
score viewed it as a measure of “ethicality” 
free of a judgmental and recriminatory 
character. The fact that high (N - F)
predicts a higher rate of R play is a welcome 
bit of empirical correlation. However, we 
feel that the relationship is in need of replica- 
tion and that an examination of the scale's 
relationship with other measures is called for. 


REMINISCENCE AND STAGE OF ILLNESS
IN SCHIZOPHRENIA¹


JERRY HIGGINS AND SARNOFF A. MEDNICK
University of Michigan 


When given a repetitive inverse alphabet printing task to perform for 5 min. 
of massed practice, Early Stage schizophrenics (N = 16) displayed a greater
degree of improvement in performance (reminiscence) after a 2-min. rest than 
did Advanced Stage schizophrenics (N = 16). Early Stage and Advanced Stage 
schizophrenia were defined in terms of a theory of schizophrenia which postu- 
lates differential levels of arousal as characteristic of the stages of the disorder. 


Mednick and DeVito (Mednick, 1958) 
have found groups of schizophrenics to be 
differentially susceptible to associative re- 
sponse competition in a verbal learning task. 
These results were attributed to different lev- 
els of arousal characteristic of different stages 
of the schizophrenic illness. Such an inter- 
pretation followed from a learning theory ap- 
proach to the development of schizophrenia 
in which early and advanced stages of the 
disorder are conceived of as poles of a learn- 
ing process continuum. This continuum pro- 
ceeds from a state of heightened arousal 
(Early Stage) to a state of reduced, even- 
tually subnormal, arousal resulting from the 
acquisition of anxiety avoidant associative re- 
sponses (Advanced Stage). In terms of arousal 
level, the normal individual might be ex- 
pected to fall somewhere between the height- 
ened arousal of the early schizophrenic and 
the lowered arousal of the advanced schizophrenic.
The present experiment was an attempt
to test the tenability of the postulate
of differential arousal levels in schizophrenia
in the area of psychomotor behavior.

Many studies on retention have shown a
rise rather than the expected fall in retention
over time (Ward, 1937); this phenomenon
has been labeled reminiscence. Bleke
(1955) and Venables and Tizard (1956)
have investigated psychomotor reminiscence
in groups of schizophrenics somewhat comparable
to Early Stage and Advanced Stage
types. However, in both studies certain features
of experimental design render their results
equivocal for the present problem.
Bleke was concerned with the differential effects
of reward and punishment on reminiscence
rather than with reminiscence per se;
Venables and Tizard's simple task of key
pressing to light stimuli appears to have a
low ceiling which was probably approached
in their 10-minute prerest learning period,
thereby suppressing postrest improvement.
The present study did not impose such conditions
as Bleke's reward and punishment
and employed a more difficult experimental
task with a shorter prerest learning period
than that of Venables and Tizard.


1 This research was supported by the National
Association for Mental Health and the National
Institute of Mental Health, United States Public
Health Service. The authors wish to thank Dorothy
Marquis of the Ann Arbor Veterans Administration
Hospital and Alexander Dukay and Eira Mattsson
of Ypsilanti State Hospital for their generous
assistance.

Combining the postulate that Early Stage 
schizophrenics are in a state of high arousal 
while Advanced Stage schizophrenics are in a 
state of reduced or low arousal with Kimble's 
(1950) finding that greater arousal results in 
greater psychomotor reminiscence, the following
prediction was made: under identical
test conditions, Early Stage schizophrenics
exhibit greater reminiscence effects than Advanced
Stage schizophrenics. It was also expected
that reminiscence effects shown by a
reference group of normals would fall be- 
tween the reminiscence effects displayed by 
the two schizophrenic groups. 


METHOD 


Subjects. Thirty-two male schizophrenics (median 
age, 38 years) were tested; 23 were drawn from Ann 
Arbor Veterans Administration Hospital, 9 from 
Ypsilanti State Hospital. Sixteen normals (median 
age, 48 years) were tested at the same Veterans 
Administration Hospital. Because the schizophrenic subjects were on drugs, and had
been hospitalized for considerably longer periods of
time than the normals, the meaningfulness of any
comparisons between the normal and schizophrenic
groups is somewhat attenuated. In view of these
factors it was decided to employ the normals primarily
as a reference group, with the main analyses
being concerned with reminiscence differences between
the Early Stage and Advanced Stage schizophrenic
groups.
Scales. Following an item analysis for internal
consistency, a set of “Stage of Illness Scales” was
developed for roughly locating schizophrenics along
the early-advanced continuum. The Scales as a whole
attempt to determine the stage to which the schizophrenic
process has progressed. This assessment was
approached somewhat arbitrarily, but conveniently
for rating purposes, by dividing the continuous
schizophrenic process into three time blocks. A single
scale was allotted to each of these three periods,
resulting in the following measures: degree of premorbid
withdrawal (score range: 1 to 5), rate of
onset of symptoms (score range: 2 to 10), and duration
of hospitalization (score range: 0 or 5). Thus
the Scales take into account the total duration of
the illness, in and/or out of the hospital. An estimate
of the extent to which the patient's illness has
progressed is obtained by summing his scores on the
three scales, yielding a Stage of Illness Index.
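
The summing of the three scale scores into a Stage of Illness Index can be sketched as follows. This is only an illustration: the function and argument names are hypothetical, and the hospitalization scale is read literally as taking the value 0 or 5, which is an assumption about the scoring rather than something the text spells out.

```python
# Illustrative sketch; names are hypothetical, score ranges are those quoted in the text.
WITHDRAWAL_RANGE = range(1, 6)      # degree of premorbid withdrawal: 1 to 5
ONSET_RANGE = range(2, 11)          # rate of onset of symptoms: 2 to 10
HOSPITALIZATION_VALUES = (0, 5)     # duration of hospitalization: 0 or 5 (assumed literal)


def stage_of_illness_index(withdrawal, onset, hospitalization):
    """Sum the three scale scores after checking each against its stated range."""
    if withdrawal not in WITHDRAWAL_RANGE:
        raise ValueError("premorbid withdrawal score must be 1 to 5")
    if onset not in ONSET_RANGE:
        raise ValueError("rate of onset score must be 2 to 10")
    if hospitalization not in HOSPITALIZATION_VALUES:
        raise ValueError("hospitalization score must be 0 or 5")
    return withdrawal + onset + hospitalization


if __name__ == "__main__":
    # Hypothetical ratings for one patient, not data from the study.
    print(stage_of_illness_index(withdrawal=3, onset=6, hospitalization=5))
```

Under these assumptions the Index can range from 3 to 20.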
Each schizophrenic subject was rated on the Stage
of Illness Scales by the psychiatrist, psychologist, or
social worker most closely associated with the given
patient. (As in most instances it was difficult to find
more than one staff member whose familiarity with
the patient's case was adequate, estimation of the
reliability of the Scales was regrettably impossible.)
The schizophrenic subjects were broken at the
median Stage of Illness Index into Early Stage
(N = 16, median age, 36.0 years, median hospitalization,
4.0 months) and Advanced Stage (N = 16,
median age, 39.5 years, median hospitalization, 11.2
months) groups.
Procedure. The experimental task was that of
inverse alphabet printing, previously used by Kimble
(1949) in research on reminiscence. After a warm-up
period which consisted of printing the complete
alphabet in the customary fashion, the subject was
given an untimed “practice” period to ensure his
understanding of the task. In the practice period the
subject was required to copy (once) inverted examples
of the first 10 letters of the alphabet. The
subject was then instructed to, “Start printing out
the whole alphabet from right to left, printing each
letter upside down.” Each subject was given five
massed 1-minute learning trials. To maximize the effort
expended, the subject was told and periodically
reminded to “Always work as fast as you can” and
“If your hand should become tired it is most im-

TABLE 1
REMINISCENCE EFFECTS DISPLAYED BY THE
EXPERIMENTAL GROUPS

                 Early Stage       Normal reference   Advanced Stage
Reminiscence     schizophrenics    group              schizophrenics
None             3                 4                  8
Moderate         5                 7                  6
High             8                 5                  2


Note. N = 16 for each group.


instructed to begin the inverse alphabet printing 
again, and was given two massed trials of 1 minute 
each. All trials began with the letter “A.”

In an investigation of methods of measuring 
reminiscence, McGeoch (1935) pointed out that the 
occurrence of the phenomenon of reminiscence is 
essentially an all-or-none phenomenon with respect 
to individual subjects; they either show an improve- 
ment in performance following rest or they do not. 
Thus, measures of reminiscence based on average 
performance of the whole group are equivocal, in 
that such group averages include those subjects who 
failed to show reminiscence, thereby masking a por- 
tion of the reminiscence actually shown (by the other 
subjects). McGeoch concludes, therefore, that the 
proportion of subjects in a group exhibiting remi- 
niscence and the amount shown by them are more 
valid measures of reminiscence. The present study 
adopted McGeoch's recommendations. A subject was
tabulated as having displayed reminiscence if the
number of letters he printed on the first postrest
trial was greater than the number of letters he
printed on the last prerest trial, i.e., if his performance
improved following rest. For each subject who
showed reminiscence, amount of reminiscence was 
calculated by subtracting the number of responses 
he made on the last prerest trial from the number 
of responses he made on the first postrest trial. 
Dichotomizing as close to the median as possible, 
amounts of improvement in postrest performance 
of four responses or less were tabulated as Moderate 
Reminiscence; improvements of five responses or 
more were tabulated as High Reminiscence. 
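
The scoring rule just described can be sketched in a few lines. The function name and the example trial scores are hypothetical; the cutoffs (no improvement, one to four responses, five or more) are the ones stated above.

```python
def classify_reminiscence(last_prerest, first_postrest):
    """Return the reminiscence category for one subject.

    last_prerest:   letters printed on the last prerest trial
    first_postrest: letters printed on the first postrest trial
    """
    gain = first_postrest - last_prerest
    if gain <= 0:
        return "None"        # no improvement following rest
    if gain <= 4:
        return "Moderate"    # improvement of four responses or less
    return "High"            # improvement of five responses or more


if __name__ == "__main__":
    # Hypothetical scores for three subjects, not data from the study.
    for prerest, postrest in [(26, 25), (26, 29), (18, 24)]:
        print(prerest, postrest, classify_reminiscence(prerest, postrest))
```

Counting subjects per category within each group then gives a frequency table of the kind shown in Table 1.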


RESULTS 


It can be seen from Table 1 that the Early 
Stage schizophrenic subjects displayed the 
greatest reminiscence effects, the Advanced 
Stage schizophrenics displayed the least remi- 
niscence effects, and the normal reference 
group displayed intermediate reminiscence 
effects. In agreement with the major hypothe- 
sis, that the Early Stage schizophrenics would 
exhibit greater reminiscence effects than 
would the Advanced Stage schizophrenics, the 
obtained difference in reminiscence between 
the two schizophrenic groups (results for
normals not considered) was significant (χ² =
5.98, df = 2, p < .026, one-tailed). That this
difference was not due simply to the greater 
duration of institutionalization of the Ad- 
vanced Stage group was evident; when the 
schizophrenic subjects were divided at their 
median number of months of hospitalization 
(7.5), the two resulting groups showed no 
difference in reminiscence (χ² = .05, df = 2,
p > .97, two-tailed).

When the normal group was considered the 
results were in the predicted direction but 
failed to reach statistical significance (χ² =
6.74, df = 4, p < .08, one-tailed); however,
when only the extreme categories of amount
of reminiscence were considered the results
were significant (χ² = 6.00, df = 2, p < .025,
one-tailed). Thus the secondary hypothesis,
that the reminiscence effects shown by the 
reference group of normals would fall be- 
tween the reminiscence effects shown by the 
Early Stage and Advanced Stage schizo- 
phrenic groups, received weak confirmation. 
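
As a check on the arithmetic, the Pearson chi-square values can be recomputed from the frequencies in Table 1. The sketch below is not the authors' computation: the helper names are hypothetical, the tail probability uses a closed form that holds only for even degrees of freedom, and halving it to obtain a one-tailed value is an assumption about how the reported one-tailed probabilities were derived. Recomputed this way the statistics come out at about 5.96, 6.73, and 5.98, close to the reported 5.98, 6.74, and 6.00.

```python
from math import exp, factorial


def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a frequency table (list of rows)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(col_totals) - 1)
    return stat, df


def upper_tail_even_df(stat, df):
    """Upper-tail chi-square probability; this closed form is valid only for even df."""
    if df <= 0 or df % 2:
        raise ValueError("closed form requires a positive even df")
    half = stat / 2
    return exp(-half) * sum(half ** k / factorial(k) for k in range(df // 2))


# Rows: None, Moderate, High. Columns: Early Stage, Normal reference, Advanced Stage.
TABLE_1 = [[3, 4, 8], [5, 7, 6], [8, 5, 2]]
two_groups = [[row[0], row[2]] for row in TABLE_1]   # schizophrenic groups only
extremes = [TABLE_1[0], TABLE_1[2]]                  # None and High categories only

if __name__ == "__main__":
    for label, table in [("Early vs. Advanced", two_groups),
                         ("all three groups", TABLE_1),
                         ("extreme categories", extremes)]:
        stat, df = chi_square(table)
        one_tailed = upper_tail_even_df(stat, df) / 2   # halving is an assumption
        print(f"{label}: chi-square = {stat:.2f}, df = {df}, one-tailed p = {one_tailed:.3f}")
```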


DISCUSSION


The results confirmed the main hypothesis 
of the study: Early Stage schizophrenics dis- 
played significantly greater reminiscence ef- 
fects than did Advanced Stage schizophrenics. 
Such confirmation lends support to the theo- 
retical assumption underlying the prediction, 
namely, that the early stages of schizophrenia 
are marked by a state of heightened arousal, 
while the advanced stages of the disorder 
involve a state of low, or reduced, arousal. 

TABLE 2

MEAN NUMBER OF LETTERS PRINTED ON EACH TRIAL
FOR EACH EXPERIMENTAL GROUP

                                 Prerest trial                      Postrest trial
Group                            1      2      3      4      5      6      7
Early Stage schizophrenics       22.0   22.6   25.1   26.3   26.5   29.7   28.4
Normal reference group           21.8   24.9   25.7   25.9   25.4   28.5   28.
Advanced Stage schizophrenics    14.4   15.9   16.9   18.5   18.4   19.9   18.8


Bleke’s (1955) study, mentioned above,
classified schizophrenic subjects into “Good
Premorbid” and “Poor Premorbid” groups.
The classifications of Good Premorbid and 
Early Stage schizophrenia and Poor Pre- 
morbid and Advanced Stage schizophrenia 
have much in common in that they emphasize 
relative adequacy of premorbid adjustment.
It is worth noting that under conditions of 
reward Bleke found that a greater proportion 
of his Good Premorbid schizophrenics showed 
reminiscence than did his Poor Premorbid 
schizophrenics, and that the Good Premorbids 
also showed greater amounts of reminiscence.
While inspection of the data presented in 
Bleke’s tables suggests that these results were 
not statistically significant, their direction is 
consonant with the results obtained in the 
present study. 

The phenomenon of decelerated improve- 
ment in performance as an individual ap- 
proaches his asymptote or his “ceiling” neces- 
sarily influences the extent of reminiscence 
observed. That is, the nearer his ceiling a 
subject was on the last prerest trial, the more 
difficult it was for him to show unit improve- 
ment on the first postrest trial. No means for 
estimating a subject’s distance from his 
ceiling at the conclusion of the prerest trials 
was built into the experimental design. Thus 
it was impossible to correct for any variability 
in postrest improvement contributed by 
individual differences in prerest distance 
from ceiling. Inspection of the data revealed 
markedly superior performance of the Early 
Stage schizophrenics over the Advanced Stage 
schizophrenics on all five prerest trials (Table
2). Therefore, it seems possible that the Early
Stage schizophrenics were closer to their 
ceilings prior to rest than were the Advanced 
Stage schizophrenics. It is suspected that, if 
corrected for this factor, the obtained dif- 
ference in reminiscence between the two 
schizophrenic groups would have been greater. 
It is accordingly proposed in future remi- 
niscence studies to estimate the subject’s pre- 
rest distance from his ceiling by taking the 
subject to his asymptote by means of dis- 
tributed practice. This distance could then 
be used in the assessment of the subject’s 
postrest improvement. 

The superior performance of the Early 
Stage over the Advanced Stage schizophrenics 
on the experimental task is consonant with 
the assumption of higher levels of arousal in
the early stages of the disorder, assuming that 
the task of inverse printing is relatively non- 
competitional. However, the latter assump- 
tion seems to be questionable, leaving the 
interpretation of these data unclear. 

The difference between the two schizo- 
phrenic groups in prerest performance level 
raised two questions concerning the obtained 
difference in reminiscence effects. The first 
question was whether reminiscence effects 
were independent of level of prerest per- 
formance. The answer appears to be in the 
affirmative; point-biserial correlations be- 
tween mean level of prerest performance and 
amount of reminiscence (dichotomized) for 
Early Stage and Advanced Stage schizo- 
phrenics were .34 and .01, respectively, 
neither approaching significance. Kimble 
(1950) also reported reminiscence to be inde- 
pendent of prerest performance level. The 
second question involved the possibility that, 
by virtue of their superior prerest perform- 
ance, the greater postrest improvement dis- 
played by the Early Stage schizophrenics 
merely reflected greater prerest practice and 
hence greater familiarity with the task ma- 
terial. Analysis of each schizophrenic subject’s 
performance revealed, however, that for the 
two groups the proportions of reminiscing 
subjects who could have improved solely on 
the basis of previously practiced material were 
virtually identical. These proportions for
Early Stage and Advanced Stage schizo- 
phrenics were .385 and .375, respectively. 
The reminiscence differences between the 
normal and the schizophrenic groups, while 
in the predicted direction, were not signifi- 
cant. The lack of comparability of the 
normal and the schizophrenic groups due 
to drug effects and inequivalence of length 
of hospitalization perhaps attenuates the 
appropriateness of this comparison. 


SELECTION OF A TARGET FOR DEFENSIVE PROJECTION¹


This hypothesis was tested and supported (p < .05): a person receiving
unambiguous undesirable information about himself which is dissonant with 
his self-concept is more likely to project the attribute onto persons in his 
own social category than onto outgroup members. Each college student S 
was told either that he was lacking in or that he possessed homosexual 
motivation; he then rated the homosexuality of a speaker telling TAT stories 
on a tape recording. 34 of the Ss believed the speaker was another student; 
¥% believed he was a criminal. Attribution of homosexuality was a positive 
function of dissonance only when the speaker was seen as a student. Projection 
onto the student was greater among Ss with higher self-esteem. 


Consider a person who has been exposed 
to information unfavorable to himself and 
dissonant (Festinger, 1957) with his self-concept.
There is experimental evidence (e.g.,
Bramel, 1962; Murstein, 1956; Wright, 1942) 
that one of the responses sometimes made to 
such dissonance is to attribute the undesirable 
characteristic to other people. Who are these 
other people, and why do they rather than 
others become targets for defensive projection? 

The proposition that selection of a target 
for defensive projection is some function of 
the target’s social category membership or 
similarity to the projector has both theoretical 
and empirical support. Recent writers in 
the psychoanalytic tradition (e.g., Adorno,
Frenkel-Brunswik, Levinson, & Sanford, 
1950; Bettelheim & Janowitz, 1950) have 
argued that threat produced attribution of 
own traits is typically directed toward dis- 
liked outgroups. Unfortunately, a convincing 
experimental test of this hypothesis does not 
appear in the literature. A rather different 
hypothesis, derived from dissonance theory, 
was tested in an experiment reported earlier 
(Bramel, 1962). It was expected that under 
certain conditions projection of one’s unde- 
sirable characteristics onto respected persons 
or members of one’s own group would be 
dissonance reducing. By use of falsified test
results the self-esteem of subjects in one condition
was experimentally raised, while that
of subjects in the other condition was lowered.
All subjects were then privately given clear
evidence that they were homosexually aroused
by photographs of men. Each subject then
estimated the arousal level of another subject
(a fellow student of the same sex, enrolled in
the same course). Subjects with high self-esteem
(for whom the arousal information was
theoretically more dissonant) attributed more
homosexuality than did those with lower self-esteem.
Further, an internal analysis of the
data revealed that dissonance produced attribution
of homosexuality was most marked
among those subjects who had been favorably
impressed by the other subject prior to introduction
of the dissonant information. In
short, the experiment presented evidence that
defensive projection is sometimes directed
toward members of one’s own social category,
and especially toward favorably evaluated
members of the category. However, since the
subject's initial perception of the other person
was not experimentally manipulated, it was
not possible to generalize with certainty as
to the stimulus characteristics of likely and
unlikely targets for projection.


1 The research reported in this paper was done at
Stanford University and was supported by National
Science Foundation Grant G-11255 to Leon Festinger,

The experiment reported in this paper was 
designed to clarify the role of similarity and 
group membership and to replicate aspects of 
the earlier study. It is important to note that 
the hypotheses being tested here apply only 
to the case in which a person is prevented
from denying possession of the undesirable
trait. There remains the possibility that, according
to psychoanalytic theory, projection
would be aimed at different targets if the
person were able to deny possessing the trait.
The main hypothesis tested was as follows: 


Persons who are induced to ascribe unfavor- 
able and dissonant traits to themselves will 
be motivated to attribute such traits to mem- 
bers of their own group or social category but 
not to members of unfavorably evaluated
outgroups.


METHOD

Overview


Subjects were randomly assigned to two self- 
concept conditions. Those in the No Dissonance 
condition received no undesirable information about 
themselves. Those receiving the Dissonance treatment 
were given unfavorable dissonant information about 
their level of homosexual motivation. After the self-concept
treatment, each subject rated the degree of
homosexuality possessed by a target person represented
by a voice on a tape recording. Some of the
subjects were told the person was a member of social 
categories similar to their own (Student condition), 
and others were led to believe the voice belonged to 
a member of markedly different social categories 
(Criminal condition). Subjects with quite low initial 
self-esteem were segregated into an additional condi- 
tion in which they received information that they 
had homosexual motivation and were asked to judge 
the tape with Student instructions (Low S-E Student 
condition).


Subjects 


Exclusive of subjects used in pretesting, 109 Stan- 
ford University freshmen male volunteers participated 
in the experiment. 


Procedure 


The procedure was administered to the subjects 
one at a time and was introduced as a study of 
impression forming processes among untrained ob- 
servers of human behavior. It was explained that 
clinical psychologists were having difficulty training 
people to interpret Thematic Apperception Test 
(TAT) materials, and that study of the spontaneous 
intuitive judgments of TAT stories by untrained 
persons might yield valuable clues to the problem. 

To this end, the subject was to be given an opportunity
to listen to a tape recording of a person
telling stories to a set of TAT pictures. His task
would be to try to form an impression of the personality
of the person speaking. On the pretext that
a representative selection of tape recordings was
needed in the experiment, the subject was asked
whether, at the end of the hour, he would be willing
to tape his own stories to another set of TAT cards.
After securing the usual affirmative reply, the experimenter
pointed out that it would be necessary to
obtain objective information about the subject's personality
so that ratings made of his recording by
subsequent subjects could be checked for accuracy
of interpretation. He was told there would first be
a brief questionnaire and then “some very accurate
physiological measurements” of one or two important
personality variables. The questionnaire included a
set of polar adjective line scales for measuring the
subject’s level of self-esteem.


The procedure for introducing the unfavorable 
dissonant information into the subject’s cognition 
was similar to that employed in an earlier experiment 
(Bramel, 1962). Care was taken to ensure that the 
degree of threat would not be extreme; these pre- 
cautions will be discussed below. 

The instructions preparing the subject for the 
measurement of his own homosexual motivation 
were read by the experimenter in a matter-of-fact 
tone. It was explained that this particular trait was 
being studied because of the difficulties of measuring 
it by clinical interview methods. There followed an 
elaborate, detailed, and largely deceptive explanation 
of the physiology of sexual arousal and the excellent 
technique for measuring such arousal. Reference was 
frequently made to the “psychogalvanic skin response 
apparatus” which was placed, with its impressive 
array of knobs, dials, and wires, on the table in 
front of the subject. Emphasis was placed on the 
unconscious nature of sexual arousal and the impossibility
of exerting conscious control over its expression
in the psychogalvanic skin response. The task
set for the subject was to observe on his galva- 
nometer dial his own sexual arousal response to each 
of a series of photographs of men which would be 
projected onto the screen 12 feet in front of him. 
He was to record his readings in an anonymous 
booklet. All subjects were explicitly told that move- 
ments of the needle (which were actually controlled 
remotely by the experimenter) would indicate homo- 
sexual arousal to the photographs. As a precaution 
against excessive threat, the subject was told that 
persons with extremely strong homosexual tendencies 
would consistently have scores off the end of the 
scale using these instruments. Further, he was care- 
fully assured that his scores were completely con- 
fidential and anonymous. The galvanometer dial was 
visible only to the subject himself. 

It was hoped that the instructions and apparatus 
would be so impressive to the subject that denial 
of the fact that homosexual arousal was being 
indicated would be very difficult. 

Each photograph had been assigned an ap- 
propriate needle indication in advance, depending 
upon the experimental condition which the subject 
represented. For subjects in the No Dissonance con- 
dition the needle remained very close to the low end 
of the scale throughout. For those in the Dissonance 
condition the photographs of handsome men in states 
of undress received much higher needle indications 
than did those depicting unattractive and fully 
clothed men. 

An additional experimental condition, Low S-E
Student, was added to the design after about one- 
third of the subjects had been run. This consisted of 
subjects whose self-esteem (as measured by self- 
rating scales at the beginning of the experimental 
session) was below a predetermined point. By rapidly 
scoring this self-esteem measure, it was possible to
allocate all low self-esteem subjects to the Low S-E 
Student condition. These subjects received exactly 
the same treatment as did those in the Dissonance 
condition, and were told the person speaking on the 
tape recording was a student (as was the case for 
all subjects in the Student condition). The low self- 
esteem condition was included in order to replicate 
the conditions of a previous experiment which 
showed that persons with higher manipulated self- 
esteem were more likely to project. It was expected 
that the more favorable cognitions a person has 
about himself, the more dissonance is introduced by 
unfavorable information about the self. 

Immediately after the subject recorded his needle 
readings for the 15 photographs, the experimenter 
said that it was now time to listen to a tape 
recording of a person telling stories to TAT cards.
If the subject had been assigned (randomly) to 
the Student condition, the experimenter noted out 
loud that the subject was a freshman majoring in 
__________. The experimenter then selected,
with apparent deliberation, a tape recording from a 
long shelf of tapes. He told the subject that this 
recording had been made earlier in the year by a 
person who was “also a freshman here at Stanford, 
and also majoring in __________.” If
the subject had been assigned to the Criminal condi- 
tion, he was not reminded of his own social category, 
but was simply told that the recording selected was 
made in the San Francisco City jail, where “they 
have a testing program for the inmates.” He was 
informed that the person speaking on the tape had 
just been arrested for the third time, “this time after
a long string of armed robberies,” and that he was
24 years old. Thus some subjects were led to believe 
the recording had been made by a person rather 
similar to themselves, while the others were pre- 
sented with a member of a more disliked and dis- 
similar outgroup. In truth, the same recording was 
used for all subjects. 

The recording consisted of four stories which had 
been selected from an assortment of TAT protocols. 
They contained no direct reference to heterosexual or 
homosexual behavior, and were in general very 
ordinary and unimaginative, containing a fairly large 
amount of mere description of the superficial aspects 
of the pictures. In order to find homosexual symbolism
in the stories, the listener would have to look
rather hard; nevertheless, it would be possible for 
someone motivated to find such material to find it. 
The script was read in a realistic manner by a freshman
with a rather deep voice whose manner contained
none of the stereotyped signs of homosexuality.
The TAT pictures used were Numbers 6, 12, 11,
and 18. In order to make the recording believable 
as either a student or a criminal, the pace was rather 
slow and the vocabulary limited. The story for Card 
Number 6 came first, and probably made it plausible
that the person might be a criminal. An excerpt
follows:


... Also this man could be a ... I suppose
... a police investigator coming to the woman
to ask about someone in her family. I would say
if it was news about somebody, it wouldn't be
very good, I mean, judging by their expressions.
Maybe someone was put on the police blotter or
record. ...


While listening to the recording, the subject looked 
at the card being described. Immediately after the 
7-minute tape had ended, he was given a question- 
naire to fill out, containing measures (described 
below) of the impression he had formed of the 
stimulus person, his rating of his own degree of 
homosexuality, and other related questions. 

At the completion of the questionnaire, the experi- 
menter explained in detail the true nature of the 
experiment, uncovered all deceptions, and discussed 
the meaning of the findings up to date. At the end 
of the quarter, a summary of the design and results 
was sent to each person who participated in the
experiment. All available evidence indicates that the
subjects considered the experience interesting and
worth their while. 


Measures 


The primary measure of the dependent variable, 
attribution of homosexuality onto the stimulus 
person, was a six-point rating scale running from 
“Completely lacking in homosexual tendencies” to
“Very strong homosexual tendencies.” Another scale
tapped attribution of “overt homosexual experiences.”
Similar scales measured the stimulus person's perceived
friendliness, generosity, and the overall favorability
of the impression he made on the subject.
Next, the subject was asked to write down any 
aspects of the tape recording which were useful in 
deciding the degree of homosexual motivation pos- 
sessed by the stimulus person. He was told to look 
at each TAT card in turn to help him to recall the 
stories. This free response material was later coded 
for evidence of projection. Following this, the subject 
was asked to rate himself on the six-point scale, 
“Completely lacking” to “Very strong homosexual
tendencies.” Another question instructed him to com- 
pare himself to the stimulus person as to similarity 
in general. Finally, he was asked to estimate the 
prevalence of homosexuality in the population and to 
describe his own personal attitude toward homo- 
sexuality per se. 


RESULTS 
Adequacy of Experimental Inductions 


In order to test the central hypothesis, it 
was necessary that two requirements be met. 
First, subjects must differ in their self- 
ascription of homosexuality as a result of 
the experimental treatments. Second, subjects 
must perceive the stimulus person as being 
in general more similar to themselves in the 
Student than in the Criminal condition. These 
conditions would also be expected to differ in
the favorability of the subject’s attitudes 
TABLE 1

MEANS AND STANDARD DEVIATIONS OF RATINGS OF
OWN HOMOSEXUALITY, PERCEIVED SIMILARITY
TO AND ATTITUDE TOWARD
STIMULUS PERSON

                                Condition
                        Student               Criminal
                   No Dis-    Disso-     No Dis-    Disso-
                   sonance    nance      sonance    nance
                   (N = 17)   (N = 31)   (N = 16)   (N = 33)

Similarity
  M                  1.53       1.29        .84       1.03
  SD                  .62        .33        .57        .47
Overall impression
(three scales)
  M                  4.16       3.99       3.27       3.66
  SD                  .50        .90        .86        .85
Own homosexuality
  M                  1.8        2.74       2.42       2.67
  SD                  .93       1.00        .96       1.05


toward the stimulus person. The relevant data 
are presented in Table 1. (Subjects in the 
Low S-E Student condition are considered 
separately below.) Self-ascription was meas- 
ured by a six-point line scale, with a higher 
score indicating greater self-ascription of 
homosexuality. It can be seen that there is
a clear difference between the No Dissonance
and the Dissonance conditions; subjects in
the Dissonance condition ascribed more of
the trait to themselves (t = 3.29, df = 95,
p < .01).² Using a four-point scale, the sub-
ject was asked to rate the similarity in general
between himself and the person on the tape
recording. As shown in Table 1, subjects
perceived more similarity between them- 
selves and the “student” than the “criminal” 
(t= 3.06, df = 95, p < .01). On other six- 
point scales the subject rated the stimulus 
person as to his generosity, friendliness, and 
the favorability of the overall impression he 
made. Averaging these scales scored for favor-
ability yielded a measure of the attitude of
subjects toward the person on the tape record-
ing. The data in Table 1 show a less favorable
rating of the “criminal” than of the “student”
(t = 3.07, df = 95, p < .01). One may con-
clude that subjects in the Student condition
were more favorable toward and perceived

2 All reported significance levels for t are based
upon two-tailed tests.
themselves more similar to the stimulus person
than did those in the Criminal condition. 


Projection as a Function of the Stimulus 
Person 


The major dependent variable, attribution 
of homosexuality to the person on the tape 
recording, was measured in three ways. The 
means and standard deviations for these meas- 
ures are presented in Table 2. The primary 
measure of attribution was the six-point scale 
running from “Completely lacking” to “Very
strong” homosexual tendencies. Within each
stimulus person condition the baseline for 
comparison was of course the level of attribu- 
tion among subjects receiving the No Disso- 
nance treatment. The hypothesis would be 
confirmed if the introduction of dissonance 
(as in the Dissonance condition) produced a 
greater increase in attribution of homosexu- 
ality in the Student condition than in the 
Criminal condition. The tendency, in the No 
Dissonance condition, for subjects to rate 
the "criminal" more homosexual than the 
“student” (t= 2.17, df = 31, p < .05) does 
not affect a test of the hypothesis. The rele- 
vant comparisons are concerned only with the 
change in attribution resulting from the intro- 


TABLE 2

MEANS AND STANDARD DEVIATIONS OF MEASURES OF
ATTRIBUTION OF HOMOSEXUAL TENDENCIES
TO THE STIMULUS PERSON

                                Condition
                        Student               Criminal
                   No Dis-    Disso-     No Dis-    Disso-
                   sonance    nance      sonance    nance
Variable           (N = 17)   (N = 31)   (N = 16)   (N = 33)

Homosexual
tendencies
  M                  1.94       2.97       2.69       2.76
  SD                  .90       1.20       1.08       1.17
Overt homosexual
experiences
  M                  1.47       1.90       2.12       1.94
  SD                  .87       1.40       1.26       1.08
Homosexuality in
TAT stories a
  M                   .42       1.35       1.50       1.21
  SD                 2.69       3.60       3.10       2.78

a Number of statements attributing homosexuality minus the
number attributing an absence of homosexuality.
Fig. 1. Mean attribution of homosexuality to
stimulus person as a function of his social category 
and the amount of dissonance. 


duction of dissonance. In the Student condi- 
tion attribution was significantly greater in 
the Dissonance than in the No Dissonance 
treatment (t = 3.10, df = 46, p < .01). In
the Criminal condition there was no effect 
traceable to the magnitude of dissonance. The 
most appropriate statistical test of the hy- 
pothesis would appear to be a test of the 
interaction between stimulus person conditions 
and presence or absence of dissonance. Since 
the four relevant cells contained unequal fre-
quencies, a t test for the significance of the
difference between differences (Walker & Lev, 
1953, p. 158) was used. This comparison 
yielded a t of 2.00 (df = 93), which is signifi- 
cant beyond the .05 level. The interaction is 
presented graphically in Figure 1. Results 
(shown in Table 2) using the secondary 
measures of attribution were consistent with 
these findings, although differences were not 
statistically significant. 
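
The t values reported for the primary measure can be approximately recovered from summary statistics alone. The following is a minimal sketch, not the author's own computation. It assumes the Table 2 means and standard deviations for the "homosexual tendencies" scale (with decimal points restored where the scan dropped them), a pooled-variance t for two-group comparisons, and a pooled-variance t on the difference between differences for the interaction. All function names are illustrative.

```python
import math

# Primary measure from Table 2 as (n, mean, sd) per cell.
# Decimal points are restored from the scan, so treat these as assumed readings.
CELLS = {
    ("student", "no_dissonance"): (17, 1.94, 0.90),
    ("student", "dissonance"): (31, 2.97, 1.20),
    ("criminal", "no_dissonance"): (16, 2.69, 1.08),
    ("criminal", "dissonance"): (33, 2.76, 1.17),
}


def pooled_variance(cells):
    """Pool within-cell variances, weighting each by its degrees of freedom."""
    ss = sum((n - 1) * sd ** 2 for n, _, sd in cells)
    df = sum(n - 1 for n, _, _ in cells)
    return ss / df, df


def two_group_t(a, b):
    """Pooled-variance t for mean(b) - mean(a); returns (t, df)."""
    var, df = pooled_variance([a, b])
    se = math.sqrt(var * (1 / a[0] + 1 / b[0]))
    return (b[1] - a[1]) / se, df


def interaction_t(cells):
    """t for (Student: Diss - No Diss) minus (Criminal: Diss - No Diss)."""
    var, df = pooled_variance(list(cells.values()))
    se = math.sqrt(var * sum(1 / n for n, _, _ in cells.values()))
    student = cells[("student", "dissonance")][1] - cells[("student", "no_dissonance")][1]
    criminal = cells[("criminal", "dissonance")][1] - cells[("criminal", "no_dissonance")][1]
    return (student - criminal) / se, df


if __name__ == "__main__":
    print(two_group_t(CELLS[("student", "no_dissonance")], CELLS[("student", "dissonance")]))
    print(two_group_t(CELLS[("student", "no_dissonance")], CELLS[("criminal", "no_dissonance")]))
    print(interaction_t(CELLS))
```

Under these assumptions the three statistics come out near 3.09 (df = 46), 2.17 (df = 31), and 1.99 (df = 93), which agrees with the reported 3.10, 2.17, and 2.00 to within rounding of the tabled values.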

Since the dominant response to the disso- 
nant information in the Dissonance condition 
was to ascribe homosexuality to the self, it 
can be seen that the results for the primary 
measure are in clear accord with the hypothe- 
sis—namely, that with self-ascription projec- 
tion of the trait is aimed at members of one’s 
own social category and not at members of
an outgroup. Results for the two secondary 
measures were less strong, but were both in 
the predicted direction. 


Evidence for the Role of Dissonance 


Let us consider a possible alternative expla- 
nation of the increased attribution of homo-
sexuality to the “student” in the Dissonance
condition. Since these subjects ascribed the 
trait to themselves, their ratings of the 
“student” might have resulted simply from 
a learned tendency to assume (in the absence 
of clear evidence) general similarity within 
one’s own social category. Attribution, it could 
be argued, resulted from mere self-ascription, 
rather than from dissonance between the GSR 
information and self-concept. If this were the 
case, one would expect attribution of non- 
dissonant self-ascribed traits to the same 
degree as attribution of dissonance producing 
traits. A test of this possibility was included 
in the experimental design. Subjects with low
self-esteem (as measured prior to the experi- 
mental operations) were segregated into the 
Low S-E Student condition and were exposed 
to the Dissonance treatment. According to
dissonance theory the introduction of unfavor- 
able information about the self should produce 
less dissonance for persons who already have 
a large proportion of unfavorable self-referent 
cognitions. It was intended, then, that sub- 
jects in the Low S-E Student condition would 
find the information about their own homo- 
sexuality less dissonant than would other sub- 
jects receiving the same information. It can 
be seen from the data in Table 3 that self- 
ascription was slightly greater among subjects 
with lower self-esteem. Yet there is, as pre- 
dicted, a difference in attribution of the trait 
to the “student.” Subjects in the higher self- 
esteem group rated the "student" slightly 


TABLE 3

MEANS AND STANDARD DEVIATIONS OF RATINGS OF
SELF AND “STUDENT” BY SUBJECTS
DIFFERING IN SELF-ESTEEM

                                            Self-esteem
Variable                                  Lower      Higher

Attribution of homosexuality to
“student”
  M                                        2.38       2.93
  SD                                        .96       1.20
Favorability of overall impression
  M                                        4.05       4.00
  SD                                        .69    [illegible]
Self-ascription of homosexuality
  M                                        3.00       2.70
  SD                                       1.15        .99
more homosexual than did those with lower
self-esteem (t = 1.46, df = 41). It is possible
to make a more powerful test by comparing
means only of those subjects who clearly met
the specified conditions of the hypothesis—
that is, who attributed a substantial degree
of homosexuality to themselves. Considering
subjects in the two self-esteem categories who
rated themselves at three or higher on the
six-point homosexuality scale, the difference
in attribution to the “student” yields a t of
2.09 (df = 22, p < .05).


In order to make the above argument con-
vincing, it is of course necessary to show that
the difference between self-esteem groups in
attribution of traits to the target person is
specific to the trait of homosexuality and does
not simply reflect general tendencies to use
different parts of rating scales. Each subject
rated the stimulus person as to his generosity,
friendliness, and the overall impression he
made. Responses to these scales were scored
for favorability and averaged for each subject.
The data, presented in Table 3, indicate that
there was no difference between self-esteem
groups in the favorability of their ratings of
the “student.” Means on the individual scales
were almost identical for the two groups.
This analysis suggests that the attribution
difference between self-esteem groups in the
Student condition was indeed a function of
a difference in dissonance. This is consistent,
under somewhat different conditions, with the
findings of the earlier experiment in this series.

The data contain additional evidence rele-
vant to the alternative hypothesis that sub-
jects in the Student condition simply made
a nondefensive assumption of similarity within
their own group. If that hypothesis were true,
one would expect to find a correlation between
self-ratings and ratings of the “student” on
traits not associated with dissonance. In addi-
tion to the homosexuality scale, there were
two other scales, for generosity and friendli-
ness, on which the subjects rated both them-
selves and the stimulus person. On each of
these, cross-tabulation of ratings of self and
the “student” in the Dissonance condition
yielded chi square values of approximately
zero. In contrast, the association between
ratings of self and “student” on homosexual-
ity produced a chi square of 4.74 (p < .05).
The absence of correlation on nondissonant
traits suggests that nondefensive assumed 
similarity rating tendencies were not an 
important factor in the homosexuality ratings. 

Let us consider one further alternative ex- 
planation of the main findings. Suppose that 
subjects in the Dissonance condition displaced 
hostility onto another member of their own 
group, the “student,” and that this was re-
flected in the tendency to attribute homo-
sexuality to him. The relevant data do not
support this hypothesis. On traits other than 
homosexuality, there was no difference be- 
tween the No Dissonance and Dissonance 
conditions in the favorability of evaluation 
of the “student.” 


Additional Targets for Projection 


After rating the person on the tape record- 
ing, the subject was asked some questions 
about the distribution of homosexuality in the 
population. The first of these was as follows: 
“What proportion of American adult males 
would you say possess homosexual tendencies 
to some degree?” In the Student condition 
subjects receiving the Dissonance treatment 
attributed homosexuality to a larger pro- 
portion of the population than did subjects 
receiving the No Dissonance treatment 
(t = 1.63, df = 46, p = .11). In the Criminal 
condition, on the other hand, there was no 
difference at all between high and low disso- 
nance treatments. A similar contrast between 
the Student and Criminal conditions was 
revealed in responses to another question 
which asked the subject to estimate the preva-
lence of homosexuality among students in gen-
eral. Considering only subjects receiving the
Dissonance treatment, those in the Student
condition attributed more homosexuality to
students than did those in the Criminal condi-
tion (t = 2.49, df = 62, p < .02). It is of
interest to note that dissonance-produced at-
tribution onto broad social categories of
people seems to have occurred only in the
Student condition, even though this avenue
of dissonance reduction was also available to 
subjects in the Criminal condition. 


Attitude toward Homosexuality as Such 


Finally, the subject was asked to describe 
his attitude toward homosexuality on a five-
point scale running from “morally desirable”
to “extremely immoral and disgusting.” There 
was not the slightest difference between sub- 
jects in the Dissonance and No Dissonance 
conditions in their responses to this question. 
This suggests that re-evaluation of the dimen- 
sion in the abstract was not a significant 
immediate means of dissonance reduction. 


DISCUSSION


The results are clearly consistent with the 
hypothesis that persons who are induced to 
ascribe unfavorable and dissonant traits to 
themselves will be motivated to attribute such 
traits to other members of their own social 
category but not to members of a disliked 
outgroup. A likely response to this evidence 
for dissonance-produced projection is to ask
how, exactly, such attribution would be ex-
pected to reduce dissonance. Presumably the
selection of projection in the present is 
partly contingent upon its rewarding conse- 
quences in the individual’s past experience. 
There are several conceivable ways in which 
projection might produce a reduction in disso- 
nance. For example, attributing an undesirable 
trait to favorably evaluated people might 
result in a change in the meaning of the 
dimension itself. Becoming less undesirable, it
would then less strongly negate the person’s 
favorable attitudes toward himself. In the 
present experiment, however, there was no 
evidence of re-evaluation of homosexuality 

resulting immediately from the presence of 
dissonance. In the absence of information as 
to the reliability of the scale used to detect 
such re-evaluation, it is not possible to reject 
the hypothesis with confidence, however. 

Consider another way in which dissonance 
reduction might result from the kind of pro- 
jection which occurred in the experimental 
situation. There are many occasions in a
person's life when he receives information 
about himself, but where an adequate context 
for comparison is not readily available. In 
the experiment described here, for example, 
the subject could clearly see that, in absolute 
terms, he was homosexually aroused by the 
photographs. The needle on the calibrated dial 
of his galvanometer indicated this unmis-
takably. But, lacking information concerning
averages and ranges, the subject still had the
problem of deciding “how homosexual is 23
milliamperes?” In order to apply necessary
adjectives such as “very” or “slightly” to the
label, “homosexual,” the subject must have
some further information about how he
compares with other people (Festinger, 1954).
Thus his subjective evaluation of his own
position on the homosexuality scale is depend-
ent upon where he perceives certain relevant
other persons to be located on that scale. It
is plausible to assume that a person’s com-
parison group consists of people with back-
grounds similar to his own and with whom he
generally associates. It follows that if the
subject can believe that other students like
himself have about the same amount of homo-
sexual motivation as he has, then his position
on his subjective scale is not extreme. The
cognition that one is “average” in homo-
sexuality probably does not strongly negate
a moderately favorable level of self-esteem,
and is certainly less dissonant than the cogni-
tion that one is more homosexual than others
in the comparison group. With this kind of
resolution process it is possible that the
person's evaluation of homosexuality in the
abstract will not change markedly. Rather, his
evaluation of his own position on the scale
has changed.
EFFECTS OF PRIOR DISHONESTY ON POSTDECISION
DISSONANCE 


TIMOTHY C. BROCK¹


Iowa State University


145 children (aged 3-12) indicated liking for 3 toys and 3 crackers. After 
being exposed to a temptation to be dishonest and given a choice of 1 of 
the objects, they gave liking ratings of their chosen and unchosen alternatives. 
Postdecision dissonance reduction (increased liking for the chosen, and de- 
creased liking for the unchosen alternative) was greatest when choice from 
among dissimilar objects (toy and cracker) followed dishonesty; it was 
least when honesty preceded choosing from among similar objects (2 toys 
or 2 crackers). Results were consistent with formulations by Hull, on drive 
summation, and Lewin, on tension spread: irrelevant tension combines with 
relevant tension to produce greater response to the latter. 


Cognitive dissonance is thought of as tension 
with drive character resulting from the juxta- 
position of cognitions that do not fit with one 
another (Festinger, 1957). An interesting theo- 
retical problem concerns the relation between 
dissonance and other cognitive tensions. The 
nature of this relation may be understood in 
terms of models by Hull (1943) and Lewin 
(1951). 

Although exceptions exist (e.g., Grice &
Davis, 1957), the bulk of experimental evidence 
(Amsel, 1950; Brandauer, 1953; Braun, Wede- 
kind, & Smudski, 1957; Kendler, 1945; Siegel, 
1946; Webb, 1949) has generally supported 
derivations from Hull’s theory concerning sum- 
mation effects of irrelevant drives on response 
strength to the relevant drive. The pertinent
formula is $\bar{D} = 100(\dot{D} + D')/(\dot{D} + 100)$, where
$\bar{D}$ means the total effective drive strength oper-
ating at a given moment; and $D'$ and $\dot{D}$ denote
drive strengths ascribable to relevant and to
irrelevant or quasi-drives, respectively. The
highest value of $\bar{D}$ obtains when both $D'$ and
$\dot{D}$ are high; the least value of $\bar{D}$, when both $D'$
and $\dot{D}$ are low; and intermediate values of $\bar{D}$
under other conditions, i.e., when either $D'$ or
$\dot{D}$ is high.
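
The ordering claimed for the four combinations can be checked numerically. This is a minimal sketch under stated assumptions: it takes the drive-summation formula in the form 100 × (irrelevant + relevant) / (irrelevant + 100), which is a reading of Hull (1943) because the diacritics that distinguish the D symbols are lost in this copy. The "high" and "low" drive values are hypothetical numbers chosen only for illustration.

```python
def effective_drive(relevant: float, irrelevant: float) -> float:
    """Total effective drive, assuming the form 100 * (irr + rel) / (irr + 100)."""
    return 100.0 * (irrelevant + relevant) / (irrelevant + 100.0)


# Hypothetical drive strengths on a 0-100 scale (not taken from the source).
HIGH, LOW = 80.0, 20.0

both_high = effective_drive(relevant=HIGH, irrelevant=HIGH)
relevant_only = effective_drive(relevant=HIGH, irrelevant=LOW)
irrelevant_only = effective_drive(relevant=LOW, irrelevant=HIGH)
both_low = effective_drive(relevant=LOW, irrelevant=LOW)

if __name__ == "__main__":
    for name, value in [
        ("both high", both_high),
        ("relevant high only", relevant_only),
        ("irrelevant high only", irrelevant_only),
        ("both low", both_low),
    ]:
        print(f"{name}: {value:.2f}")
```

With these illustrative inputs the effective drive is largest when both components are high, smallest when both are low, and intermediate when only one is high, which is the pattern the text describes.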

Lewin’s (1951) expression of the basic rela-
tion between neighboring tension systems, $t(S^1)$
and $t(S^2)$, is somewhat analogous to Hull's
formula. In Lewin's statement, “$b_{S^1}$ and $b_{S^2}$
indicate the boundaries of the systems $S^1$ and

1 The research was carried out while the author was at
the University of Pittsburgh. The writer is greatly in-
debted to A. R. Cohen for his comments and suggestions.


$S^2$, $b_{S^1} \cdot b_{S^2}$ their common part. If $t(S^1) \neq t(S^2)$
and $b_{S^1} \cdot b_{S^2} \neq 0$, a tendency exists to change so
that $t(S^1) = t(S^2)$” (p. 11). Tension should be
maximal in $S^1$ when both $t(S^1)$ and $t(S^2)$ are
high; least, when both $t(S^1)$ and $t(S^2)$ are low;
and intermediate under other conditions.

It is consistent with Hull's and Lewin's
formulations to say that irrelevant tension, Hull's $\dot{D}$
or Lewin's $t(S^2)$, may combine or summate with the
relevant tension, $D'$ or $t(S^1)$, to produce greater
response to the latter. The relevant tension in
the present study, postdecision dissonance, was 
varied by manipulating the similarity of alter- 
natives involved in a decision. Selection of the 
source of irrelevant tension, dishonesty, was 
based on pilot research which showed that 
manifest tension, as operationally defined be- 
low, was less in tempted subjects who were 
honest than in those who were dishonest. 
The relevant tension, postdecision dissonance, 
should be greater when the decision involves 
dissimilar rather than similar alternatives 
(Brehm & Cohen, 1959); it should also be 
greater when accompanied by tension from an 
irrelevant source, namely, prior dishonesty. 
Thus, it was predicted that postdecision dis-
sonance reduction would be greatest for persons
making a decision between dissimilar alterna- 
tives immediately following dishonest be- 
havior; dissonance reduction would be least for 
persons making a decision between similar 
alternatives following honest behavior; dis- 
sonance reduction would be intermediate under 
other conditions (dissimilar-honest, similar- 
dishonest). Lewin’s common boundary require-
ment was met by having the victim of dis-
honesty be the same person who provided the
choice among alternatives. 


METHOD 


Children were asked to indicate how much they liked 
each of three kinds of small toys and each of three kinds 
of crackers. An attractive guessing contest was intro- 
duced in which a child could maximize his chances for 
winning by being dishonest. Then, about one half the 
subjects was given a choice between similar objects (two 
bags of crackers or two toys) and the other half, a choice 
between dissimilar objects (a bag of crackers and a toy). 
It was assumed that postdecision dissonance would be
reduced by increased liking of the chosen, and de- 
creased liking of the unchosen alternative (Brehm, 
1956; Brehm & Cohen, 1959). An index of the child's 
change in liking of the chosen and unchosen alternatives 
was the dependent variable. 


Subjects 


The original samples consisted of 141 pupils, aged 
5-12, at Baldwin Township Highlands School² and 37
children, aged 3-4, present for diagnostic testing at 
Pittsburgh Children’s Hospital Speech Clinic. More 
subjects were available in the two higher of four age 
categories: 10-12, 7-9, 5-6, 3-4. A selection procedure,
to be described later, reduced the usable number in each 
of these categories to 45, 40, 29, and 31, respectively. 
In each category the number of boys and girls was
approximately the same. 


Procedure 


The experimenter brought each subject to the experi- 
mental room and explained that she was a writer of 
children’s stories and books and needed to have informa- 
tion on “what children liked and didn’t like.” The 
experimenter showed the subject six objects: saltine, 
graham, and cheese crackers, in transparent plastic 
bags; and three small plastic toys requiring manual 
manipulation (ring toss, spinning top, and yoyo). The 
children were allowed to taste a little of each of the 
crackers and to handle the toys. The experimenter 
said: 

So that I can have a record of how much you like or 

don’t like each of these things I would like you to 

rate each one. 


Each object was placed at the top of a separate rating 
sheet on which was printed the name (e.g., Yoyo Toy, 
Graham Crackers). The subject was asked to “check 
here on this sheet how you really feel about this [name 
of toy].” When the ratings were completed, the experi- 
menter put the sheets and objects away in a drawer. 


Then the experimenter called the subject’s attention
to a large (approximately 2′ × 2′ × 2′) box at her side
and said:


2 Special thanks are due Thomas Burkhart for his
cooperation in obtaining subjects and Richey Hale for
her able assistance in collecting the data.
Now I want to tell you about a special guessing con-
test that we are having here. The reward for winning
this contest is two tickets to any movie you want to
go to. You can bring your mother with you or any-
body you want. Would you like to win those tickets?
[All subjects were enthusiastic.] OK, in that box 
there are three things that most children like to play 
with and you can win the movie tickets if you can 
guess what they are. [The box actually contained 
three familiar playthings. ] 
While giving these instructions, the experimenter lifted 
the lid of the box just barely two or three times. Then, 
the experimenter gave the subject a sheet with clear line 
drawings of the following: doll, teddy bear, skates, 
whistle, bat and ball, dump truck, airplane, basketball, 
sailboat, air rifle, toy blocks, balloons. The experi- 
menter explained that the contest involved guessing 
which 3 of the 12 objects were in the box; the subject 
was to check three of the drawings to represent his guess 
and the winner would be the one who guessed the most 
correctly. The subject was given a brown manila
envelope with his name on it and asked to put his guess-
ing sheet inside when he was finished. When the con-
test instructions were clearly understood, the experi-
menter excused herself, saying she had to see a teacher
who was waiting for her, that she would be gone B 
minutes, and that the subject should go ahead and 
check his guesses. When the experimenter left, she took 
the previously obtained ratings of toys and crackers 
with her.
On her return, the experimenter put the manila enve-
lope for the guessing contest aside and said:


Now I would like to give you something for helping 
me. Here are two things you might have but since I
don’t have very many I would like you to choose 
whichever one of these you really want most. 


About half the subjects were given a choice between
dissimilar objects (a toy and a bag of crackers); half, a
choice between similars (two toys or two bags of
crackers). Following the subject's choice, the same
rating procedure described above was employed for the 
chosen and unchosen objects. The subject was told to 
return before going home to pick up the object he had 
chosen. Finally, the experimenter glanced into the 
subject’s guessing-contest envelope and said: 


Well, you got at least one right so you might be the 
person who will win the movie tickets. 


The experimenter enjoined the subject to secrecy to
protect his chances of winning and so the contest would
not have to be stopped. When the subjects returned for 
their chosen objects, they were again reminded to main- 
tain secrecy; there was no evidence of contamination. 
After the completion of the experiment, an arrangement 
was made to show a movie to all subjects with the 
rationale being “so many of you made good guesses in 
the contest.” Thus, dishonesty was not differentially 
rewarded and, of course, each subject was offered a gift 
he desired. 


Measures 


Rating sheet. The rating sheet was a scale consisting 
of a horizontal line of 71 equally spaced dots with 
equally spaced markers labeled: don't like it very much;
don’t like it a lot; don’t like it; don’t like it a little; 
like it a little; like it; like it a lot; like it very much. 
Values were given to these ratings by assigning the 
numbers 0 through 7.0 to these labels, respectively, and 
scoring each check to the nearest tenth of an interval 
(i.e., to the nearest dot on the scale). Subjects were
instructed to make a pencil check anywhere along the 
dotted line. 
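
The scoring rule just described amounts to a simple mapping from dot position to score. The sketch below is illustrative only. It assumes the 71 dots are indexed 0 through 70 from the "don't like it very much" end and that the eight labels sit on every tenth dot, which is an inference from the description rather than something the text states. The function names are hypothetical.

```python
# The eight scale labels, in order; label k is assumed to sit at dot 10 * k.
LABELS = [
    "don't like it very much",
    "don't like it a lot",
    "don't like it",
    "don't like it a little",
    "like it a little",
    "like it",
    "like it a lot",
    "like it very much",
]

N_DOTS = 71  # dots indexed 0..70 (assumed indexing)


def score_from_dot(dot_index: int) -> float:
    """Convert the dot nearest the child's check into a 0.0-7.0 liking score."""
    if not 0 <= dot_index < N_DOTS:
        raise ValueError("dot index must lie between 0 and 70")
    return dot_index / 10


def label_at(dot_index: int):
    """Return the printed label at this dot, or None if the dot is unlabeled."""
    if dot_index % 10 == 0 and 0 <= dot_index < N_DOTS:
        return LABELS[dot_index // 10]
    return None
```

For instance, a check nearest dot 37 would be scored 3.7, between "don't like it a little" (3.0) and "like it a little" (4.0).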

Manifest tension. On returning to the experimental 
room, the experimenter recorded the presence or not of 
the following behaviors: fidgeting, playing with hands, 
picking at hem of dress, refusal to look at the experi- 
menter, fixated staring at contest box, startle, hurry to 
leave, spontaneous verbal confirmation or denial (e.g., 
saying “I didn’t look”), and restlessness (Deutsch, 1954, 
p. 200; Lewin, 1938, p. 138). The reliability of the mani- 
fest tension rating, .87, was determined from scoring of 
23 subjects by the experimenter and the writer (who 
observed the subject through a window in the door of 
the experimental room). 

Dishonesty. After the subject returned to his previ- 
ous activity, the experimenter compared his guesses 
with the objects in the box (these were changed fre- 
quently by the experimenter as a precaution). A sub- 
ject with all three correct was considered to have peeked, 
to have been dishonest, since a blind correct selection of 
3 target objects out of 12 is an extremely improbable 
event.
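
The improbability appealed to here can be made explicit. As a minimal sketch, assuming the child checks exactly 3 of the 12 drawings at random and all sets of three are equally likely, the number of correct guesses follows a hypergeometric distribution. The function name is illustrative.

```python
from fractions import Fraction
from math import comb

N_DRAWINGS = 12  # drawings on the guessing sheet
N_TARGETS = 3    # objects actually in the box
N_CHECKED = 3    # drawings the subject checks


def prob_exactly(k: int) -> Fraction:
    """Probability that a blind guesser gets exactly k of the 3 targets."""
    ways = comb(N_TARGETS, k) * comb(N_DRAWINGS - N_TARGETS, N_CHECKED - k)
    return Fraction(ways, comb(N_DRAWINGS, N_CHECKED))


if __name__ == "__main__":
    for k in range(N_CHECKED + 1):
        print(k, prob_exactly(k), float(prob_exactly(k)))
```

Under this assumption a blind guesser gets all three correct with probability 1/220, or about 0.0045, so classifying a perfect score as peeking should misclassify very few honest subjects.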


Control Condition and Modifications for Pre- 
school Subjects 


In the control condition, the experimenter remained
seated in front of the subject while he guessed what 
playthings were in the contest box—there was no oppor- 
tunity for peeking. Forty control subjects were avail- 
able but the usable number was 15 in the 10-12 year 
old, and 16, in the 7-9 year old group. 

For the preschool subjects, aged 3 and 4, certain
modifications were necessary. A 24" X 24" plywood
panel was used to conceal the contest objects instead of
the box with the liftable lid. The rating sheet labels
were slowly read to the subject by the experimenter and,
although these subjects were capable of grasping a
pencil and making some sort of mark, it was necessary
for the experimenter to ask the subject to point to where
he wanted his mark to go: the experimenter guided his
hand and helped him make his mark. This admittedly
crude procedure was only needed for the rating sheets;
the drawings on the guessing sheet were so large and
distinctive that they were easily marked by the pre-
school subjects.


Experimenter’s Selection of Choice Alternatives 
and Usable Subjects 


When the experimenter left the subject during the
guessing period, she selected alternatives for the sub-
jects (within random assignment to similar and dis-
similar conditions) in terms of requirements that had to
be met to permit an appropriate test of the hypotheses
(Brehm & Cohen, 1959). One of the choice objects had
to be liked more than the other so that its choice could 
be expected; but not liked so much that an increase in
liking would be impossible. To insure that an initially 
less liked alternative was seriously considered as a pos- 
sible choice, its initial rating could not be much lower 
than that of the more liked alternative. When one 
alternative is considerably less attractive, little post- 
decision dissonance is aroused (Brehm, 1956). 

The criteria used were: the initially more liked al- 
ternative had to receive a rating between 3.8 and 6.5, 
or, in other words, a moderate to high liking score; the
other alternative could be no more than 1.5 scale points
lower, but had to be at least .1 scale point lower (the
least measurable difference). When, for 24 subjects, it
was impossible to satisfy these requirements, the experi- 
menter did not complete the experiment (provide a 
choice and second ratings): instead, the subject was 
given the usual secrecy injunction and told he could 
return at the end of the day to pick up a reward for 
helping, whatever object he initially rated highest. 
Data from nine additional subjects were omitted be- 
cause they did not choose the initially more liked 
alternatives.³
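
The selection criteria above can be written as a single predicate. This is a hedged sketch, not the experimenter's actual procedure. It assumes the 3.8 to 6.5 bounds are inclusive and works in whole tenths of a scale point (the scale's resolution) to avoid floating-point edge cases. The function names are illustrative.

```python
def to_tenths(rating: float) -> int:
    """Express a 0.0-7.0 rating as an integer number of tenths."""
    return round(rating * 10)


def pair_is_usable(more_liked: float, less_liked: float) -> bool:
    """Check a candidate pair of initial ratings against the stated criteria.

    - the more liked alternative is rated between 3.8 and 6.5 (assumed inclusive)
    - the other alternative is at least .1 and at most 1.5 scale points lower
    """
    hi, lo = to_tenths(more_liked), to_tenths(less_liked)
    gap = hi - lo
    return 38 <= hi <= 65 and 1 <= gap <= 15


if __name__ == "__main__":
    print(pair_is_usable(5.0, 3.5))  # gap of 1.5: allowed
    print(pair_is_usable(5.0, 3.4))  # gap of 1.6: too far apart
    print(pair_is_usable(4.3, 4.3))  # tie: no expected choice
```

A subject would be dropped before the choice phase when no pair of objects, within the assigned similar or dissimilar condition, satisfies this predicate.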


RESULTS 


The experimenter’s recording of manifest ten- 
sion was made before she knew whether peeking 
had occurred. One or more tension behaviors 
were noted in 82% of the Peekers, in 15% of 
the Nonpeekers, and in 8% of the Control 
subjects (where opportunity for peeking was 
ruled out). The difference between the values 
for the Nonpeekers and the Control subjects 
was not statistically significant but both groups 
differed from the Peekers at beyond the .001 
level. 

Analysis of variance of initial ratings of the 
chosen and unchosen alternatives yielded no 
between-condition differences (Fs < 1). The 
principal dependent variable was dissonance 
reduction as measured by decrease in liking for 
the unchosen alternative and increase in liking 
for the chosen alternative. For both com-
ponents (Table 1), results were consistent with
predictions: dissonance reduction was greatest 
when dishonesty preceded a choice between 
dissimilar alternatives; it was least when sub- 
jects were honest before choosing between 
similar alternatives; and intermediate under 
other conditions. Subsequent comparisons were 
based on a dissonance reduction score obtained 
by adding amount of increase in liking for the 
chosen to decrease in liking for the unchosen 
alternative. 
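The scoring rule just described can be sketched in a few lines. This is an illustrative calculation only, not the authors' procedure: the function names are hypothetical, and the cell means and Ns are the Nonpeeker entries for the two older age groups as read from Table 1 (the scanned values are assumed to read 0.99 and -0.94 where the print is damaged).

```python
def dissonance_reduction(chosen_change, unchosen_change):
    """Increase in liking for the chosen plus decrease in liking for the unchosen."""
    return chosen_change + (-unchosen_change)


def weighted_mean(scores, ns):
    """Mean of cell scores weighted by cell size."""
    return sum(s * n for s, n in zip(scores, ns)) / sum(ns)


# (chosen change, unchosen change, cell N): Nonpeekers, two older age groups.
# Values are read from Table 1 and are an assumption where the scan is damaged.
nonpeeker_cells = [
    (0.60, -0.55, 8),  # older group, dissimilar
    (0.46, -0.38, 8),  # older group, similar
    (0.99, -0.94, 7),  # next group, dissimilar
    (0.68, -0.63, 6),  # next group, similar
]

scores = [dissonance_reduction(c, u) for c, u, _ in nonpeeker_cells]
ns = [n for _, _, n in nonpeeker_cells]
nonpeeker_mean = weighted_mean(scores, ns)
print(sum(ns), round(nonpeeker_mean, 2))
```

Under these assumptions the four cells contain 29 subjects and the weighted mean rounds to 1.29, which agrees with the value reported later in the text for the older Nonpeekers.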

3 These subjects were discarded because their initial
ratings were apparently not valid. While this procedure
may be questionable, the inclusion or exclusion of these
nine subjects made no difference in the results.
TABLE 1
MEAN CHANGES IN LIKING OF CHOSEN AND UNCHOSEN ALTERNATIVES

                       Peeked              Did Not Peek              Control
Age group       Dissimilar  Similar    Dissimilar  Similar    Dissimilar  Similar
10-12
  Chosen         1.11 (7)   0.81 (7)    0.60 (8)   0.46 (8)    0.79 (7)   0.39 (8)
  Unchosen      -1.20      -0.80       -0.55      -0.38       -0.57      -0.40
7-9
  Chosen         1.17 (6)   0.70 (5)    0.99 (7)   0.68 (6)    0.75 (8)   0.43 (8)
  Unchosen      -1.25      -0.66       -0.94      -0.63       -0.80      -0.43
5-6
  Chosen         1.20 (8)   0.66 (5)    0.75 (8)   0.60 (8)
  Unchosen      -1.4       -0.78       -0.71      -0.54
(age label illegible)
  Chosen         1.39 (7)   0.88 (8)    0.84 (8)   0.61 (8)
  Unchosen      -1.24      -0.83       -0.70      -0.68


Note.—A positive number indicates increased liking, a negative, decreased liking. Cell Ns in parentheses. 


An analysis of variance (Table 2) showed 
a significant main effect (p < .001) of the 
dissimilar-similar variation (thus corroborat- 
ing the same finding by Brehm & Cohen, 
1959); and a main effect of the split into 
Peekers and Nonpeekers. Prior dishonesty led to 
greater dissonance reduction than prior honesty 
(p < .001). The significant interaction between 
similarity and peeking (p <.05) was at- 
tributed to the fact that the difference within 
Peekers between Dissimilar and Similar sub- 
jects was more than twice as great as the corre- 
sponding difference within Nonpeekers. A test 
of the simple effect of peeking versus nonpeek-
ing under high dissonance (dissimilar alterna-
tives) yielded a two-tailed t of 4.72 (p < .001).
The direct effect of manifest tension on dis-
sonance reduction was further evaluated by
splitting Peekers approximately at their me-
dian in manifest tension scores and testing the
difference in dissonance reduction between the
Highs and Lows thus obtained. Peeking sub-
jects high in manifest tension reduced disso-
nance more than the Lows but the two-tailed t
(1.73) fell short of significance (.05 < p < .10).


TABLE 2

ANALYSIS OF VARIANCE OF DISSONANCE REDUCTION SCORES

Source                            df       MS        F
Peeking versus No Peeking          1     172.33    20.81**
Dissimilar versus Similar          1     167.90    20.27**
Age group                          3       8.02     <1
Peeking X Similarity               1      29.84     3.60*
Peeking X Age                      3       8.68     1.05
Similarity X Age                   3       1.95     <1
Peeking X Similarity X Age         3       0.58     <1
Error                             98       8.283

Note. Because subclass frequencies were unequal, an ap-
proximation by Walker and Lev (1953, p. 381) was employed.
Omits control groups data.
* p < .05.
** p < .001.
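As a hedged arithmetic check on Table 2, each F can be recomputed as the effect mean square divided by the error mean square. The decimal placements used here (172.33, 1.95, and an error mean square of 8.283) are an assumption about the damaged scan, chosen because they reproduce the printed F values; the variable names are hypothetical.

```python
# Assumed reading of the error mean square in Table 2.
MS_ERROR = 8.283

# Effect mean squares as read from Table 2 (decimal points partly assumed).
mean_squares = {
    "Peeking versus No Peeking": 172.33,
    "Dissimilar versus Similar": 167.90,
    "Age group": 8.02,
    "Peeking X Similarity": 29.84,
    "Peeking X Age": 8.68,
    "Similarity X Age": 1.95,
    "Peeking X Similarity X Age": 0.58,
}

# F ratio for each source: effect mean square over error mean square.
f_ratios = {source: ms / MS_ERROR for source, ms in mean_squares.items()}

for source, f in f_ratios.items():
    print(f"{source:30s} F = {f:5.2f}")
```

With these readings the recomputed ratios round to 20.81, 20.27, 3.60, and 1.05 for the four effects whose F is printed in full, and the remaining three fall below 1.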

There is the possibility that an appreciable 
amount of conflict was created in the Non- 
peekers because their honesty was inconsistent 
with their desire to win a prize (Mills, 1958).
If so, this might be reflected in their dissonance
reduction scores. The likelihood of this effect
in the present experiment was examined by
testing the difference between the average dis-
sonance reduction of the Control subjects (1.10,
N = 31) and that of the 7-12 year old Non-
peekers (1.29, N = 29). Although the direction
of this difference was consistent with Mills’
argument, it was small enough to be attribut-
able to chance variation (t < 1). Both of these
groups, on the other hand, showed significantly
less dissonance reduction than the correspond-
ing group of 7-12 year old Peekers (p < .01,
two-tailed).

Finally, in that age category had no effect, 
it was considered appropriate to regard the 
main predictions as being supported by three
replications.4


DISCUSSION 


The main finding of the present experiment 
concerned the effect of the arousal of irrelevant 


4 Although there was a slight nonsignificant tendency
for greater peeking among boys in the oldest age cate-
gory, sex did not contribute to the variance, either in
main or interaction effects.
cognitive tension on magnitude of dissonance.
The results were consistent with minor exten- 
sions of formulations by Hull (1943) and Lewin 
(1951). Temporally proximate tension from 
dishonesty increased postdecision dissonance 
beyond the magnitudes produced by choosing 
between dissimilar (or similar) alternatives. 
That the variable of irrelevant tension was not 
a true manipulation but a product of subject 
self-selection, is, of course, a general qualifier 
to the following conclusions. 

The present results have significance for 
theories in psychology which recognize that, at 
a given moment, the individual’s behavior is 
typically energized by more than one motive, 
drive, tension, etc. Some investigators using 
infrahuman subjects (Kendler, 1945; subse- 
quent authors) have already explored this 
problem. A significant implication of the pres- 
ent experiment is that the same general formu- 
lation, Hull’s, leads to confirmable deductions 
when the motives or tensions involved are cog- 
nitive rather than physical. Cognitive tensions 
“summated” to produce greater response to 
the relevant tension (dissonance). This re- 
sult parallels the expectation, for example, 
that thirsty animals will respond more to 
hunger than nonthirsty animals. Theoretically, 
therefore, the present findings strongly re- 
inforce the conceptualization of dissonance 
as a drive because, when “dissonance” was 
substituted for “drive” in an appropriate for- 
mula, the ensuing derivations were empirically 
supported.

A second implication concerns the adequacy 
of present compilations of antecedents of disso-
nance. It has been assumed that dissonance can
only occur when inconsistent cognitions are 
based on relevant need satisfactions: choosing 
or committing oneself to Cognition A means
giving up the need satisfaction mapped by 
Cognition B. While this restriction is not being 
challenged—in fact, it was essential to the 
derivation that choice between dissimilar al- 
ternatives would produce more dissonance than 
a choice between similars—further elaboration
of the theory may be required. Knowledge or 
manipulation of motives or cognitions irrele-
vant to the dissonance setting ones will enable
the investigator to make more precise predic-
tions. When the conditions for differential dis-
sonance are produced, the introduction or
presence of a temporally contiguous but irrele-
vant tension generates greater response to
dissonance (its reduction) than when irrelevant 
tension is not present or has not been intro- 
duced. This proposition assumes that contigu- 
ous irrelevant tension will always summate 
with dissonance and it is unspecific as to the 
basis for the irrelevant tension. Whether sum- 
mation (and not interference or canceling) will, 
in fact, always occur, and, whether the source 
of tension is immaterial (appetition, noxious 
stimulation, etc.), are problems for further 
research. 


Lewinian Explanation 


In Lewin’s terms, tension spread from an 
innerpersonal system concerned with conflict 
between superego prescriptions and actual be- 
havior to another system concerned with dis- 
parities between what was gained and what 
was relinquished by making a decision. Spread 
of tension was facilitated by overlap among the 
two systems: the victim of dishonesty was the 
same person who proffered the choice among
alternatives. Given this framework, an explana- 
tion was sought by examining relations that 
can exist between neighboring systems. Two 
have been emphasized: intercommunication or 
spread of tension between innerpersonal sys- 
tems corresponds to fluidity and/or level of 
reality-irreality (Lewin, 1951, pp. 11-29); a 
dominant system “uses” a subordinate system
as its “tool” (Lewin, 1938, pp. 102-104). In
terms of the first possibility, it might be as- 
sumed that each subject felt some tension 
merely from being left alone by the experi- 
menter and exposed to a temptation. Subjects 
who peeked adopted a relatively more irreal 
level of functioning than those who did not 
peek. Thus, for peeking subjects, fluidity 
among innerpersonal systems was increased: 
any tension that was present would be com- 
municated more readily to neighboring sys- 
tems, specifically, that concerned with the 
decision. The honest subjects, on the other 
hand, although also experiencing tension, main- 
tained relatively impermeable boundaries be- 
tween innerpersonal systems: tension did not 
spread to motoric systems (significantly less 
was actually observed) or to the decision situa-
tion. The gist of this explanation is that dis-
honesty implies departure from reality func-
tioning and thereby increases fluidity between
innerpersonal systems and consequent inter- 
communication of tension. 

A second possibility involves the notion that 
one innerpersonal region or system bears an 
instrumental relation to another. Lewin as- 
sumed force fields corresponding to relatively 
central and relatively peripheral innerpersonal 
systems. It is plausible to assume that concern 
with violating superego prescriptions mobilized 
a relatively central system; and the decision 
situation, a relatively peripheral system. The 
former is concerned with highly salient ethical 
standards, and the latter, with simple tastes 
and preferences; violation of an ethical rule is 
more consequential for the person than choos- 
ing between crackers and toys. According to 
Lewin, “the basic dynamical fact” is that ten- 
sion in a field of forces corresponding to a 
peripheral system may be induced by tension 
in a field of forces corresponding to a central 
system. Reduction of tension in the peripheral
serves tension reduction in the central system.
If this explanation is correct, residual tension
from dishonesty should be less for high than 
for low dissonance reducing subjects. Some in- 
direct evidence concerning this implication was 
available by making a further assumption: 
subjects who felt considerable tension from 
dishonesty would be less likely to return to pick 
up their chosen object than subjects who felt 
little or none. When all subjects were split at 
the median into Highs and Lows according to 
their dissonance reduction scores, there was no 
difference in number of returning subjects 
within Nonpeekers (96% of the Highs and 
100% of the Lows). Within Peekers, however, 
the values were 88% for the Highs and 56% 
for the Lows. These differences were consistent 
with the assumption that reduction of tension 
in a peripheral system (dissonance tension) 
served reduction of tension in a central system 
(dishonesty tension). 


Dissonance Interpretations 


Peeking might be said to have aroused dis- 
sonance since dishonesty does not follow from 
usual values and training (Mills, 1958). Thus, 
“dissonance” might be substituted for “irrele- 
vant tension” and the argument recast in terms 
of cumulating dissonances. This view requires 
the assumption (Brehm, 1960, pp. 182-183) 
that restraining forces, values and training,
and inducing forces, attractiveness of movie
ticket prize, were roughly equivalent for both
Peekers and Nonpeekers. However, to para-
phrase Brehm, honest subjects presumably had
a strong attitude against peeking prior to the
experiment or they would have peeked when
tempted. If this issue of self-selection can be
ignored, a problem still remaining concerns how
dissonance occasioned by peeking can be re-
duced by revaluation of choice alternatives.
A possibility is that a person can reduce disso-
nance from dishonesty by increased liking for
whatever he is subsequently given by the
victim of his dishonesty. This hypothesis might
suggest increased liking of the chosen but not
obviously decreased liking of the unchosen al-
ternative. The appropriate test was examina-
tion of differential change in liking of the
chosen and unchosen alternatives separately to 
see whether the chosen alternative contributed 
a greater difference between Peekers and Non- 
peekers than did the unchosen. Contrary to 
this account, the difference between the means 
for Peekers and Nonpeekers on the chosen 
alternative (.32) was less than that on the 
unchosen alternative (.38). These values did 
not differ reliably (t < 1).

A second dissonance formulation was sug- 
gested by research on the role of volition in 
determining magnitude of forced compliance 
dissonance. See Cohen’s (1960) review. A per- 
son given a choice in the enactment of some 
behavior experiences dissonance when the be-
havior does not follow from some prior cogni-
tion or experience. It was assumed that ac-
ceptance of a gift from the experimenter did
not follow from victimization of the experi- 
menter by dishonesty. At least, gift acceptance 
was considered less inappropriate by a subject 
who behaved honestly when tempted. Thus, 
some overall dissonance may have been present
for Peekers, but not for Nonpeekers, because
they were accepting a gift from a person they
victimized and volition was engaged in choos- 
ing the gift. Dissonance from these sources may 
have combined with that from the decision to 
produce heightened revaluation of the choice 
alternatives. In sum, it was difficult to prefer 
either Lewinian or dissonance interpretations 
because manifest tension was present in
Peekers before the choice situation.
The hypothesis that schizophrenics would exhibit greater distortion in
estimation of duration time for stimuli relevant to their major conflicts than
for other stimuli was investigated. 3 groups, each with 16 Ss who were
homogeneous with respect to experiencing primary conflicts with either
aggression, sex, or dependency, were formed. A factorial Latin square design
was used in which Ss estimated stimuli durations following the tachistoscopic
presentation at 4 exposure times of 4 pictorial stimuli, 1 of which represented
their major conflict area. Results demonstrated that greatest distortions oc-
curred for conflict pictures at the .001 level. Distortion elicitation of the
aggression stimulus was greater than other stimuli at .001 for the combined
schizophrenics, an effect not found with normal control Ss.


Time perception and distortion have been 
long considered as important behavioral facets 
of psychopathological conditions. While there 
has been much theorizing concerning the 
relationship of temporal experience to the 
emotional or neurophysiological etiologies of 
psychopathological states, relatively few ex- 
perimental studies in this area have been 
reported. In their review of experimental 
studies of time perception, Wallace and Rabin 
(1960) stress that the few studies of time 
distortion by psychopathological subjects have 
cast little light on the nature of the underlying 
processes.

The past dozen years have seen a prolifera-
tion of studies and theorizing about the rela-
tionship of personality and emotional charac- 
teristics of subjects to their perceptual and 
cognitive processes. While there has been 
controversy concerning the specific processes 
involved, the majority of such investigations 
have been consistent in finding that affective 
and perceptual processes interact. Studies with 
schizophrenic subjects (Daston, 1956; Dunn,
1954; Harris, 1957; McGinnies & Adornetto,
1952; Zahn, 1959) have indicated that per-
ceptual thresholds and estimation of size are
dependent upon the emotional concomitants 


2 The authors wish to acknowledge the help pro-
vided by Karl J. Kadlub in rating subjects. We are
also indebted to Terrence Allen for statistical
consultation.
of the visual stimuli (tachistoscopic or screen
presentations of words or pictures) ordinarily 
employed. 

The prominence of emotional factors and 
conflicts in schizophrenics suggests that affec- 
tive factors may be integrally involved in the 
distortion of time perception frequently found 
in schizophrenic subjects. In line with other 
findings concerning perceptual accuracy, time 
perception distortion should be a function of 
the emotional state or conflict a schizophrenic 
subject is in at the time his perception of time
passage is obtained. The perception of the
exposure time duration of stimuli with sym-
bolic threat which tap known conflict areas of
subjects should be more distorted than per- 
ceptions of temporal duration of relatively 
non-conflict-laden stimuli. 

In the present study, this hypothesis was
tested through the tachistoscopic presentation
of one neutral and three conflict arousing pic-
ture stimuli to schizophrenic subjects pre-
viously rated as primarily having conflicts and
difficulty in one of three areas; namely, ag-
gression, sex, or dependency. Falk and Bindra
(1954) have reported that subjects under
stress tend to overestimate time intervals. An
assumption of this study is that schizophrenic
subjects will experience varying degrees of
stress upon presentation of the conflict pic-
torial stimuli; hence a secondary hypothesis
of the study is that more time perception
overestimation distortions are produced by
subjects when stimulated with appropriate
conflict arousing pictures than when they are
shown stimuli not tapping their major conflict
area.


METHOD


Subjects


Forty-eight patients recently hospitalized in a 
Veterans Administration Neuropsychiatric Hospital 
and diagnosed as schizophrenic served as subjects. 
They were under 45 years of age and were selected 
from a larger group of schizophrenics appearing be- 
fore an admission diagnostic staff. The presentation 
of available background information relevant to each 
patient followed by a brief diagnostic interview per- 
mitted staff personnel to discuss psychodynamic fac- 
tors and other relevant considerations in arriving at 
a diagnosis of schizophrenia. Armed with such
information, the clinical psychologist staff member 
rated each patient for the extent that his emotional 
and behavioral problems were due to difficulties and 
conflicts concerning hostility, sexual, or dependency 
factors. A separate five-point rating scale was utilized
for each of these three factors.? On the basis of such 
ratings, three patient groups were formed, each with 
16 subjects who were similar in that their behavioral 
difficulties were concentrated in one of the three 
dynamic areas. Thus the subjects in the Aggression
group had mean ratings of 3.75, .37, and .31 on the 
Hostility, Sexual, and Dependency scales. Members 
of the Sexual conflict schizophrenic group had mean 
ratings of .38, 3.75, and .88 on the three scales, 
while the Dependency group schizophrenics averaged 
.75, .13, and 3.6 on Hostility, Sexual, and Dependency
scales, respectively. These groups thus differed in the
area of affective difficulty and could be compared for
the effect on time perception that stimuli relevant
to each emotional area could elicit.


Measures


Three conflict pictorial stimuli intended to stimu-
late anxiety in each of the conflict areas and one
neutral picture were employed. These were identical 
to those previously used by Sines (1955). The ag- 
gression stimulus depicts a huge, scowling man with 
manacled wrists and clenched fists raised high ready 
to strike down upon an unsuspecting, smaller man 
standing in front and facing the other direction. The
sex picture shows an attractive young woman in a
filmy negligee lying on a couch with arms stretched


? The rater and one of the authors had evaluated 
25 diagnostic staff patients prior to the present study. 
Identical scale ratings or one-step deviations were
made for 22 patients while two-step discrepancies on
the sexual and dependency conflict scales were found
for 3 subjects. This was taken to indicate satisfactory
reliability for the purposes of the impending study
and study ratings were subsequently made by only
the rater.
out toward a man leaning slightly toward her. Card
6BM of the Thematic Apperception Test (TAT) 
series showing a downcast young man with hat in 
hand standing behind an older woman who is looking 
out of a window was used as the dependency 
stimulus. TAT Card 12BG picturing a peaceful scene 
with a boat against a stream bank served as a neutral 
pictorial stimulus. For each subject group, one 
picture served as a conflict stimulus, while the other 
three were considered as being relatively neutral. 


Procedure 


A Gerbrands tachistoscope was used to present 
the above four stimuli pictures to the three subject 
groups. Identical lighting and commonality of presen- 
tation was thus provided. Each picture was presented 
at four different exposure speeds, 5, 10, 20, and 30 
seconds. Each subject therefore viewed 16 tachisto-
scopic presentations. Following an initial randomiza-
tion of the order of presentation of the stimuli at
the several exposure times, a systematic 16 X 16 
factorial Latin square was constructed which guided 
the order of stimulus presentation. This Latin square 
was replicated for each of the three subject groups. 
Rows corresponded to the subjects within each group 
while columns corresponded to the sequence at which 
the stimuli pictures at a particular exposure time 
were presented. Within each subject group, therefore, 
the order of stimulus duration time varied system- 
atically. A particular presentation order was therefore 
identical for three subjects, one from each of the 
patient groups. 
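One way to picture the ordering scheme is a cyclic Latin square over the 16 picture-by-exposure conditions. The sketch below is illustrative only: the report states that an initial random order was followed by a systematic 16 X 16 square but does not give the construction, so the cyclic shift, the seed, and all names here are assumptions.

```python
import itertools
import random

PICTURES = ["aggression", "sex", "dependency", "neutral"]
EXPOSURES_SEC = [5, 10, 20, 30]


def latin_square_orders(seed=0):
    """Return 16 presentation orders forming a cyclic 16 x 16 Latin square.

    Row r gives the order for subject r within a group; column c is the
    position in the sequence. Each condition occurs once per row and column.
    """
    conditions = list(itertools.product(PICTURES, EXPOSURES_SEC))
    random.Random(seed).shuffle(conditions)  # initial randomization
    n = len(conditions)
    # Systematic step (assumed): shift the random order by one place per row.
    return [[conditions[(row + col) % n] for col in range(n)] for row in range(n)]


square = latin_square_orders()
# The same square would be reused for each of the three subject groups,
# so row r gives one order shared by three subjects.
print(square[0][:3])
```

Reusing one square across groups matches the statement that a particular presentation order was identical for three subjects, one from each patient group.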

Subjects were given no initial time estimation set 
for the presentation of the pictures, but merely told 
that some pictures were going to be presented 
through the viewing instrument. Following the first 
timed presentation and thereafter, subjects were 
asked to judge the length of time the picture was 
present on the instrument screen. 


Analysis 

All time judgments of subjects were converted 
into scores based upon the ratios of estimated to 
actual time and then multiplied by a factor of 10. 
Scores of 10 thus represented accurate time percep- 
tions while over- and underestimation corresponded, 
respectively, to scores larger and smaller than 10.
Since the primary intent was to investigate the extent 
of time perception distortion, the absolute deviation 
of subjects’ scores from the score of 10 (accurate 
judgment) disregarding algebraic sign was then ob- 
tained and used in the basic analysis of data. The 
secondary hypothesis, however, was evaluated 
through a consideration of algebraic signs; that is, 
the number of scores greater and smaller than an 
accurate time perception score of 10. 
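The scoring described above can be sketched as follows. The function names and the example estimates are hypothetical; only the ratio-times-10 score, the absolute deviation from 10, and the over/under classification come from the text.

```python
def time_score(estimated_sec, actual_sec):
    """Ratio of estimated to actual duration, multiplied by 10."""
    return 10.0 * estimated_sec / actual_sec


def distortion(estimated_sec, actual_sec):
    """Absolute deviation of the score from 10, ignoring sign."""
    return abs(time_score(estimated_sec, actual_sec) - 10.0)


def direction(estimated_sec, actual_sec):
    """Classify a judgment for the secondary (sign-based) analysis."""
    score = time_score(estimated_sec, actual_sec)
    if score > 10.0:
        return "over"
    if score < 10.0:
        return "under"
    return "accurate"


# Hypothetical judgments: 15 s estimated for a 10 s and for a 20 s exposure.
print(time_score(15, 10), distortion(15, 10), direction(15, 10))
print(time_score(15, 20), distortion(15, 20), direction(15, 20))
```

The first example scores 15.0 (distortion 5.0, an overestimate) and the second 7.5 (distortion 2.5, an underestimate), so the same raw estimate yields different distortions depending on the actual exposure time.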


RESULTS 


Since Hartley’s F max test indicated no 
significant heterogeneity of variance for the 
subject groups, an analysis of variance was 
carried out. The results of the main analysis
are summarized in Table 1.


TABLE 1

ANALYSIS OF VARIANCE OF TIME PERCEPTION DEVIATIONS OF SCHIZOPHRENIC SUBJECTS

Source                      SS       df      MS        F
Sequence (A)              186.91     15     12.46     .90
Type of subject (B)        86.32      2     43.16    3.11*
A X B                     416.62     30     13.89
Pictures (C)              765.49      3    255.16    8.52***
C X A                    1067.97     45     23.73     .79
C X B                    1468.86      6    244.81    8.17***
C X A X B                2695.76     90     29.95    1.26
Exposure times (D)         62.48      3     20.83     .62
D X A                    1574.86     45     35.00    1.05
D X B                      42.12      6      7.02     .21
D X B X A                3013.76     90     33.49    1.41**
C X D                      50.46      9      5.61     .24
C X D X A                3652.97    135     27.06    1.14
C X D X B                 236.65     18     13.15     .55
C X D X A X B            6405.57    270     23.72

* p < .10.
** p < .05.
*** p < .001.

Major Hypothesis 


Support for the major hypothesis that time 
perception distortions of schizophrenics should 
be greatest when presented stimuli represen- 
tative of their major conflict is evidenced in 
Table 1 by the significance of the interaction
of subject type and stimuli pictures at the
.001 level. Table 2 presents the mean distor-
tion for each subject group on each conflict
stimulus picture. The diagonal cells show
the largest distortions and suggest that the
greatest selective distortion occurred when
subjects responded to stimuli tapping the area
of their greatest maladjustment. t tests com-
paring each diagonal mean to the combined
average of the other two lateral means (e.g.,
for Aggression schizophrenics, the aggression
stimulus mean was compared to the average
of the sex and dependency picture means)
were carried out. Aggression subjects showed
significantly greater time perception distortion
on the aggression stimulus at the .001 level
than on the other two conflict stimuli. Schizo-
phrenics with primary sexual maladjustment
also distorted significantly more on the sex
stimulus at the .01 level than they did on the
aggression and dependency pictures. Such
selective distortion was not found for depend-
ency schizophrenics. Figure 1 graphically
shows the presence of time perception distor-
tions. It indicates that Dependency subjects
showed an essentially equivalent distortion for
all stimuli presentations.


TABLE 2

MEAN TIME PERCEPTION DISTORTIONS FOR SCHIZOPHRENIC SUBJECT GROUPS ON CONFLICT STIMULI PICTURES

                                   Pictures
Subject groups          Aggression    Sex    Dependency
Aggression subjects        9.9        4.3       3.6
Sex subjects               4.6        6.9       4.2
Dependency subjects        6.3        5.3       6.5


FIG. 1. Relative time distortion means of schizophrenic conflict groups on four pictorial stimuli.

FIG. 2. Relative time distortion means of four pictorial stimuli.
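The contrast behind those t tests can be illustrated with the Table 2 means. This sketch computes only the difference between each group's own-conflict mean and the average of its other two means; it does not reproduce the t tests, which need the subject-level data. The names are hypothetical, and the entries printed as "43" and "42" in the scan are assumed to be 4.3 and 4.2.

```python
# Mean distortions by subject group (outer key) and picture (inner key),
# as read from Table 2; decimal points are assumed where the scan dropped them.
table2 = {
    "aggression": {"aggression": 9.9, "sex": 4.3, "dependency": 3.6},
    "sex": {"aggression": 4.6, "sex": 6.9, "dependency": 4.2},
    "dependency": {"aggression": 6.3, "sex": 5.3, "dependency": 6.5},
}


def diagonal_contrast(means, group):
    """Own-conflict mean minus the average of the other two picture means."""
    own = means[group][group]
    others = [value for picture, value in means[group].items() if picture != group]
    return own - sum(others) / len(others)


contrasts = {group: diagonal_contrast(table2, group) for group in table2}
for group, value in contrasts.items():
    print(f"{group:12s} {value:.2f}")
```

Under these readings the differences are about 5.95 for Aggression subjects, 2.50 for Sex subjects, and 0.70 for Dependency subjects, which is the ordering the significance levels in the text would lead one to expect.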


Stimuli Pictures, Subject Set, and Subject 
Groups 


It can be seen from Table 1 that the vari-
ous picture stimuli differ significantly at the
.001 level in the extent that they elicit time
perception distortion. Mean time perception
distortions for the four pictures were analyzed
through Duncan's multiple range test. The
mean distortion on the aggression photograph
was significantly greater than distortion occur-
ring on the sex stimulus at the .05 level and
on the dependency and neutral pictures at the
.001 level. The sex stimulus elicited greater
distortion at the .05 level than did the neutral 
picture, but did not differ significantly from 
the dependency stimulus. No significant dif- 
ference between dependency and neutral pic- 
tures was present. These relationships are 
graphed in Figure 2. 

Table 1 also indicates that while subject 
groups did not differ significantly in the over- 
all accuracy of their time perceptions, a 
tendency for this to occur seemed present 
(.05 < p < .10). Examination of data sug-
gests that the sexual conflict schizophrenics
tended to be most accurate while dependency 
schizophrenics tended to be least accurate. 
Figure 3 illustrates the distortions of the 
three groups and the combined schizophrenic 
sample at each exposure speed. Dependency 
and sex schizophrenics show similar response 
patterns of time distortion at the four ex- 
posure times although magnitudes are higher 
for dependency subjects. Although less distor-
tion in time perception occurred at the longest
exposure time of 30 seconds, Table 1 again
indicates that there were no significant dif-
ferences between exposure times and that all
groups were essentially similar in their distor- 
tions at the various times. 

Since subjects were initially unprepared to 
render time judgments, the first and possibly 
second estimations may be considered as being 
given under a nonset condition. Thereafter, 
a time perception set may be considered as 
being operative. Analysis of sequences, shown
in Table 1, was nonsignificant. Hence no sig- 
nificant difference between nonset and set 
conditions was present. Table 1 also indicates 
a significant interaction at the .05 level be- 
tween exposure speed, type of subject, and 
sequence of presentation. An examination of
data failed to provide a satisfactory psycho-
logical interpretation of this interaction.


Time Perception Over- and Underdistortions 


The secondary hypothesis that more time 
perception overestimation distortions would 
be produced by subjects when stimulated with 
appropriate conflict arousing pictures than 
when shown stimuli not tapping their major 
conflict area was investigated through appro- 
priate chi square analysis. The pattern of 
subjects’ time perceptions was found to differ 
significantly for conflict and nonconflict 
stimuli at the .001 level. This was largely
due, however, to the significantly greater
number of correct time perceptions at the
.001 level made by all subjects to the non-
conflict stimuli presentations than to the
appropriate conflict pictures. However, a tend-
ency was found for subjects to distort more
often by overestimation than by underestima-
tion for appropriate conflict stimuli than for
the nonconflict pictures (.05 < p < .10).
While this tendency is not definitively signifi-
cant, it does lend some support to the study's
secondary hypothesis.

Further chi square analyses revealed the
following:

1. No significant differences in number of
over- and underestimations were found be-
tween the four exposure times.

2. Subjects overestimated significantly
more often at the .01 level at each exposure
time than they underestimated.

3. Aggression schizophrenics’ time percep-
tion distortions involved significantly more
overestimations than did Sex and Dependency
schizophrenics’ distortions at the .001 and
.05 levels, respectively. Dependency schizo-
phrenics distorted significantly more often
by overestimating than did Sex schizophrenics
at the .02 level.


FIG. 3. Time distortion means of schizophrenic conflict groups at four exposure times.
TABLE 3

ANALYSIS OF VARIANCE OF TIME PERCEPTION DEVIATIONS OF NORMAL SUBJECTS

Source                     SS       df      MS         F
Subjects (A)             2860.06    15    190.67    13.2502***
Sequence (B)              238.22    15     15.88     1.1035
Pictures (C)               21.70     3      7.23      .5024
Exposure Times (D)         17.65     3      5.88      .4086
C X D                     181.05     9     20.12     1.3982
A X D                     640.92    45     14.24      .9896
A X C                     452.82    45     10.06      .6991
Error                    1726.32   120     14.39

*** p < .001.


Normal Control Subjects 


The above results characterize schizo- 
phrenic subjects but do not permit us to say 
whether normals would respond differently in 
a similar time perception distortion situation. 
In particular, we cannot exclude the possi- 
bility that the distortion proneness of the 
aggression stimulus is a cultural phenomenon 
which would affect normal subjects in a 
manner similar to that of schizophrenic 
patients. In order to clarify this issue, the 
experiment was replicated for a group of 16 
normal subjects who were randomly re- 
cruited from the various occupational levels 
at the hospital. Data were analyzed through 
analysis of variance which is summarized in 
Table 3. 

Results clearly indicate that the control
subjects differed significantly among them-
selves in their time perception distortions.
This was expected since no attempt had been
made to select subjects who were homogene-
ous with respect to personality characteristics.
There was no significant interaction between
control subject distortion and pictorial stim-
uli. Contrary to the finding with schizo-
phrenic patients, the pictorial stimuli did
not differ significantly in their time percep-
tion distortion elicitation with the normal
subjects. As with the schizophrenics, no
significant differences were present between
time perception set and nonset conditions for
the several exposure time presentations. Chi
square analyses disclosed that the schizo-
phrenics overestimated times significantly
more often than did control subjects at the
.001 level. Contrary to the finding with
schizophrenics, control subjects did not over-
estimate significantly more often than they
underestimated.


Discussion 


The results indicating selective time per- 
ception distortions for schizophrenic subjects 
are thus analogous to findings reported by 
others that other aspects of perception and 
judgment are similarly affected by the per- 
sonal relevance of the stimuli. Although the 
several schizophrenic groups were sensitive 
primarily to one idiosyncratic class of picture 
stimuli, the aggression picture produced the 
greatest overall time perception distortion 
for the combined groups of schizophrenics 
at each exposure duration. Since such dis- 
tortion was not found for control subjects we 
can assume that this finding is relevant to 
the dynamics of schizophrenia. If distortion 
effects may be interpreted as stemming from 
anxiety induced by emotion arousing stimuli, 
then the aggression picture would appear to 
pose the greatest threat to the average 
schizophrenic subject. From a common sense 
view, aggressive behavior is most likely to 
result in interpersonal difficulties and tends 
to lead quickly to hostile and energetic 
counterreactions from society. From a psy- 
choanalytic framework, several investigators 
have viewed aggression as being the core of 
the schizophrenic disorder. Bak (1954) has 
postulated that in schizophrenia, the ego has 
been unable to neutralize aggressive drives, 
that psychosis is predominantly an unsuc- 
cessful defense against aggression. Cohen 
(1954) supports this view and finds a dy- 
namic relationship in therapy between schizo- 
phrenic symptoms and aggression. He reports 
that when severe hostile feelings appeared in 
schizophrenics, their symptoms receded; 
when their symptoms reappeared, hostility 
diminished. 

Some question may be raised as to the 
adequacy of TAT Card 6BM as a depend- 
ency stimulus. It is not as clear-cut in its 
stimulus import as are the aggression and 
sexual stimuli and may not picture depend- 
ency strivings or conflicts primarily. It may 
be that the use of a stimulus with greater 
saliency relative to the dependency variable 
also would have demonstrated a selective time 
perception distortion for this area of conflict
of the dependency schizophrenics. 

Time perception of normal subjects by 
the method of verbal estimation has been 
reported by Clausen (1950) as character- 
istically resulting in greater distortion 
through over- rather than underestimation. 
Our normal control subjects, however, did 
not overestimate significantly more often 
than they underestimated. Overestimation of 
short time intervals also has been found to 
characterize the performance of schizophrenic 
subjects (Lhamon & Goldstone, 1956), al- 
though Rabin (1957) reports data showing 
that schizophrenics tend to underestimate 
longer periods of time (.5 to 1 hour) more often
than they overestimate. Fisher and Fisher 
(1953) suggest that in general, subjects’ un- 
conscious perceptions of parents as being 
dominant is correlated with their tendency 
to overevaluate time. Since parental domina- 
tion, particularly by the mother, has been 
reported in numerous studies of schizo- 
phrenics and their family dynamics, schizo- 
phrenic subjects should be expected to 
overestimate time. The greater number of 
time overestimations given by schizophrenics 
to the short time intervals of this study are 
thus consistent with expectations and previ-
ous findings.

The tendency for more frequent overesti- 
mations to occur when schizophrenics were 
shown their appropriate conflict picture than 
for the other pictorial stimuli bears some 
similarity to a finding by Harris (1957). In 
a study of the size judgment of pictures 
with mother-son thematic content, he found 
that poor premorbid adjustment schizo- 
phrenics who presumably experienced poor 
mother-son relationships, tended to enlarge or 
overestimate picture size; good premorbid 
patients, however, tended to diminish picture 
size. If the thematic content of Harris’ pic-
tures may be considered as traumatic for
poor premorbid subjects, then it would seem,
as in the present study, that the stimulation
of core conflicts and the induction of anxiety
result in exaggerated perceptual and cognitive
subject reactions.


CONTRIBUTION OF RESPONSE BIAS TO
RECOGNITION THRESHOLDS 


JANET TAYLOR SPENCE
Northwestern University 


The influence of differential usage preference on verbal identification measures 
was investigated using a technique which closely mimics the typical recognition 
threshold investigation. Ss were unrestricted by E in their choice of responses 
and were presented with stimulus materials at varying duration levels. The 
stimuli, however, were presented in such a manner that Ss’ partial perception 
could to some degree be controlled. These partial perceptions made 2 re- 
sponse alternatives equally appropriate, one a familiar word and the other a 
less familiar one. In 2 experiments varying in procedure, the frequency with 
which familiar responses occurred as guesses was significantly (p < .01) 
greater than the frequency of unfamiliar responses. 


The original purpose of the studies investi- 
gating the relationship between recognition 
thresholds and nonphysical properties of the 
stimuli was to test the hypothesis that per- 
ceptual processes, conceived of as hypothetical 
internal events, are influenced by such vari-
ables (e.g., Bruner & Postman, 1947; Postman,
1953). In interpreting the empirical relation-
ships that were obtained as supporting the
perceptual hypothesis, several implicit assump-
tions appear to have been made: first, that for
all practical purposes, the subject’s perceptions
could be assumed to be isomorphic to his
overt behavior (typically a verbal response),
and second, that the subject’s perception of
the presented stimulus complex could con-
veniently be regarded as occurring on an all-
or-none basis.

Since the challenge of the perceptual inter-
pretation of the taboo word studies by Howes
and Solomon (1950) and the demonstration
by Bricker and Chapanis (1953) that sub-
threshold presentations of stimuli do convey
information, the importance of partial cues
in influencing recognition measures has been
receiving increasing attention. The role of
partial cues has, in turn, been used by a
number of investigators to dispute various
aspects of the perceptual interpretation of
threshold studies.

While the intent of many of these studies
seems to have been to disconfirm the per-
ceptual hypothesis or to suggest the methodo-
logical inadequacies of various psychophysical
techniques as “pure” measures of perception,
positive information concerning the factors
that do contribute to empirical recognition
measures has been obtained in the process.
Perhaps as a result of these critical efforts, 
interest seems increasingly to have shifted 
away from preoccupation with the “is it or 
isn’t it perception” issue. Replacing this 
interest is concern with the empirical rela- 
tionships per se, and with the factors, per- 
ceptual or nonperceptual, that determine 
them. For example, in attempting to resolve 
the dilemma of “unconscious perception” 
posed by the so-called perceptual defense and 
subception phenomena, such investigators as 
Ericksen (1956) and Wiener and Schiller 
(1960) have shown how these phenomena can 
be more adequately explained on the basis 
of responses to partial cues. Other investi- 
gators, particularly Goldiamond (1958), have 
gone even further, suggesting that if partial 
perceptions are taken into account, both 
sensitization and defense phenomena can be 
more parsimoniously explained by response 
factors that are independent of perceptual 
processes. The typical technique employed in 
recognition studies, Goldiamond has noted, 
requires the subject to guess the identity of 
the presented stimulus pattern under varying 
degrees of perceptual adequacy. Under sub- 
liminal conditions in which the information 
perceived by the subject makes a number 
of responses equally appropriate, the subject
may exhibit a preference or bias for emitting
familiar responses in preference to unfamiliar,
responses congruent with his motivational 
state, etc. In fact, Goldiamond argues, the 
various empirical relationships that have been 
found in these recognition studies may be 
due exclusively to the effects of these non- 
perceptual response biases. 

Several investigations have been conducted 
to demonstrate the influence of response bias 
on recognition measures. By arbitrarily desig- 
nating certain responses as “correct” on each 
exposure of a blank slide, Goldiamond and 
Hawkins (1958) found lower “thresholds”
for familiar than for unfamiliar stimuli via
a pseudoascending method of limits tech-
nique. Similarly, Goldstein (1962) was able 
to demonstrate greater “accuracy” of identi- 
fication of neutral as opposed to anxiety 
evoking words when subjects were shown 
blank slides or were presented with the actual 
stimuli at subliminal levels. 

Certain features of the design of these 
studies should be noted. In both investiga- 
tions, subjects were not only restricted in 
their responses to the limited number of items 
with which they had previously been familiar- 
ized and/or with which they were furnished 
during the recognition test, but were also 
expected on every trial to select one of the 
responses from this list. With little or no 
stimulus information, subjects had no alterna- 
tive but to guess, thus encouraging the 
operation of response biases. Equally impor- 
tant, only the severe restriction in their choice 
of responses made it possible for subjects 
to guess correctly on a fair number of oc- 
casions, thus allowing these biases to influence 
"accuracy" measures. In the usual recognition 
study, however, the subject has an almost 
unlimited choice of responses from among 
his entire verbal repertoire; accurate identi- 
fication is possible to achieve primarily if not 
exclusively because of information obtained 
from the presented stimulus complex. Not 
only is the subject’s response rarely a pure 
guess (i.e., most exposures convey some
information) but the probability of chance 
congruence between the stimulus and a pure 
guess is almost nil. Response bias could, of 
course, also operate when the subject is per- 
ceiving partial cues and thus not purely 
guessing. But it might be argued that no 
convincing evidence has been provided that
response biases influence threshold measures 
to any important degree in the typical situa- 
tion in which the subject is relatively un- 
limited in his response alternatives and must 
therefore rely on fairly extensive partial per- 
ceptions in order to identify the stimulus 
correctly. Further, a blank slide study by 
Smock and Kanfer (1961) indicated that the 
relationship between degree of prior familiar- 
ity with nonsense words and pseudothresholds 
was less marked in a group led to believe 
that stimuli were really being exposed than 
in a group told that correct guessing had to 
be based on ESP. Such results also suggest 
that response bias may be reduced when 
partial cues are actually perceived. 

In the present study, an attempt was also 
made to determine the influence of response 
biases on verbal responses but with procedures 
which more closely mimic the typical recog- 
nition investigation. The experimental vari-
able examined in this context was the degree
of word familiarity, as reflected by the 
Thorndike-Lorge (1944) word count. The 
subjects were unrestricted in their choice of 
responses and were actually presented with 
stimulus materials. The conditions were such, 
however, that subjects’ partial perceptions 
could to some degree be controlled and 
made equally appropriate several response 
alternatives varying in their degree of prior 
familiarity. 


EXPERIMENT I 
Method 


Materials and apparatus. The basic unit out of
which the experimental lists were constructed con-
sisted of a four-letter “word” made up of three
letters and a blank in one of the four spatial posi-
tions. The blank could be filled in with either of two
(but only two) letters to form complete English
words (e.g., SE-F may be completed as SERF or SELF).
In each instance, one of these completions was a
high frequency (HF) word, as determined by a
Thorndike-Lorge (1944) count of 50 or more per
million; the second completion was a low frequency
(LF) word, as determined by a count of 9 or less
per million. Twelve units were employed.

Two slides were prepared for each unit. In both
slides the three common letters were typed so that
each was clearly visible. The fourth letter differed
on the two slides to form the two possible com-
pletions. This letter was typed faintly so that it was
barely visible. Two experimental lists were devised
from the 24 slides. The first list contained one of the
[...] occurred twice in spatial positions 2-4 in the
[...] and the LF words for each list.

The 12 word units are as follows, the first letter
in parenthesis indicating the HF completion and
[...]ed letter the completion employed in List 1:
[...]J)NK, R(O,I)LL, DA(R,N)K, SE(L,R)F, CLU(B,E),
[...]), L(I,O)ON, R(U,A)SH, MI(L,N)E, GO(A,U)T,
[...]EA), and NOO(N,K). To the 12 experimental
items on each list were added 5 additional four-letter
words, all appearing in uniformly dark type: a
practice item and four control items. Two of the
latter were HF words and two were LF. Both experi-
mental lists contained the same practice and control
items.


The 17 words on each list, typed in capital letters
on separate slides, were presented by means of a
Gerbrands Mirror tachistoscope. Both exposure and
pre-exposure fields were white and of equal illumina-
tion. The words appeared between two fixation
points marked vertically on the pre-exposure field.

Subjects. Students in introductory psychology
classes at Northwestern University served as subjects.
Subjects were alternately assigned to one of two
groups as they appeared to be tested, each given one
of the two parallel lists. Fifty subjects, 24 in Group I
and 26 in Group II, were tested; six additional sub-
jects were seen but testing discontinued when they
failed to identify a word correctly within a pre-
determined exposure time (.50 second).

Procedure. The instructions to each subject were
those typical for recognition threshold studies using the
method of limits and did not include any
reference to the faint letters on most of the slides.
As usual, the subject was encouraged to guess on
each exposure.

Each subject was then given one of the two lists.
The practice item occurred first and the control items
in the fourth, eighth, twelfth, and sixteenth positions.
The order of appearance of the specific experimental
and control words was random, a different order
being used for each subject.

The threshold for each word was determined by
the ascending method of limits. The initial exposure
for a word was set at .03 second, the settings
being increased in .01-second steps until the subject
identified the word correctly.
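
The ascending series described above can be sketched as a short loop. This is only an illustration under stated assumptions: durations are held as whole hundredths of a second, the `respond` callable and the simulated subject are hypothetical, and the .50-second ceiling is taken from the exclusion rule given under Subjects.

```python
def ascending_threshold(respond, target, start=3, step=1, ceiling=50):
    """Run one ascending series, with durations in 1/100 second.

    `respond(duration)` is a hypothetical callable returning the word the
    subject reports at that exposure. Returns (threshold, wrong_guesses),
    or (None, wrong_guesses) if the ceiling passes without a correct report.
    """
    wrong_guesses = []
    duration = start
    while duration <= ceiling:
        answer = respond(duration)
        if answer == target:
            return duration, wrong_guesses
        wrong_guesses.append(answer)
        duration += step
    return None, wrong_guesses


def example_subject(duration):
    """Invented subject: no report below .06 sec, the familiar completion
    ROLL from .06 to .09 sec, and the correct word RILL from .10 sec on."""
    if duration < 6:
        return ""
    return "ROLL" if duration < 10 else "RILL"


if __name__ == "__main__":
    threshold, guesses = ascending_threshold(example_subject, "RILL")
    print(threshold, guesses.count("ROLL"))
```

Keeping the list of wrong reports alongside the threshold is what allows alternate completions (ROLL offered for RILL) to be counted afterward.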


Results


For each subject the mean threshold per
word was computed separately for the control,
HF, and LF words and from these, the means
for each group on the three types of items.
These data are presented in Table 1. A com-
parison of the mean thresholds of the HF and
LF words for each group yielded highly sig-
nificant differences (t’s of 5.43 and 9.74,
p < .01). These results, of course, essentially
do no more than replicate the findings of
previous studies with respect to the relation-
ship between word frequency and threshold.
Of greater interest to the present study is the 
analysis of the subjects’ guessing behavior. 

For each experimental word the number of 
times each subject guessed the alternate com- 
pletion was counted (e.g., the number of 
times the subject offered ROLL as a response 
when the correct response was RILL). As 
may be seen from Table 2, HF guesses to
LF words were far more frequent in both
groups than LF guesses to HF words.
Direct difference t’s indicated
that these differences were highly significant
(p < .001).

Because of the unequal variances in Group
II, a Wilcoxon matched-pairs signed-ranks
test was also performed on the data for
this group; the difference between HF and
LF words remained significant (z = 4.46,
p = .005). Since subjects not infrequently
gave the HF guess to an LF word more than 
once, a more stringent comparison involved 
counting for each subject the number of words 
of each type to which the alternative com- 
pletion had been given one or more times. 
The mean numbers of LF words to which an
HF guess was given were 3.00 and 3.73
(out of 6) for Groups I and II, respectively,
as compared to means of .75 and .38 for LF
guesses to HF words. Direct difference t’s
indicated that these differences were also
highly significant (p < .001).

These discrepancies in guessing behavior 
between the two types of words are in part 
due to differences in threshold, i.e., LF words 
required a greater number of exposures than 


TABLE 1

MEAN THRESHOLDS PER WORD FOR THE THREE
TYPES OF WORDS: EXPERIMENT I

                      Word type
Group          Control      HF        LF
I       M        7.11     10.18     15.49
        σ        2.47      3.00      6.68
II      M        8.05      9.32     18.73
        σ        2.26      2.78       .68

Note. In 1/100 second.
TABLE 2

MEAN NUMBER OF ALTERNATIVE GUESSES PER WORD
FOR THE TWO TYPES OF WORDS: EXPERIMENT I

Guessed            Group I           Group II
word type         M       σ         M       σ
HF              1.05     .57      2.47    4.78
LF               .47     .29       .12     .35

HF words before being identified correctly, 
thus giving a greater number of opportunities 
for the incorrect alternative to be given as 
a response. Analyses of covariance of the 
number of HF and LF guesses were therefore 
carried out, adjusting the guessing measures 
for differences in thresholds. Although the 
differences were attenuated, the Fs were 
significant for both groups (p = .01). 


EXPERIMENT II 


An attempt was made to replicate the results of 
the first study using a somewhat different method. 
Each experimental item appeared as three clearly 
typed letters, plus a vague smudge in place of the 
missing letter of the “word”; the two completions 
(HF and LF) were, with this arrangement, equally 
reasonable. In a preliminary study, one of the al- 
ternatives was arbitrarily designated as “correct” for 
each item and an attempt made to find “thresholds” 
by the ascending method of limits. This procedure 
produced highly variable results, many subjects 
failing to give the correct response on one or two 
items after many exposures at the slowest duration. 
For this reason, the method of constant stimuli was 
employed in the present study, the subject being 
presented with each item a standard number of times 
at predetermined speeds. Since there was no objec- 
tively correct response, no attempt was made to 
establish “thresholds” from these data. Instead a 
comparison was made of the number of HF and LF 
guesses for each item. 


Procedure 


Subjects. Twenty-seven students in introductory 
psychology classes served as subjects. 

Materials and apparatus. The 12 experimental 
units were the same as those employed in Experi- 
ment I. Each was typed on a slide with three clear 
letters; the missing fourth letter was represented by 
a smudge. As in Experiment I the stimulus materials 
were presented to each subject individually by means 
of a Gerbrands tachistoscope. 

Procedure. After demonstration of the apparatus 
with a practice word, the subject was shown each 
of the 12 experimental items once at each of six 
duration intervals (.06, .09, .12, .15, .18, and .21 
second). The order of the 72 presentations was
random with respect to item and to duration interval,
a different order being used for each subject. The 
subjects were not informed of the incomplete form 
of the items and were asked to guess at the word 
after each presentation. 
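
The presentation schedule just described (12 items at each of six durations, in a fresh random order for every subject) might be generated as in the following sketch. The item labels, the function name, and the use of a per-subject seed are assumptions made for illustration; the original randomization procedure is not described.

```python
import random

DURATIONS = [6, 9, 12, 15, 18, 21]  # in 1/100 second, as listed in the text


def presentation_order(items, durations=DURATIONS, seed=None):
    """Return every (item, duration) pair exactly once, in shuffled order."""
    rng = random.Random(seed)
    schedule = [(item, d) for item in items for d in durations]
    rng.shuffle(schedule)
    return schedule


if __name__ == "__main__":
    items = [f"item{i:02d}" for i in range(1, 13)]  # placeholder labels
    order = presentation_order(items, seed=1)
    print(len(order), order[:3])
```

Shuffling the full set of item-by-duration pairs, instead of shuffling items within each duration, is what makes the order random with respect to both item and duration interval.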


Results 


Responses to each experimental item at 
each duration level were examined for each 
subject and the number of HF and LF guesses 
counted. (Responses other than the two 
correct alternatives were thus ignored.) As
may be seen from Table 3, although the
number of both HF and LF guesses increased
with duration level, HF responses occurred
far more often at each level than LF. In
order to compare the occurrence of the two 
types of guesses statistically, the combined 
total of HF and LF guesses was first deter- 
mined for each subject and then the per- 
centage of this total represented by HF re- 
sponses was obtained. The hypothesis was 
then tested that the group was a sample 
drawn from a population whose mean per- 
centage of HF guesses was 50. It was possible 
to reject this hypothesis at a high level of 
confidence (t = 16.39, p < .001). 
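
The test just described amounts to a one-sample t test of each subject's percentage of HF guesses against a population mean of 50. A minimal sketch follows; the guess counts are invented for illustration and are not the study's data, and the helper names are hypothetical.

```python
import math
import statistics


def percent_hf(hf_guesses, lf_guesses):
    """Percentage of a subject's HF-plus-LF guesses that were HF."""
    return 100.0 * hf_guesses / (hf_guesses + lf_guesses)


def one_sample_t(values, mu):
    """Return (t, df) for H0: the population mean of `values` equals mu."""
    n = len(values)
    standard_error = statistics.stdev(values) / math.sqrt(n)
    return (statistics.mean(values) - mu) / standard_error, n - 1


if __name__ == "__main__":
    # Invented (HF, LF) guess counts for five hypothetical subjects.
    counts = [(8, 2), (9, 1), (17, 3), (19, 1), (15, 5)]
    percentages = [percent_hf(hf, lf) for hf, lf in counts]
    t, df = one_sample_t(percentages, 50.0)
    print(f"t({df}) = {t:.2f}")
```

Converting each subject's counts to a single percentage before testing keeps the subject, not the individual guess, as the unit of analysis.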


DISCUSSION 


It was suggested earlier that response 
biases are likely to be more prominent when 
the subject has only minimal cues upon which 
to base his response (a hypothesis that has 
bearing on the rationale of projective tests) 
and that with minimal cues, the extent to
which these biases influence accuracy meas- 
ures is determined by the restriction that 
the experimenter places on the subject’s choice 
of response alternatives. The results of the 


TABLE 3

MEAN NUMBER OF HF AND LF GUESSES FOR THE
12-WORD UNITS AT EACH DURATION LEVEL
AND MEAN PERCENTAGE OF TOTAL
GUESSES REPRESENTED BY HF
RESPONSES: EXPERIMENT II

Guess              Duration level (a)                 M % HF
type        6      9      12     15     18     21     guesses
HF        3.18   4.59   5.29   5.48   5.66   5.59     85.58
LF         .59    .63    .85   1.11   1.22   1.22

(a) In 1/100 second.




present investigation, however, clearly demon-
strate that when the subject is relatively un- 
restricted in his responses and is provided 
with fairly complete information about the 
stimulus, differential usage preferences may 
still influence his verbal responses to a marked 
degree. These results are not necessarily con- 
tradictory to the contentions offered above 
since different factors (or weightings of these 
factors) may contribute to the empirical rela- 
tionships found in studies in which the subject 
is restricted in his choice of responses and 
those in which he is not. 
In the former instance, the experimenter
has made available a number of responses,
including the correct one. When little or no
stimulus information is provided, and response
alternatives are few, purely chance congru-
ences between the subject’s guess and the
stimulus will occur on an appreciable number
of occasions. If the subject is biased toward
responding with one type of item over an-
other, the probability of chance congruence
will be affected, resulting in pseudodifferences
between stimulus classes in “accuracy” of
identification. However, as perceptual condi-
tions become more adequate, partial cues
become available. The subjects’ responses
under these conditions tend to be limited to
those items on the list that these partial
cues make appropriate (Bricker & Chapanis,
1953). These cues, and the number and type
of items they happen to fit, are largely chance
matters. The overall effect is to allow less
“room” for guessing biases to operate.
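
The chance-congruence argument for the restricted case can be made concrete with a small calculation. The sketch assumes a blank slide, a response list of equal numbers of HF and LF words, and a subject who first chooses a word class (HF with some fixed probability) and then picks uniformly within it; the list size and bias value are invented for illustration.

```python
def chance_accuracy(n_hf, n_lf, p_hf_guess):
    """Expected proportion of 'correct' pure guesses on a blank slide.

    Assumes the subject picks the HF class with probability p_hf_guess and
    then chooses uniformly among the words of the chosen class.
    """
    return {
        "HF": p_hf_guess / n_hf,          # the designated word is an HF item
        "LF": (1.0 - p_hf_guess) / n_lf,  # the designated word is an LF item
    }


if __name__ == "__main__":
    # Hypothetical list of 6 HF and 6 LF words, with 80% of guesses HF.
    for word_class, p in chance_accuracy(6, 6, 0.8).items():
        print(f"{word_class}: {p:.3f}")
```

Under these assumptions HF items are "identified" about four times as often as LF items although no stimulus information is available, which is the pseudodifference in accuracy described above. With an unbiased guesser (p_hf_guess = .5) the two proportions are equal.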




When the subject is relatively unrestricted
by the experimenter in his responses, the
same type of self-restriction takes place, i.e.,
the subject tends to limit his responses to
those that are appropriate to the partial in-
formation he perceives. But all appropriate
responses, and not merely those that happen to
be on a list supplied by the experimenter,
are reasonable choices. If a number of alterna-
tives occur to the subject it is likely that he
will respond with the most familiar, the least
threatening, etc. He may thus exhibit the
same kind of response or guessing bias found
in the blank slide studies. However, what
might be called response availability is also
a factor determining performance, i.e., there
are many occasions in which only some, if
any, of the possible alternatives occur to the
subject. For example, examination of proto- 
cols obtained in Experiment II of the present 
study revealed that at longer duration inter- 
vals quite frequently subjects merely reported 
(correctly) the three typed letters; apparently 
they were unable to supply either alternative 
as a guess. In Experiment I, it was not un- 
usual to find that subjects reported only the 
three letters after they had given the incorrect 
alternative as a response, particularly the HF 
alternative to an LF word. Thus it is quite 
reasonable to assume that certain classes of 
stimuli, such as familiar words, are more 
readily available as response possibilities than 
others. Thus two factors may be operating to
varying degrees in the restricted versus un- 
restricted types of experimental arrangements: 
differential response preferences among equally 
available responses and differential response 
availability. 

It remains to be considered
whether the relationship between recognition
thresholds and such variables as word famili- 
arity can be attributed solely to differential 
response probabilities, as Goldiamond has hy- 
pothesized. An attempt was made by Taylor, 
Rosenfeldt, and Schulz (1961) to assess this 
possibility by employing a forced-choice tech- 
nique in which the subject was given a word 
by the experimenter and required to locate 
its spatial position from among a number of
simultaneously presented words. Using this 
technique, which makes the correct response 
available and permits equal distribution of 
any bias for spatial position over various 
classes of materials, these investigators found 
that familiar words were more accurately lo- 
cated than less familiar ones. However, in 
a similar study by Goldstein and Ratleff 
(1961), no significant differences were found 
in accuracy scores as a function of prior 
frequency of exposure to nonsense materials. 

Similar inconsistencies have been reported 
in forced-choice investigations of emotional 
and neutral words. MacIntosh (1960) found 
significant differences in the accuracy of loca- 
tion of her emotional and nonemotional words. 
However, a second forced-choice study, by 
Goldstein, Himmelfarb, and Feder (1962), 
failed to replicate the MacIntosh findings, no 
significant differences emerging between their 
anxiety provoking and neutral words. Overall,
there is some evidence to suggest that, under 
certain conditions at least, a relationship be- 
tween a recognition measure and word fa- 
miliarity and affectivity will be found when 
differential probabilities of usage are con- 
trolled. A similar conclusion concerning 
word frequency was reached by Brown and 
Rubenstein (1961) in a study using auditory 
presentation of stimuli of varying frequency 
bands. These investigators found that the 
slopes of their accuracy curves were steeper
than predicted by a mathematical model 
which assumed that response bias was the 
sole factor determining ease of recognition. 

Such studies disconfirm the nothing-but 
hypothesis concerning the role of differential 
probabilities of usage. The residual effects 
could, of course, be given a perceptual inter- 
pretation (i.e., that one of the effects of fa-
miliarity, affectivity, etc., is to influence what
the subject “sees”). However, once more the
role of partial cues must be considered. As 
the present writer has pointed out elsewhere 
(Taylor et al., 1961), it is possible that other 
things being equal, the amount of information 
necessary for the subject to reconstruct a 
complete stimulus from partial cues may be 
related to the same variables as have been 
found to influence recognition thresholds.²
Procedures, including the forced-choice tech- 
nique, which present stimuli under marginal 
conditions may involve this phenomenon since 
all of them are devices for producing partial 
perceptions, however uncontrolled and acci- 
dental they may be. Such a relationship seems 
particularly likely with the familiarity con- 
tinuum, the subject having had prior experi- 
ence in viewing familiar words under a variety 
of conditions (peripheral vision, different sizes 
and styles, etc.) and thus greater opportunity 
to learn to recognize these stimuli from subtle 
partial cues. 


² Similar statements concerning both the amount
of information necessary to reconstruct a complete
stimulus and the response availability notions dis-
cussed above have been made by Postman (1951)
and more recently by Johnson, Thomson, and Frincke
(1960). Unlike these investigators, however, the
present writer is explicitly proposing that these phe-
nomena may involve the reconstruction of stimulus
patterns from partial cues that are suprathreshold 
and, in this sense, involve hypothetical processes that 
are nonperceptual in nature. 
REPRESSION AND GUILT IN RELATION TO AGGRESSION

College men and women who were opposed to electric shock in research were
instructed to deliver intense shock to others (victims). Before administering
shock ½ the Ss read a positive communication (shock is beneficial) and the
other ½, a negative communication (shock is harmful). There was significantly
less recall of the negative communication. For the same-sex pairs (male S-male
victim and female S-female victim) the positive communication elicited sig-
nificantly more guilt and opposition to shock than did the negative com-
munication; for opposite-sex pairs the results were reversed. These findings
could be explained by cognitive dissonance theory or by an identification
hypothesis.


The present experiment reflects a con-
vergence of traditional concern with ante-
cedents of repression and recent interest in
effects of behavior on cognition (Festinger,
1957). Repression, defined as selective for-
getting of personally relevant cognition, has
been investigated in several ways. The oldest
method is clinical, comparing recall of
pleasant and unpleasant experiences. One
experimental method has been to pair neutral
material with an aversive stimulus; for ex-
ample, nonsense syllables are associated with
electric shock (Belmont & Birch, 1951). A
significant decrement in recall or relearning of
the shock associated material in comparison
to the neutral material may be attributed to
repression. An interesting variation of this
method employs words that are “emotionally
laden” on the basis of an individual’s response
to them. For example, Worchel (1955) se-
lected “repressed” and neutral words on the
basis of the subjects’ response to the Jung
word association test. He found that the
recall of the former was significantly poorer
than recall of the latter.

In previous investigations the subject was relatively passive while 
being exposed to selected stimuli or “treated” in some fashion 
(stressed, failed, interrupted, etc.). The present experiment is 
different in that the effect of the subject’s interpersonal action on 
his subsequent reproduction of relevant cognition was examined. This 
approach was suggested by research on forced compliance, specifically 
a previous study (Brock & Buss, 1962) on cognitive dissonance and 
aggression. 

The present technique involved selecting 
subjects who were on record as being opposed 
to a given activity and then requiring them 
to carry it out. The undesired activity was 
giving electric shock to another person in 
the laboratory, and the cognition to be re- 
produced was a pro- or antishock communica- 
tion read by the subject before he adminis- 
tered shock. Under these conditions the 
negative communication (antishock) was 
considered more likely than the positive to 
arouse unpleasant affect. The positive com- 
munication, on the other hand, would be re- 
assuring and therefore recalled better than the 
negative communication. Furthermore, if the 
communications were credible, the negative 
communication should induce guilt, whereas 
guilt over hurting another person might be 
minimized by the positive communication. 
Hence, greater expression of guilt was 
expected by subjects reading the negative 
than by subjects reading the positive com- 
munication. 

METHOD 


Subjects 

The population consisted of undergraduate students 
at the University of Pittsburgh, and the following 
item was used to select subjects for the present 
sample. “It has sometimes been necessary for scien- 
tific purposes to use electric shock on men and 
women. As regards doing this, I am personally 
opposed.” Approximately 30% agreed with the item 
and 81 of these were recruited, 47 women and 34 men. 
Procedure 


Experimental group. Subjects were placed in a 
situation involving the administration of electric 
shock to another person. The apparatus, an “aggression 
machine,” is described fully by Buss (1961). 
Briefly, the experimenter tells the subject that he 
is to play the role of an experimenter and that he 
is to teach another subject a concept. This “subject” 
is an experimental accomplice, hereafter called the 
victim. The subject is shown how to present stimuli 
and record responses. Whenever the victim makes 
an incorrect response, the subject shocks him. The 
victim makes a programed series of responses such 
that 24 shocks are to be given for incorrect responses 
in the 50-trial “learning series.” The victim does not 
receive the shock but gasps and groans as though 
he did. Instructions preclude communication and a 
barrier prevents the subject from seeing the victim, 
who records the intensity of the shock he is 
ostensibly receiving. 

There are 10 shock buttons, and at the start of 
the experiment the subject receives shock from 
Buttons 1, 2, 3, and 5 in order to know how much 
his victim will receive. Buttons 1 and 2 give mild 
shock; Button 3 is painful, and Button 5 is extremely 
noxious. The shock is delivered to the fourth finger, 
and it never reaches a dangerous level. 

The subject was instructed by the experimenter 
to “use only Buttons 7 and 8 for maximum effec- 
tiveness.” Thus the punishment ostensibly delivered 
to the victim was severe. This punishment is 
aggression, the delivery of a noxious stimulus to another 
person. Although this was a forced compliance 
situation, in which the subject had no choice but to 
aggress, three subjects refused to participate because 
of their extreme opposition to giving electric shock. 

Just before leaving to bring in the victim, the 
experimenter handed the subject a communication, 
telling him that it concerned the experiment and 
that he should read it very carefully. The com- 
munication was either positive or negative; the 
wording was otherwise identical. The negative al- 
ternative in each item is in brackets, and the positive 
alternative is usually in italics. 


The Medical Association Symposium on 
Electric Shock 


In July, 1960, a special committee was appointed 
by the Medical Association to evaluate the effects 
of electric shock on adult men and women. A 
summary of the findings of this committee is 
presented below. In this summary an attempt has 
been made to eliminate technical terms. 

Human beings can [cannot] adapt to repeated 
administration of even large [slight] amounts of 
electric shock. When the shock intensity remains 
the same during a given period of shock adminis- 
tration, the physiological and psychological reac- 
tions to the shock become more and more mild 
[severe]. 

The dramatic results of electro-shock therapy 
consistently support the conclusion that repeated 
electrical stimulation beneficially [adversely] affects 
all areas of mental and physical functioning. 

Although there may be some [no] obvious 
immediate effects of shock on the person receiving 
it, no serious [often] damage has ever been [is] 
discovered weeks or months after the shock episode 
has taken place. 

Studies of electric shock in the electrocution of 
criminals show that no real pain is ever felt—the 
advent of death is so swift [the pain is unspeak- 
ably severe and lasts a long time before death 
finally occurs]. 

That some people are afraid of electric shock 
was considered by the committee to be an uncon- 
structive and unnecessary fear [a constructive and 
necessary fear]. 

A systematic investigation of shock was begun 
by the research department of the Medical School 
on hundreds of individuals. The experiment, in 
which “line” voltages (amount used in homes and 
offices) were employed, had to be continued longer 
than intended when many persons returned to 
again experience the exhilaration, “mental 
stimulation,” “kicks,” that increased for them when they 
were given the shocks [to be discontinued when 
many persons reported burns, heart palpitation, 
and “nervousness” which persisted or began to 
increase after the shocks were given]. The 
interesting [side] effects were found even at very high 
[low] voltages. 

The committee concluded its report to the 
Medical Association by saying: “Scientific investi- 
gators who administer electric shock to human 
beings may increase the knowledge of mankind in 
a way and to an extent [be causing damage, the 
magnitude of which has] not yet [been] realized." 


Immediately after the subject ran the victim on 
the aggression machine, the following sequence 
occurred. The subject was given the same item used 
to select those opposed to using electric shock; 
this time he used a rating scale that ranged from 
“extremely opposed” to “extremely in favor” in six 
10-point steps. Then the subject rated his guilt over 
giving shocks on a similar scale that ranged from 
“none at all” to an “extremely great amount.” 
Subsequently, he was asked to reproduce in writing 
as many major points of the communication as 
possible. The time between reading the communication 
and reproducing it was 15 minutes. Finally, he 
was presented with 10 items and asked to indicate 
those he recognized from the communication he read 
earlier. Five of the items actually were in the 
communication, and five were not; these latter were 
plausible, for example, “a study by the Medical 
School showed that electric shock used on some 
species of mammals and birds has especially 
damaging effects.” 


Control Group 


The control subjects were given the positive or 
the negative communication in a classroom setting 
and allowed approximately 2 minutes to read it. 
After an interval of 15 minutes, the same time lapse 
as for experimental subjects, they were tested for 
recall. 


TABLE 1 

MEAN RECALL SCORES OF EXPERIMENTAL 
AND CONTROL SUBJECTS 

                      Control         Experimental 
Communication       Men   Women       Men   Women 
Positive            3.4    3.4        3.3    3.7 
Negative            3.0    3.1        2.6    1.8 


Design 


In the experimental group there were three 
independent variables: nature of the communication 
(positive or negative), sex of subject, and sex of 
victim. The subjects were divided into equal groups 
for each independent variable, yielding a 2 X 2 X 2 
design. There were five subjects per cell and a total 
experimental N of 40. 

In the control group there were two independent 
variables: nature of the communication and sex of 
subject. Because of the classroom procedure, it was 
not possible to obtain equal cell Ns. Ten women 
received the positive, and 17, the negative 
communication; 9 men received the positive, and 5, the 
negative communication. 
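
The cell counts above can be tallied in a short sketch. It is illustrative only: the variable names are hypothetical, and the eight-cell layout assumed for the combined recall analysis (group by communication by sex of subject) is an inference from the design rather than something the article states.

```python
from itertools import product

communications = ("positive", "negative")
sexes = ("male", "female")

# Experimental group: 2 x 2 x 2 design (communication, sex of subject,
# sex of victim) with five subjects in every cell.
experimental_cells = {cell: 5 for cell in product(communications, sexes, sexes)}
experimental_n = sum(experimental_cells.values())

# Control group: unequal cell sizes as reported (communication, sex of subject).
control_cells = {
    ("positive", "female"): 10,
    ("negative", "female"): 17,
    ("positive", "male"): 9,
    ("negative", "male"): 5,
}
control_n = sum(control_cells.values())

# Within-cell degrees of freedom = total N minus number of cells.
experimental_within_df = experimental_n - len(experimental_cells)

# Assumption: the combined recall analysis ignores sex of victim, giving
# 2 groups x 2 communications x 2 sexes of subject = 8 cells.
combined_n = experimental_n + control_n
combined_within_df = combined_n - 2 * 2 * 2

print(experimental_n, control_n, combined_n)
print(experimental_within_df, combined_within_df)
```

Under these assumptions the totals are 40 experimental and 41 control subjects (81 in all), with 32 and 73 within-cell degrees of freedom, which agrees with the error degrees of freedom attached to the F tests in this article.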


RESULTS 
Recall 


The written recall of the subjects was scored 
by both authors, with infrequent disagreements 
resolved by discussion. The possible 
range was zero to seven items recalled, and 
the actual range was zero to six. 

It was necessary to determine whether the 
communications themselves led to differential 
recall. The appropriate data are on the left 
side of Table 1. The difference between recall 
of the positive versus the negative communication 
is small for both men and women, and 
an analysis of variance confirmed the lack of 
difference (F = 1.3, p > .20). 

When an analysis of variance was performed 
on all the data in Table 1, two 
significant Fs were found. There was a 
significant Communication effect (F = 11.64, 
df = 1/73) and a significant 
Groups X Communication effect (F = 3.94, 
p < .06, df = 1/73).² The decrement in recall 
of the negative communication by the experimental 
subjects was evidently enough to yield 
a significant Communication effect, whereas 
there was none for control subjects alone. 
This repression effect was sufficient to yield a 
significant interaction. 

² Since subclass frequencies were unequal in 
comparisons involving the control subjects, an 
approximation employed by Walker and Lev (1953, p. 381) was 
used. 


TABLE 2 

MEAN RECOGNITION SCORES 

                          Men                      Women 
Communication     Male victim  Female victim  Male victim  Female victim 
Positive              7.2          8.0            8.8          9.2 
Negative              8.6          8.8            7.4          7.2 


Separate analyses of variance were com- 
puted for the positive and negative com- 
munications. The difference between experi- 
mental and control groups was significant 
only for the negative communication. Thus 
engaging in aggression led to a decrement 
in recall only when the recalled. material in- 
volved the punitive and harmful aspects of 
such aggression. 

Next, an analysis of variance was performed 
on the recall data of only the experimental 
subjects. This analysis included three 
main classifications (Communication, Subject, 
and Victim³) and their various interactions. 
Again the F for Communication was 
significant (p < .005), showing a clear 
repression effect for experimental subjects. No 
other F had p < .10. 


Recognition 

The recognition sheet contained 10 items, 
5 from the communication. The subject’s 
score was the number of items checked that 
were actually in the communication plus the 
number not checked that were not in the 
communication. Thus the possible range was 
0 to 10. 
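
The scoring rule just described can be written out as a minimal sketch. The function name, the item numbering, and the sample answer sheet are hypothetical; only the rule itself (items correctly checked plus items correctly left unchecked) comes from the text.

```python
def recognition_score(checked, in_communication, all_items):
    """Count correct decisions on the recognition sheet.

    checked          -- items the subject marked as recognized
    in_communication -- items that really appeared in the communication
    all_items        -- every item printed on the sheet
    """
    # Items checked that were actually in the communication.
    hits = len(checked & in_communication)
    # Items left unchecked that were not in the communication.
    correct_rejections = len((all_items - checked) - in_communication)
    return hits + correct_rejections


# Hypothetical sheet: items 1-5 were in the communication, 6-10 were not.
all_items = set(range(1, 11))
in_communication = {1, 2, 3, 4, 5}

# A subject who checks three true items and one plausible false item.
example_score = recognition_score({1, 2, 3, 6}, in_communication, all_items)
print(example_score)  # 3 hits + 4 correct rejections = 7
```

A subject who checks nothing at all would still score 5 under this rule, since all five false items are correctly left unchecked.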

The control subjects were not tested for 
recognition; the recognition data for the 
experimental subjects are presented in Table 
2. The women’s results were consistent with 
the recall data, i.e., better recognition of 
positive than of negative items. The men’s 
data were in the opposite direction. A 
three-variable analysis of variance (Communication, 
Subject, Victim) confirmed these 
trends. The only significant F was for 
Communication X Subject (F = 9.3, p < .005, 
df = 1/32). 

³ Subject means sex of subject, and Victim means 
sex of victim. This usage prevails throughout the 
article. 

Clearly, the women’s recall and recognition 
scores were related, and the men’s were not. 
The recall-recognition correlations were .77 
for women and .34 for men, with only the 
women’s correlation being significant. There 
was no obvious explanation for the lack of 
relationship between the men’s recall and 
their recognition. 


Guilt 

The guilt data and analysis of variance 
are presented in Table 3. The mean scores 
ranged from 18.4 (“a very slight amount”) 
to 42.6 (“a very great amount”). 

The absence of a significant Communica- 
tion effect was noteworthy because it was 
expected that the negative communication 
would lead to more guilt than the positive 
communication. There were significant effects 
for both Subject and Victim. The women 
were guiltier than the men, and female 
victims aroused more guilt. These findings 
were not considered surprising in light of 
sex typing in our society. 
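
As a rough check on the main effects, each F in Table 3 is the effect mean square divided by the within-cell mean square. The sketch below uses the three main-effect mean squares and the within mean square as they appear in Table 3; the variable names are illustrative.

```python
# Mean squares for the main effects of the guilt analysis (Table 3).
MS_WITHIN = 110.6  # within-cell (error) mean square, df = 32

mean_squares = {
    "Communication": 211.6,
    "Sex of subject": 639.9,
    "Sex of victim": 1060.8,
}

# F ratio for a one-df effect: effect mean square over error mean square.
f_ratios = {name: ms / MS_WITHIN for name, ms in mean_squares.items()}

for name, f in f_ratios.items():
    print(f"{name}: F = {f:.1f}")
```

Rounded to one decimal, the ratios come to 1.9, 5.8, and 9.6, matching the F column for these three effects.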

The significant second-order interaction 
can be understood if the men's and women's 
data in Table 3 are examined separately. For 
male subjects with a male victim, the positive 
communication led to more guilt than 
the negative communication; with a female 
victim the positive communication led to 
less guilt. For female subjects with a male 
victim, the positive communication led to 
less guilt than the negative communication, 
but with a female victim the result was 
opposite. 


TABLE 3 

GUILT ABOUT USING SHOCK 

                          Men                      Women 
Communication     Male victim  Female victim  Male victim  Female victim 
Positive             26.4         26.0           20.4         42.6 
Negative             18.4         37.8           39.2         39.4 

Analysis of Variance 

Source                   df        MS        F 
Communication (C)         1      211.6      1.9 
Sex of subject (S)        1      639.9      5.8* 
Sex of victim (V)         1     1060.8      9.6** 
C X S                     1       72.9 
C X V                     1       10.0 
S X V                     1        1.6 
C X S X V                 1     1026       10.0** 
Within                   32      110.6 
Total                    39 

* p < .025. 
** p < .005. 


TABLE 4 

MEAN GUILT SCORES IN TERMS OF 
THE PERSONAL INTERACTION 

                     Subject and victim 
Communication      Same sex    Opposite sex 
Positive             34.5         23.2 
Negative             28.9         38.5 


These complex findings may be simplified 
by grouping the Subject-Victim interaction 
into same-sex (male subject-male victim and 
female subject-female victim) and opposite-sex 
(male subject-female victim and female 
subject-male victim). The rearranged data 
are presented in Table 4. This table shows 
clearly that in the same-sex interaction a 
positive communication led to more guilt than 
a negative communication. This finding was 
surprising because the negative communication 
should arouse guilt, whereas the positive 
communication should not. In the opposite-sex 
interaction this was precisely what 
occurred: more guilt after the negative 
communication than after the positive 
communication. 
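
The regrouping into same-sex and opposite-sex pairings can be reproduced from the cell means of Table 3. The following is a sketch with hypothetical names; because every cell holds five subjects, a simple average of the two contributing cell means is assumed to be adequate.

```python
# Mean guilt scores from Table 3, keyed by
# (communication, sex of subject, sex of victim).
guilt_means = {
    ("positive", "male", "male"): 26.4,
    ("positive", "male", "female"): 26.0,
    ("positive", "female", "male"): 20.4,
    ("positive", "female", "female"): 42.6,
    ("negative", "male", "male"): 18.4,
    ("negative", "male", "female"): 37.8,
    ("negative", "female", "male"): 39.2,
    ("negative", "female", "female"): 39.4,
}


def pairing_mean(communication, same_sex):
    """Average the two cells that share a communication and a pairing type."""
    cells = [
        mean
        for (comm, subject, victim), mean in guilt_means.items()
        if comm == communication and (subject == victim) == same_sex
    ]
    return sum(cells) / len(cells)


regrouped = {
    (comm, label): round(pairing_mean(comm, same), 1)
    for comm in ("positive", "negative")
    for label, same in (("same sex", True), ("opposite sex", False))
}

for key, value in regrouped.items():
    print(key, value)
```

The four averages (34.5, 23.2, 28.9, and 38.5) agree with the entries of Table 4.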


Opposition toward Using Shock 


The mean scores and analysis of variance 
of attitude toward using shock are presented 
in Table 5. Note that lower scores mean 
greater opposition. Since the correlation 
between opposition to shock and guilt was 
substantial (r = -.45, p < .01), the results 
should be similar. As with guilt, a female 
victim elicited significantly more opposition 
than a male victim. There was no significant 
sex difference for subjects, which may be 
attributed to the women's lack of opposition 
with male victims. 




The Subjects X Victim interaction rep- 
resents the same reversal from the opposite- 
sex to the same-sex interaction that occurred 
on the guilt variable, regardless of the nature 
of the communication. The importance of 
the Subject-Victim interaction is further en- 
hanced by the significant (p < .06) triple 
interaction. As with guilt, the negative com- 
munication led to more opposition than the 
positive communication for an opposite-sex 
personal interaction. Also, as with guilt, when 
the subject and victim were of the same sex, 
the negative communication unexpectedly led 
to less opposition than the positive 
communication. 
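
With equal cell sizes, each one-degree-of-freedom effect in a 2 X 2 X 2 design can be recovered from the cell means as a plus-and-minus contrast. The sketch below applies this to the attitude means as read from Table 5; the +1/-1 coding, the function name, and the dictionary layout are illustrative assumptions, and the within mean square (72.8) is taken from the table.

```python
# Mean attitude scores from Table 5, keyed by
# (communication, sex of subject, sex of victim).
means = {
    ("pos", "M", "M"): 20.2, ("pos", "M", "F"): 29.4,
    ("pos", "F", "M"): 31.2, ("pos", "F", "F"): 13.8,
    ("neg", "M", "M"): 26.2, ("neg", "M", "F"): 21.6,
    ("neg", "F", "M"): 27.4, ("neg", "F", "F"): 17.8,
}
N_PER_CELL = 5
MS_WITHIN = 72.8

# Arbitrary +1/-1 coding of the two levels of each factor.
SIGN = {"pos": 1, "neg": -1, "M": 1, "F": -1}


def effect_mean_square(factor_positions):
    """Mean square of a one-df effect from cell means (equal n per cell).

    factor_positions -- which key positions take part in the effect:
    0 = communication, 1 = sex of subject, 2 = sex of victim.
    """
    contrast = 0.0
    for cell, mean in means.items():
        weight = 1
        for position in factor_positions:
            weight *= SIGN[cell[position]]
        contrast += weight * mean
    # SS = n * L**2 / (sum of squared weights); the weights are all +/-1.
    return N_PER_CELL * contrast ** 2 / len(means)


effects = {
    "C": (0,), "S": (1,), "V": (2,),
    "C X S": (0, 1), "C X V": (0, 2), "S X V": (1, 2),
    "C X S X V": (0, 1, 2),
}
anova = {}
for name, positions in effects.items():
    ms = effect_mean_square(positions)
    anova[name] = (ms, ms / MS_WITHIN)
    print(f"{name:10s} MS = {ms:7.1f}  F = {ms / MS_WITHIN:4.1f}")
```

Computed this way, the Sex of subject X Sex of victim mean square is about 624.1 (F near 8.6), the sex of victim mean square about 313.6 (F near 4.3), and the triple interaction about 291.6 (F near 4.0). Small departures from the printed analysis are to be expected because the cell means are rounded to one decimal.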


DISCUSSION 


The main finding of this experiment was 
a clear repression effect; recall of the nega- 
tive communication was significantly poorer 
than recall of the positive communication. 
There was no repression of the negative com- 
munication in the control group. Being op- 
posed to using electric shock was not in 
itself sufficient to cause repression. Selective 
forgetting occurred only when subjects 
carried out behavior (in this instance, ag- 
gression) relevant to the opposed attitude. 
Although not derivable uniquely from 
Festinger’s (1957) theory, the demonstrated 
avoidance of cognition incongruent with one’s 
behavior was considered consistent with the 
dissonance model. 

In previous studies, being confronted with 
material contrary to one’s attitude was 
sufficient to cause selective forgetting. For 
example, Levine and Murphy (1943) found 
poorer recall of pro-Soviet material than of 
anti-Soviet material by anticommunist 
subjects. In the present study a negative 
attitude toward electric shock did not lead to 
repression unless the subject actually used 
shock. An anticommunist attitude is no doubt 
of greater importance and personal relevance 
than an antishock attitude. Perhaps with 
attitudes of lesser importance and salience, 
repression occurs only when the individual 
engages in behavior relevant to the attitude. 

If this is true, the present approach should 
be explored with socially relevant attitudes. 
For example, Clark (1953) found that 
successful integration was rarely served by 
direct attempts to modify attitudes and 
beliefs. Instead, the situational evocation of 
some form of pro-Negro behavior tended 
to produce the desired cognitive and 
attitudinal restructuring. 


TABLE 5 

ATTITUDE TOWARD USING SHOCK 

                          Men                      Women 
Communication     Male victim  Female victim  Male victim  Female victim 
Positive             20.2         29.4           31.2         13.8 
Negative             26.2         21.6           27.4         17.8 

Analysis of Variance 

Source                   df        MS        F 
Communication (C)         1        1.7 
Sex of subject (S)        1       32.5 
Sex of victim (V)         1      313.6      4.3** 
C X S                     1        2.5 
C X V                     1       22.5 
S X V                     1      624.1      8.6*** 
C X S X V                 1      291.4      4.0* 
Within                   32       72.8 
Total                    39 

* p < .06. 
** p < .05. 
*** p < .01. 


The correlation of guilt and opposition to 
shock was not surprising, but the absence 
of a communication effect for either variable 
was unexpected. Rather, the communications 
interacted with the Subject-Victim relation- 
ship. When the subject and victim were of 
the opposite sex, the negative communica- 
tion caused more guilt and opposition, but 
the positive communication led to more 
guilt and opposition when the subject and 
victim were of the same sex. 

This finding may be accounted for by 
dissonance theory. According to Brehm 
(1960), “. . . Given commitment to disliked 
behavior, the less supportive is additional 
information, the greater the magnitude of 
dissonance and consequent increase in liking" 
(p. 379). In the present study when subjects, 
who were engaging in disliked behavior 
(shocking another person), read the negative 
communication, they should have experienced 
more dissonance and consequent adoption of 
a favorable attitude toward the use of electric 
shock than subjects who read the positive 
communication. The results for same-sex 
interactions were in accord with the disson- 
ance interpretation, but the opposite-sex 
interaction yielded contrary results. Since 
magnitude of dissonance is a function of 
the extent to which inconsistent cognitions 
are perceived as relevant to one another, it 
might be alleged that the relevance criterion 
was only weakly satisfied under opposite-sex 
pairings: the subject perceived less relevance 
between what he was doing and how his 
victim was responding than under same-sex 
conditions. If this post hoc interpretation 
has any validity, revision of dissonance 
theory may be required before it can ade- 
quately account for consequences of forced 
compliance in interpersonal contexts. 

The results may also be explained in terms 
of identification. The findings for the op- 
posite-sex interactions required no special 
hypothesis: the negative communication 
leads to more guilt and opposition because 
it mentions dangerous and undesirable fea- 
tures of behavior that the subject actually 
carries out. What must be explained is the 
contrary findings with the same-sex inter- 
action. When the subject and victim are of 
the same sex, it should be easier for the 
subject to identify and empathize with the 
victim. The positive communication states 
that shock is harmless and relatively painless. 
Yet the subject has already received the 
painful shock and heard the gasps and 
groans of the victim. He would, therefore, 
tend to not accept the positive communica- 
tion, perhaps believing it designed to conceal 
the real pain and danger in administering 
electric shock. Failure to accept the positive 
communication would cause these subjects to 
be more guilty and opposed than those who 
received the negative communication and 
therefore “knew the worst.” 

Such a reaction against the positive com- 
munication should not occur in the opposite- 
sex pairs because the subject would identify 
less when the victim is of the opposite sex. In 
the absence of close identification and 
empathy, there is little reason for the subject 
to doubt the communication. Therefore, in 
the opposite-sex interaction the negative com- 
munication should lead to more guilt than 
the positive communication. 

This interpretation is similar to that made 
by Willis (1960) in explaining his finding 
that “. . . Judges were more lenient in judging 
the attractiveness of own-sex photographs 
than of other-sex photographs” (p. 373). 
Willis assumed that identification with 
members of one’s own sex might produce more 
leniency in judging. Similarly, in the present 
study identification with one’s own sex might 
have led to greater feeling of knowing 
precisely what the victim was undergoing, with 
a consequent rejection of the positive 
communication. Thus the identification 
hypothesis fits the findings of both Willis’ study and 
the present one. 

In any event, the results for same-sex and 
other-sex interactions are interesting in their 
own right. These interpersonal pairings 
should receive more attention than they have, 
especially in light of their relevance to any 
approach to social interaction. The technique 
employed here appears to be a powerful one: 
having individuals engage in behavior rele- 
vant to their attitudes has consequences for 
not only the attitudes themselves but also for 
guilt and memory. This is especially true 
when one person is called upon to aggress 
against another. 
A group of 69 male, long-hospitalized chronic psychotics in an experimental 
rehabilitation unit was tested on a new device, the Hunter problem box, 
prior to their treatment by 1 of 3 variants of social therapy. A summary score, 
the Hunter Process Index, was developed to relate to patients’ postprogram 
adjustment on completion of a social therapy program. The index reflects 
changes in the degree of problem solving rigidity displayed by the patient as 
he moves from a simple, 2-light, single alternation problem to a more 
complex, 4-light, double alternation problem. The Hunter Process Index was 
validated with a group of 71 long-hospitalized, female psychotics who also 
had been exposed to the social therapy approach of the rehabilitation unit, 
and again with a group of 48 male chronic psychotics who had received some 
slight variation in the program. The index scores were predictive of the 
outcome of patients in these groups. Clinical judgments failed to predict 
outcomes. A normal control group in a separate study earned better Hunter index 
scores than did either the released or hospitalized groups of male chronic 
psychotics. 


Deficits in psychomotor performance, reaction 
time, problem solving behavior, and 
other psychometric measures are often 
sufficiently large and reliable to allow useful 
differentiation of acute schizophrenics from 
chronic schizophrenics. Winder (1960), in a 
comprehensive review of studies of psychological 
deficit, summarized the more firmly 
established differences between acute and 
chronic schizophrenic groups. Little is said, 
however, of differentiation among chronic 
schizophrenics. This lack of systematic, or 
even incidental, investigation of differences 
among chronic schizophrenics, or chronic 
psychotics in general, probably mirrors the 
limited social usefulness of such 
discriminations. Most hospitalized chronic psychotics 
characteristically have received nearly 
identical treatment, i.e., routine ward care. The 
problems of differentiation become socially 
meaningful only when differential treatment 
procedures are made available. 

With the recent proliferation of somewhat 
novel social therapy programs for chronic 
psychotics, it became obvious that these 
patients were not a homogeneous group. As 
a result of exposure to treatment innovations 
such as therapeutic communities and social 
interaction treatment programs, some chronic 
psychotics improved dramatically. Yet, some 
remained the same, and a discernible minority 
began to resemble acutely disturbed patients. 
Thus the need to specify and measure the as 
yet unknown variables or dimensions of 
chronic psychosis poses an increasingly 
important research problem. Because of the 
limited availability and relatively high cost 
of the new social therapies, their advent has 
made patient selection and prediction of 
outcome forceful research issues. 

Unfortunately, most of the traditional 
tools that psychologists have used to 
differentiate individuals are inadequate for 
comparisons among long-institutionalized 
chronic psychotics. These tests and 
instruments often rely too heavily on verbal 
facility or require response behaviors that 
chronic psychotics are either unwilling or 
unable to furnish. Diagnostic instruments 
are needed which are applicable to the gen- 
eral population of chronic psychotics, are 
capable of making discriminations among 
this group, and are predictive of future, 
socially meaningful behavior. This paper 
describes the development and reports pre- 
liminary results from one such instrument. 

The technique outlined, the Hunter prob- 
lem box, was developed to measure certain 
cognitive processes. The goal was to tap 
those problem solving processes that underlie 
adaptive modifications of behavior in new 
situations. It was conjectured that if a person 
were unable to modify his perceptual set so 
as to allow solution of test problems varying 
in complexity, then he also would be 
unable to modify his overt behavior sufficiently 
to adjust to treatment program demands. 


METHOD 
Subjects 


The Psychology Department at Philadelphia State 
Hospital was faced with the problem of a lack 
of appropriate predictor instruments when it began 
the process of selecting patients for an experi- 
mental social interaction treatment program. Be- 
cause of the dearth of valid empirical data on 
prognostic indicators, selections from a pool of 
long-term hospitalized chronic psychotics were 
based on a review of the patient’s record and a 
semistructured screening interview by two experi- 
enced clinical psychologists. The screening team 
made certain predictions and ratings, as well as the 
decision to accept or reject the patient. 

The male patients selected for the social inter- 
action treatment project by the screening team 
appeared to be representative of the “better” two- 
thirds of the population of patients found on the 
“chronic” wards of most state hospitals. They 
ranged in age from 19 to 72 years; the median was 
around 43 years. In 90% the primary diagnosis was 
psychosis, 82% were schizophrenic, the majority 
paranoid. In the remaining 10%, the primary diag- 
nosis was chronic brain syndrome. Over 90% of 
the total group had been previously treated in a 
mental hospital. The median length of hospitaliza- 
tion was over 10 years with a range of 1 to 36 
years. Most of the group came from the lower 
socioeconomic segments of the population. Over 
60% never reached high school; only 8% had 
any college or specialized training. The average 
estimated IQ was approximately 95 (WAIS, four 
subtests). Less than half of the patients had ever 
been gainfully employed for any length of time. 
Seventy-two percent had never been married. 
Despite their poor socioeconomic status and the 
length of their hospitalization, these patients 
probably were better adjusted than the average 
chronic patient at this particular hospital. 

Alcoholics, severe behavior disorders, and seri- 
ously brain damaged patients were ruled out. On 
the other hand, patients in good remission were 
also excluded. Those chosen had debilitating motiva- 
tional, psychiatric, or social withdrawal symptoms, 
but were judged to have potential for successfully 
responding to the 9-12 month social therapy pro- 
gram. Of the nearly 2000 male chronic patients 
screened for the unit, approximately 200 met 
criteria for acceptance. 


Experimental Treatment Program 


Patients who were accepted for the program 
were given a battery of nontraditional test 
procedures, and then assigned, on a random basis, to 
one of three social interaction treatment groups or 
to a control condition. All four groups had com- 
parable compositions in regard to age, diagnosis, 
and length of hospitalization. The four groups 
varied in the amount of social interaction included 
in their treatment program. 

At the first level, the control condition, the 
selected patients remained on their regular wards 
and received their usual care. No alterations were 
made in their lives and their status as controls was 
unknown to all but the research workers. 

Patients in the other three groups were trans- 
ferred to the rehabilitation unit where they were 
exposed to variants of social treatment. These 
three groups were provided, respectively, a thera- 
peutic community, a core interaction treatment 
program, and an intensified interaction program. 
The therapeutic community included a physical 
environment approximating that of boardinghouse 
life in the extramural community, a staff with 
positive attitudes towards the treatment potential 
of the chronic patient, and specific treatment goals 
and time limits for patients to achieve these goals. 
The therapeutic community was used both as an 
independent treatment condition, and as the basic 
framework for the two interaction treatment 
programs. 

The core interaction treatment program included, 
in addition to the therapeutic community, a 
program of activities which both demanded interaction 
and provided training in specific social behaviors. 
A total of 10 hours of each patient's week was 
devoted to these programed activities. 

The intensified interaction treatment program, 
which was also conducted within the therapeutic 
community setting, included all the activities offered 
in the core interaction treatment program, plus 
group therapy and patient government. These latter 
activities were used to intensify the socialization 
process. A fuller description of the program and 
the research design is available elsewhere (Sanders, 
Weinman, Smith, Smith, Kenny, & Fitzgerald, 1962). 
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Apparatus 


One of the test devices included in the battery 
given to all experimental patients was an apparatus 
which has come to be known as the Hunter problem 
box.⁴ This device bears certain resemblances to
an apparatus described by Jensen (1958). The
design of his apparatus stimulated and helped
translate into action⁵ the investigators' thinking.
Jensen’s emphasis, however, was on securing estimates
of the absolute level of performance on each
of his problems, i.e., he was interested in comparing
the number of errors, and similar indices of
accuracy, from subject to subject. The approach of
the present study was to investigate improvement
or changes in performance from problem to problem
within each subject.

The Hunter problem box is an enclosed rectangular
metal box approximately 17" × 13" × 3".
It is divided into examiner and patient halves by
a metal shield that slants upwards and towards the
examiner at a 45° angle, and thus protects the
controls, on the examiner's side, from the view of 
the patient. The patient is confronted, on his side, 
by a row of five small red bulbs. Each bulb has 
a small button mounted 2" in front of it. Any one of 
the buttons may be activated by the examiner's con- 
trols so that its depression by the patient will 
switch on the red light behind it. The examiner, 
by means of a selector switch, may alternate 
among the five bulbs the one bulb that can be 
turned on by the patient. For example, the examiner 
may activate Button Number 2 in the row of 
five. The numbers themselves appear only on the 
examiner’s side. The patient is then given the task of
finding the button that turns on a light. He may,
in trying to do so, push Buttons 3, 4, and 5
before pushing Button 2 and meeting success. In
that case, he has made three errors on that
particular trial. More complex problems may be
used. For example, the patient may be required
to learn the sequence, Button 2, then Button 5.
Bulb 5 will not go on until the patient has suc- 
ceeded in lighting Bulb Number 2. Long sequences 
of single alternations may be used, or double 
alternation problems may be arranged, for example, 
1, 1, 4, 4. Once brief instructions have been given
to the patient, the task becomes essentially nonverbal.

⁴ The name was chosen to commemorate the late
Walter S. Hunter of Brown University, who did
pioneer work on the single and double alternation
that this device utilizes.

⁵ Peter Janicki of the Psychological Laboratory
at Princeton University took the rough plans and
built, and later revised, the present Hunter problem
box.


Testing Procedure


A patient was brought into the testing room and
seated in front of a small table containing the
Hunter problem box. The examiner sat behind the
table, and, after rapport was established, gave the
following instructions:


One of these black buttons [pointing to the row 
of five buttons facing the patient] will turn on a 
red light when you press it down. Your job is 
to find out which button turns on the light each 
time. Try to make as few mistakes as you can. 

After you have found the button which turns 
on the light, wait until the light goes off before 
trying again. Push only one button at a time. 


The first problem, the Sequence Buttons 2 and 
5, was then begun. Every time the patient pushed 
an incorrect button it was counted as an error.
The total sequence of button depressions that 
terminated with lighting the correct bulbs, con- 
stituted a trial. After the patient had succeeded 
in getting all the correct bulbs lit, the first bulb of 
that problem was again activated, in order to 
start the second trial. The patient continued with 
a problem until he had made no errors on 2 
successive trials, or until he had completed 10 
trials. When either of these points was reached, 
the examiner said, 


All right, now we will start over again. See how
well you can do this time. 


In this fashion a total of four problems was pre- 
sented. They were, in terms of the bulb position 
numbers: A, 2-5; B, 4-2-3; C, 5-1-4-2; D, 4-4-2-2. 

A record was kept of the number of errors made 
on each trial, the number of trials taken on each 
problem, and the number of aberrant responses 
made to each problem. An aberrant response was 
scored when a patient depressed for a second (or 
more) times an already depressed incorrect button, 
without yet having tried all five buttons. In ad- 
dition to these raw scores, a record was kept of 
total and derived scores, including a tally of 
repeated error patterns. The latter was scored when, 
for example, a patient earned an error score of 
3-4-2 (i.e., three errors before getting the first
button of the problem, four errors before getting
the second button, and two errors before getting
the third button) and then went on to repeat,
identically, this error sequence in the next trial.
Such a repetition of the numerical sequence of 
errors was labeled a repeated error pattern and 
recorded for each of the four problems. 
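
The scoring rules described above can be illustrated with a short sketch. It is not the investigators' own scoring procedure: the function names are hypothetical, and two points the text leaves open are treated as assumptions. First, the buttons "already tried" are tracked separately for each target button in the sequence. Second, two successive errorless trials are not counted as a repeated error pattern.

```python
from typing import List, Sequence, Tuple


def score_trial(presses: Sequence[int], targets: Sequence[int],
                n_buttons: int = 5) -> Tuple[List[int], int]:
    """Return (errors before each target button, aberrant responses) for one trial.

    Assumption: the set of tried buttons is reset each time a target is found.
    """
    errors: List[int] = []
    aberrant = 0
    step = 0            # index of the target button currently being sought
    step_errors = 0     # incorrect presses made while seeking that target
    tried: set = set()  # incorrect buttons already pressed for this target
    for button in presses:
        if step == len(targets):
            raise ValueError("presses continue after the trial was completed")
        if button == targets[step]:
            errors.append(step_errors)
            step += 1
            step_errors = 0
            tried = set()
            continue
        step_errors += 1
        # Aberrant response: an incorrect button is pressed again before
        # all of the buttons have been tried.
        if button in tried and len(tried) < n_buttons:
            aberrant += 1
        tried.add(button)
    if step != len(targets):
        raise ValueError("trial ended before all target bulbs were lit")
    return errors, aberrant


def count_repeated_error_patterns(trial_errors: Sequence[Sequence[int]]) -> int:
    """Count trials whose error sequence repeats that of the preceding trial.

    Assumption: identical errorless trials are not counted.
    """
    count = 0
    for previous, current in zip(trial_errors, trial_errors[1:]):
        if list(previous) == list(current) and any(current):
            count += 1
    return count


def criterion_reached(trial_errors: Sequence[Sequence[int]],
                      max_trials: int = 10) -> bool:
    """True after 2 successive errorless trials or after max_trials trials."""
    if len(trial_errors) >= max_trials:
        return True
    return (len(trial_errors) >= 2
            and not any(trial_errors[-1])
            and not any(trial_errors[-2]))
```

For Problem A (2-5), a patient who pushes Buttons 3, 4, and 5 before Button 2 and then pushes Button 5 would be scored `score_trial([3, 4, 5, 2, 5], [2, 5])`, which yields three errors on the first button, none on the second, and no aberrant responses.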


Criterion Measure 


Several types of criteria were used to evaluate 
the postprogram condition of the experimental 
patients. However, not all of these criteria were 
available for all groups, and several of the in- 
dicators of final condition were designed for 
specific purposes, for example, measures of post- 
program social interaction. The criterion adopted 
for the present study was disposition, i.e., leaving
or remaining in the hospital at the end of the
program. Although disposition may be only a gross
indicator of psychiatric adjustment, such data do
have considerable social relevance, and what they
lack in measurement precision, they may partially 
compensate for through their personal, economic, 
and social significance. Approximately 50% of all 
patients who underwent the social treatment pro- 
grams left the hospital at the end of their programs. 

On the rehabilitation unit, the decision to release 
a patient from the hospital was made by the unit 
psychiatrist at a regularly scheduled staff meeting. 
The case and recommendations were presented
by the psychologist in charge of that patient’s 
cottage, but the decision remained with the psy- 
chiatrist. None of the test data was available to 
the psychiatrist. In the rest of the hospital, the 
release decision was made in a staff presided over 
by the clinical director and including the unit’s 
psychiatrist. Thus, though our control patients were 
housed in several buildings, the criterion for their 
release appeared to be similar to that used in the 
rehabilitation unit. Additional safeguards were 
present in that the unit psychiatrist was not 
intimately familiar with the details of the three 
social interaction treatment programs, and the 
clinical director and his staff did not know the 
identity of the ward controls. 


Development of the Hunter Process Index 


As part of the larger research design, ratings of
both preprogram and postprogram adjustment were
secured from the psychologists who served as administrators
for each of the three experimental treatment
units. The postprogram ratings were used as an
interim criterion in the development of the Hunter
Box Process Index. It should be noted that this
criterion, while not perfectly correlated with disposition,
is far from independent of it. The postprogram
ratings were made after the disposition decision.


TABLE 1

DERIVATION OF THE HUNTER PROCESS INDEX FROM RAW DATA

                                                 Number         Score

A. Total number of repeated error patterns       0               -1
                                                 2                1
                                                 3-6              2
                                                 7 or more        5

B. Error patterns (Problem D - Problem A)        -1 or less       0
                                                 0                1
                                                 +1 or more       5

C. Error patterns (Problem C - Problem A)        -1 or less       0
                                                 0                1
                                                 +1 or more       5

D. Number of trials (Problem D - Problem A)      -1 or less       0
                                                 0                1
                                                 +1 or more       5

E. Number of trials (Problem C - Problem A)      -1 or less       0
                                                 0                1
                                                 +1 or more       3

Note. Index = Score A + B + C + D + E.

Preliminary study of the distributions of the 
various raw scores from the Hunter problem box 
indicated that none of the 22 basic recorded scores 
seemed to differentiate among groups constituted on 
the basis of their rated postprogram adjustment.
Although the patients who were judged to be better 
adjusted postprogram had performed slightly better 
on most scores in the preprogram Hunter box testing 
than had the poorly adjusted patients, the extreme 
variability in the scores minimized the chance of 
useful differentiation. 

Further experience with the instrument seemed to 
indicate that the salient factor was not the absolute 
level of performance on any of the scoring variables, 
but rather the problem solving process of the patient 
as he went from Problem A, the 2-light single 
alternation problem, to Problems C and D, the 
4-light single and double alternation problems. The 
sicker patients, without regard to initial level of 
performance on the task, seemed to perseverate in 
their approach, and to become stereotyped and rigid, 
i.e., their performance, whether adequate or poor on
Problem A, seemed to remain at that level for the 
subsequent problems. The better adjusted patients, 
on the other hand, seemed to improve on most scores 
as they progressed from Problem A to D, despite 
the increasing complexity of the task. This improve- 
ment appeared relatively independent of the absolute 
level of initial performance. The data suggested that 
the absolute value of the scores might be related 
to intelligence or some general problem solving 
ability, while the improvement in scores from 
problem to problem might be related to the ability 
to modify one's approach or adapt to the task. 
This phenomenon, which was, perhaps, a demonstra- 
tion of flexibility, may be more closely related to 
psychiatric factors than to intellectual factors. 

Scores on three types of performance measures 
were secured for each problem. These were: the 
number of errors, the number of trials, and the 
number of repeated error patterns. The differences 
between scores on Problems A and C and between 
Problems A and D were computed for each of the 
three measures. Each of these six sets of difference
scores was correlated with the criterion rating of
postprogram adjustment. The differences between A
and C, and between A and D, on number of trials
and number of repeated error patterns bore promising
relationships to this interim criterion. An index was
derived which combined these four scores plus a
fifth score, the total number of repeated error
patterns. The total number of repeated error patterns
was included because it was the only measure that
directly tapped rigidity of performance on all four
problems.

The five scores were weighted on the basis of
their apparent contribution to differentiation among
the patients, and then additively combined. The
resulting index, the Hunter Process Index, varies
from 1 to 23 with the low numbers indicating
favorable prognosis. The actual technique for
deriving the Hunter Process Index from the raw
data is given in Table 1.
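
The additive combination can be sketched as follows. This is an illustration only, with hypothetical function names. It assumes that Components B through E of Table 1 score a change of -1 or less as 0, no change as 1, and a change of +1 or more as 5 (3 for Component E). Score A is taken as an input to be looked up from Table 1 by the user rather than computed here.

```python
from typing import Mapping


def difference_score(later: int, problem_a: int, max_score: int = 5) -> int:
    """Score the change from Problem A to a later problem (C or D).

    Assumed reading of Table 1: improvement (-1 or less) scores 0,
    no change scores 1, and deterioration (+1 or more) scores max_score.
    """
    diff = later - problem_a
    if diff <= -1:
        return 0
    if diff == 0:
        return 1
    return max_score


def hunter_process_index(score_a: int,
                         patterns: Mapping[str, int],
                         trials: Mapping[str, int]) -> int:
    """Sum Score A and Components B through E.

    score_a  -- Component A, looked up from Table 1 by the caller
    patterns -- repeated error patterns on Problems "A", "C", and "D"
    trials   -- number of trials taken on Problems "A", "C", and "D"
    """
    b = difference_score(patterns["D"], patterns["A"])
    c = difference_score(patterns["C"], patterns["A"])
    d = difference_score(trials["D"], trials["A"])
    e = difference_score(trials["C"], trials["A"], max_score=3)
    return score_a + b + c + d + e
```

Under these assumptions a patient with the highest Score A (5) who deteriorates on every comparison receives 5 + 5 + 5 + 5 + 3 = 23, which agrees with the upper limit of the index given in the text.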

Hunter process indices were then computed for 
all male subjects who had been given this test pre- 
program, and who had remained in the experimental 
units until they had completed their programs and 
received postprogram adjustment ratings. Eighty- 
seven male chronic psychotics remained after elimi- 
nating untested or unrated patients and early trans- 
fers from the program. Since a comparison of the 
predictive efficiency of the Hunter Process Index 
with the preprogram ratings of adjustment was to 
be made, 18 more patients, without preprogram 
ratings, had to be eliminated. The remaining group 
of 69 patients had received the preprogram Hunter 
box test, the preprogram ratings of adjustment, one 
of three experimental social treatment programs 
for 9-12 months, the postprogram ratings of 
adjustment, and a final disposition from the unit. 

The preprogram Hunter Process Index was correlated
with the interim criterion, postprogram
ratings of adjustment, for 87 male patients. Pearson
r = .45, p < .0001.
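
The significance of a Pearson correlation of this kind is conventionally evaluated by converting r to a t statistic with N - 2 degrees of freedom. The sketch below shows that standard conversion; it is offered as an illustration and is not taken from the original analysis. The function name is hypothetical, and the coefficient is read here as r = .45.

```python
import math
from typing import Tuple


def t_from_r(r: float, n: int) -> Tuple[float, int]:
    """Convert a Pearson r based on n pairs to (t, degrees of freedom)."""
    df = n - 2
    t = r * math.sqrt(df / (1.0 - r * r))
    return t, df


t, df = t_from_r(0.45, 87)
print(f"t = {t:.2f} with {df} df")
```

For r = .45 and N = 87 the conversion gives a t of roughly 4.6 with 85 degrees of freedom, which is in keeping with the very small probability reported.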

In order to compare the Hunter Process Index
with the early clinical ratings of adjustment, each
of these was correlated with the disposition criterion.
Unit disposition was of two types, release from the
hospital or return to a ward. Because of the dichotomous
nature of this criterion, biserial correlations
were computed for the 69 male patients involved in
this comparative study. The biserial correlation between
the Hunter Process Index and postprogram
disposition was r_b = .47, p < .0001. The biserial correlation
between the early clinical ratings of adjustment
and postprogram disposition was r_b = .16,
which is not significant. The mean score of the
“released from the hospital group” was compared
with the mean score of the “return to the ward
group,” for each of the two measures. In the case
of the Hunter Index, t = 3.27, p = .0005. For the
preprogram clinical ratings the means did not differ
significantly (t = 1.05).
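
For readers unfamiliar with the biserial coefficient, a minimal sketch of the textbook formula follows. It is not the investigators' own computation. The function name and the coding of disposition (1 for one outcome, 0 for the other) are assumptions, and the standard deviation is taken over all cases as a population value.

```python
from statistics import NormalDist, mean, pstdev
from typing import Sequence


def biserial_r(scores: Sequence[float], group: Sequence[int]) -> float:
    """Biserial correlation between a continuous score and a 0/1 criterion.

    Uses r_b = ((M1 - M0) / s) * (p * q / y), where p and q are the
    proportions in the two groups and y is the ordinate of the unit
    normal curve at the point dividing it into areas p and q.
    """
    ones = [s for s, g in zip(scores, group) if g == 1]
    zeros = [s for s, g in zip(scores, group) if g == 0]
    if not ones or not zeros:
        raise ValueError("both criterion groups must contain cases")
    p = len(ones) / len(scores)
    q = 1.0 - p
    unit_normal = NormalDist()
    y = unit_normal.pdf(unit_normal.inv_cdf(p))
    return (mean(ones) - mean(zeros)) / pstdev(scores) * (p * q / y)
```

The sign of the coefficient depends only on how the two dispositions are coded. Because the formula assumes a normally distributed variable underlying the dichotomy, the coefficient can exceed 1.00 in absolute value when that assumption is badly violated.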

Since scores on the Hunter problem box might
have been nothing more than estimates of general
intelligence, the Hunter Process Index was correlated
with an IQ estimate based on the administration of
the Information, Comprehension, Picture Completion,
and Block Design subtests of the Wechsler Adult
Intelligence Scale. Both tests had been given on the
same day. Pearson r = -.26, p = .01, N = 80. This
finding suggested that while intelligence is undoubtedly
one component of the Hunter box performance,
the Hunter Process Index is not simply
an IQ estimate. The WAIS IQs were correlated
with the postprogram ratings of adjustment. The
Pearson r of only -.04 demonstrated that the significant
relationship between Hunter Process Index
and postprogram adjustment was not an artifact of
a common relationship to intelligence.


RESULTS AND DISCUSSION


It will be recalled that the Hunter Process 
Index was derived by observation and study 
of data from the male subjects who entered 
into the analyses presented in the develop- 
ment section. Consequently, the need for 
validation on an uncontaminated group was 
apparent. Female chronic psychotic patients
were included in the larger study of social 
interaction treatment programs, of which this 
particular study is only one aspect. The test- 
ing procedures and general methodology were 
highly similar for both groups. In order to 
investigate the generality of the reported 
results, Hunter process indices were computed 
for 71 females for whom full data were avail- 
able. The biserial correlation between the 
Hunter Process Index and postprogram disposition
was r_b = .31, p < .02. The biserial
correlation between the early ratings of adjustment
and postprogram disposition was
r_b = .12 which, as in the case of the males,
was not significant. Thus the significant relationship
between the Hunter Process Index
and postprogram patient status seems to be
more than a statistical artifact.

Although the relationship was demonstrated,
its attenuation in the group of female chronic
psychotics warranted further scrutiny. Comparison
of the performances of the male and
female patients indicated a rather clear sex 
difference. The males, as a group, scored 
better than the females on almost all meas- 
ures of performance, suggesting that the 
components of the index would need to be 
weighted somewhat differently for optimal 
use with female chronic psychotics. Despite 
this limitation, the index predicted post- 
treatment psychiatric disposition for the fe- 
males, while the preprogram clinical ratings 
made by the psychologists did not. 

An additional sample of male subjects 
served for a second validation study. This 
sample of 48 male chronic psychotics was 
composed of 30 pre-experimental, 11 “service,” 
and 7 control patients. Their Hunter protocols 
had never been scored. 

The pre-experimental patients had been
exposed to either the core or intensive social
interaction treatment program during its
developmental stages. This program, although
it preceded the formal experimental program,
was quite similar to it in content and
duration. The patients themselves were
comparable to those entering the later studies.


TABLE 2

COMPARISON OF MEAN HUNTER PROCESS INDEX

                                                       Mean
                                                       Hunter
Group                                            N     Index     SD      t

Normal control (A)                               30     6.27    5.27
Chronic psychotics released from hospital (B)    71     7.34    4.90
Chronic psychotics remaining in hospital (C)     62    11.60    6.54
A versus B                                                               .88
A versus C                                                              3.86*
B versus C                                                              4.22*

* p < .001.

The service patients either were brought to 
the unit after the formal data collection period 
had ended or were assigned on a clinical 
rather than a random basis to the particular 
variety of social therapy. All three types of 
social therapy programs were represented in 
this group. 

The seven control patients represented an 
addition to the experimental sample which 
served to increase the variety of treatment 
approaches sampled. 

Hunter Process Index scores were computed 
for all 48 patients and, as in the previous 
studies, were correlated biserially with disposition.
The biserial correlation was .44, p < .01.
Thus the magnitude of the relationship, as 
well as its existence, seems to have some 
stability. 

A different, and somewhat indirect, ap- 
proach to the problem of validation of the 
Hunter Process Index as a predictor of psy- 
chiatric condition may be found in the fol- 
lowing study. A group of 30 male “normal” 
employees of the hospital was selected from 
a list of volunteers. The intention was to select 
a group comparable, in so far as possible, 
to the hospitalized chronic psychotics but 
with no history of mental illness. These hos- 
pital employees were drawn from the non- 
professional staff and included attendants, 
kitchen workers, laundry workers, garage 
helpers, and warehouse men. Their mean age 
was 45 and they averaged slightly under a 
ninth grade education. Their average IQ
(WAIS, four subtests) was 99 and the duration
of their employment at Philadelphia
State Hospital had a similar distribution
to the length of hospitalization for the
experimental group of chronic psychotics.
Under the circumstances they appear to be
a reasonably adequate normal control
group, similar in many demographic respects
to the patient group.

The normal control subjects were admin- 
istered the Hunter problem box and the 
Process Index was derived. The mean Hunter 
Process Index for this group was 6.27, a some- 
what better score than that earned by the 
patients who were released from the hospital, 
and a considerably better score than that 
made by the patients remaining in the hos- 
pital. Scores for all male patients who were 
released were pooled into one group; similarly
combined were those remaining in the hospital.
The comparative data are presented in
Table 2. There was a highly significant
difference between the scores of either the
normal controls or the released patients and
the patients remaining in the hospital.
The difference between the normals and the 
released patients, while in the predicted 
direction, was not significant. 
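
The group comparisons in Table 2 can be approximated from the published summary figures alone. The sketch below uses the ordinary pooled-variance t test for independent groups. That this was the formula actually used is an assumption, as is the reading of the normal-control SD as 5.27. The function name is hypothetical.

```python
import math
from typing import Tuple


def pooled_t(n1: int, m1: float, sd1: float,
             n2: int, m2: float, sd2: float) -> Tuple[float, int]:
    """Pooled-variance t for two independent groups, from summary statistics."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return (m2 - m1) / se, df


# (N, mean Hunter Index, SD) as read from Table 2
groups = {
    "A": (30, 6.27, 5.27),    # normal controls
    "B": (71, 7.34, 4.90),    # chronic psychotics released from hospital
    "C": (62, 11.60, 6.54),   # chronic psychotics remaining in hospital
}

for first, second in (("A", "B"), ("A", "C"), ("B", "C")):
    t, df = pooled_t(*groups[first], *groups[second])
    print(f"{first} versus {second}: t = {t:.2f}, df = {df}")
```

Recomputed in this way from the rounded means and SDs, the three values come out at about 0.98, 3.89, and 4.28. These are close to, though not identical with, the tabled values of .88, 3.86, and 4.22, as might be expected if the published figures were computed from unrounded data or with a somewhat different error term. The pattern of significance is unchanged.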

A small positive relationship had been 
demonstrated between the Hunter Index 
scores and estimated IQ (WAIS, four sub- 
tests) in the male chronic psychotic group. 
Speculation suggested that this relationship 
might be an artifact of disturbed functioning 
affecting performance in a similar fashion on 
both measures, i.e., the small correlation may 
simply arise from an equivalent impairment 
in performance in both tests on the part of 
any given patient. If this were true, then the
relationship between IQ and Hunter Index 
should be minimal in a normal group. The 
Pearson correlation between WAIS IQ (four 
subtests) and Hunter Process Index was com- 
puted for the normal control group. It was 
r = .12, which is not significantly different
from zero. Thus there is at least a strong 
suggestion that the Hunter Process Index taps 
thinking processes which are quite independ- 
ent of intellectual functioning as ordinarily 
measured. 

The overall results suggest that the Hunter 
Process Index may have utility in selecting
from among hospitalized chronic psychotic 
patients those with significantly better poten- 
tial for a social therapy program and perhaps 
for other treatment approaches as well. It 
appears that the candidates so selected may 
not be the same patients whom clinical judg- 
ment would indicate. Perhaps, based on our 
experience, this would be an asset. Approxi- 
mately 50% of the patients who were serviced 
by the social treatment programs did not 
leave the hospital at the end of their pro- 
gram. Although this is a laudable release rate 
for chronic psychotics, the same figure raises 
serious questions about the effectiveness of the 
selection procedures. It may be that innovations
in screening and selection procedures
will allow the assignment of more appropriate 
chronic psychotics to the new social treatment 
procedures and thus presage an improved rate 
of social recovery. 
ATTITUDINAL AND COGNITIVE ASPECTS OF
INTENSIVE STUDY OF A SECOND LANGUAGE ¹


WALLACE E. LAMBERT, R. C. GARDNER,² H. C. BARIK, AND K. TUNSTALL


McGill University 


Students attending a 6-week French Summer School were examined for at- 
titudes toward French people and culture, their orientations toward learning 
a 2nd language, and their feelings of anomie at the start and end of the 
course. Attention was also given to modification in the meanings of French 
and translated-equivalent English concepts. Students at 2 levels of skill in 
French were compared. Results supported the theory that learning a 2nd 
language efficiently depends on an appropriate pattern of attitudes toward 
the other cultural group and a particular orientation toward language study. 
Anomie increased for both groups of students during the course. Students 
utilized the semantic features of both their languages and permitted the 2
to interact. This linguistic interdependence correlates positively with
achievement.


If one views the learning of a foreign
language from a social-psychological stand-
point, the process takes on a significance
which might otherwise be overlooked. Psy-
chological questions then come to mind, such
as: What motivates people to learn another
language? Is there anything unique about
the learning process involved in language
study? Recent research carried out in our
laboratory (Anisfeld & Lambert, 1961;
Gardner, 1960; Gardner & Lambert, 1959)
has focused on such questions, and various
findings have increased our confidence in a
psychological theory of second-language
learning. This theory states that an in-
dividual successfully acquiring a second
language gradually adopts various features
of behavior which characterize another
linguistic and, as is often the case, another
cultural group. The learner's ethnocentric
disposition and his attitude toward the other
group are believed to influence his success in
learning the new language. His motivation to
acquire the language is considered to be deter- 
mined both by his attitudes toward the other 
group and by his orientation toward learning 
a second language. The orientation takes an 
“instrumental” form if the purposes of 
language study reflect the more utilitarian 
value of linguistic achievement such as 
getting ahead in one’s work, and an “in- 
tegrative” form if the student is oriented 
toward learning more about the other cultural 
community, as if he were eager to be a 
potential member of the other group. Further- 
more it is argued that some individuals may 
be prone to learn another language as a
means of potential acceptance in another 
cultural group because of dissatisfactions 
experienced in their own group, while others 
may be as seriously interested in another 
culture as they are in their own. As one 
becomes proficient in a second language he 
may find that his place in his own member- 
ship group changes at the same time as the 
other linguistic-cultural group becomes more 
than a reference group for him. It may be- 
come a second membership group for him. 
His new position may become a marginal 
one, accompanied by feelings of chagrin or
regret as he loses ties in one group coupled
with the fearful anticipation of entering another.
The term “anomie” (Durkheim, 1897;
Srole, 1951; Williams, 1951, pp. 479 ff.)
refers to the feelings of social uncertainty or
dissatisfaction which characterize not only 
the socially unattached person but also, it
appears, the bilingual or even the serious
student of a second language and culture.

Another set of psychological questions
arises when one realizes that learning a
second language brings a new verbal system
into contact with one which is (usually)
already well established, setting the stage
for interaction and potential interference. We
have examined the consequences of this
interaction among bilinguals (Jakobovits &
Lambert, 1961; Lambert, Havelka, & Crosby,
1958) and have studied means of reducing
interlingual interference (Lambert & Witelson,
1961; Wimer & Lambert, 1959). The
little evidence at hand suggests that if the
two languages are learned and used in
culturally distinctive contexts, rather than
within a single cultural or linguistic setting,
they function relatively independently, and
the connotative meanings of translated-equivalent
words in the two languages are
comparatively distinctive, two factors which
reduce interlingual interference. When two
artificial languages are learned simultaneously
in a common laboratory context (Lambert
& Witelson, 1961), translated equivalents do
become semantically close and yet interlingual
interference does not play an important
inhibitory role in two-language
learning, apparently because subjects pay
close attention to cues which distinguish
words in one language from words in the
other. The Wimer and Lambert study indicates
that the “direct” method of learning
a second language (where foreign words are
associated directly with their referents rather
than, as in the “indirect” method, by being
paired with their word equivalents in the
native language) is more effective because
actual referents to which the new words are
associated are more distinctive than are other
words. It is this distinctiveness characteristic
of the direct method which apparently reduces
interlingual interference and promotes
faster learning of a new vocabulary. However,
the generalizability of these findings
needs to be investigated using
actual language learning situations. Young
adults learning a second language may
develop other means of dealing with the
potential interference of their two languages.

The present study was designed to examine
more extensively both the attitudinal and 
cognitive correlates of second-language learn- 
ing among students attending the McGill 
French Summer School where, for 6 weeks, 
students are required to follow full-day 
course work in French (a second language 
for all attending) and to use only French 
at all other times during the period, i.e., to 
follow a direct method, par excellence. This 
concentrated experience appeared to us par- 
ticularly likely to highlight the attitudinal 
and motivational features of language learn- 
ing as well as the phenomenon of interaction 
of two language systems. Our notions about 
second-language learning are based on studies 
conducted with Canadian students in a multi- 
lingual community where bilingual skill is 
so common that students can sense its po- 
tential value for them. The present study 
permits us to test our theoretical ideas with 
American students, since the large majority 
of those attending the Summer School are 
from the United States. It also permits an 
examination of the reactions of older students 
than those studied in the Montreal school 
systems. Most of the Summer School students 
are at the college or university level and 
many are specializing in French at the 
undergraduate or postgraduate level. 

Although this is viewed as a pilot study 
which calls for methods of analysis, such as 
factor analysis, which will tease out trends
and patterns, we also predicted certain out- 
comes, as is apparent from the introduction 
and the choice of procedures. 

1. Favorable attitudes toward the other 
linguistic-cultural group, including a non- 
ethnocentric outlook, should promote greater 
achievement in language learning. 

2. An integrative orientation toward lan- 
guage learning should be more effective than 
an instrumental orientation for language 
learning. 

3. Those at a relatively elementary level of 
training should contrast with those seriously 
specializing in French study. The more ad- 
vanced group should be more involved with 
feelings of anomie. 

4. In a concentrated period of study such 
as that afforded by the Summer School, all 
students should show evidence by the end 
of the course of a conflict in making bicultural
identifications.

5. The direct learning context of the Sum- 
mer School’s course should engender inter- 
lingual semantic distinctiveness and thereby 
reduce bilingual interference. 


METHOD 
Subjects 


The subjects were 192 students attending the 
McGill French Summer School for the years 1959 
and 1960,³ the large majority of whom were
from the United States. The directors of the 
Summer School take care to place each student in 
courses designed for his level of attainment in 
French by interviewing each before registration 
and suggesting which of three levels would be 
most appropriate for him. Because the different 
sections are kept fairly well homogeneous and 
since the levels represent differences in attainment 
(particularly the introductory group compared to 
the other two), we decided to maintain the school’s 
sectional groupings of students and consider the 
resultant differences in the groups’ patterns of
behavior as due to differences in level of skill. 

Since French is theoretically the only language 
used at the school, students are required to have 
a working knowledge of the spoken language even 
at the introductory level. Applicants for the In- 
troductory section should have completed one 
semester of French at the college level following 
at least 2 years of French in high school. Inter- 
mediate level students normally have completed 
2 or 3 years of college or university training with 
French as a major or continuation subject. They 
are expected to speak with some fluency, to write 
correct grammatical French, and to read French 
at a fair speed. The Advanced level students have 
typically graduated (or are about to) with French 
as their major subject and with teaching or 
graduate work in French as an aim or actual 
profession. It was decided to work with two 
groups for our purposes, calling the Introductory 
level Elementary and combining the Intermediate 
and Advanced to make one comparably large group 
and calling them Advanced. Comparisons of Elemen- 
tary and Advanced students will be made through- 
out this report. 


Materials 


Students were asked to voluntarily participate in 
the present study by the director of the school 
who mentioned his interest in, and the anticipated 
value of, the project. The subjects were given 
questionnaires to complete in group settings at 


³ A preliminary study was carried out in the 
summer of 1958. On the basis of the analysis of 
the results of the 1958 study, the procedures and 
content of certain questionnaires were modified to 
the form presented here. 

the beginning and at the end of the 6-week course. 
Only those students who completed both question- 
naires are included in the analysis. 

Scores on the following tests were obtained: 

California F Scale. Ten items were taken from 
Forms 45 and 40 (Adorno, Frenkel-Brunswik, 
Levinson, & Sanford, 1950). This scale is designed 
to measure antidemocratic and ethnocentric prej- 
udice of an authoritarian nature which we felt 
would reflect a dimension of suspicion-acceptance 
of foreign peoples in general. 

Anomie Scale. This scale was designed to measure 
feelings of normlessness, social dissatisfaction or 
dysfunction, or group alienation. It is made up of 
nine items, five of which come from Srole’s (1951) 
Anomie Scale and four of which were developed 
specifically for this research. Sample items are: 
“These days a person doesn’t know whom he 
can count on” (from Srole) and “Having lived 
my life in this culture, I’d be happier moving 
to some other country now” (our own item). 

Francophilia Scale. Eight statements about 
French people and the French culture were presented 
and subjects were asked to indicate their degree of 
disagreement or agreement on a seven-point scale. 
A sample item is: “Compared to the French, the 
North Americans are an unimaginative people.” 
This test is a direct measure of one’s positive or 


negative attitudes toward French people and 
customs compared to one’s own people and 
customs. 


Orientation Index. Subjects were presented with 
eight alternative reasons “typically given” for 
studying French, and asked to indicate the one 
(or more) which best describes their reasons for 
studying the language. These alternatives were: 


1. Usefulness in work situations. 

2. Enables one to gain friends more easily 
among French-speaking people. 

3. Because no one is really educated unless he 
is fluent in the French language. 

4. To better understand the French way of 
life. 

5. One needs a good knowledge of at least 
one foreign language to merit social recognition. 

6. Because I hope to live in France some day. 

7. I need the training in French for college 
credits. 

8. To enable me to begin to think and behave 
as the French do. 

9. “Any other personal reason.” 


Alternative 9 was supplied in case an individual 
thought the alternatives given were inadequate. Those 
choosing Alternatives 1, 3, 5, or 7 were classified as 
“instrumentally” oriented in their approach to 
language learning. Those choosing Alternatives 2, 4, 
6, or 8 were classified as “integratively” oriented. 
Responses to Alternative 9 were classified by two 
judges as instrumental or integrative if both judges 
agreed. 
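
The classification rule just described can be sketched in a few lines of Python. This is only an illustration, not the scoring procedure actually used: the function name and arguments are hypothetical, Alternative 4 is assumed to belong with the integrative set, and the treatment of subjects who chose alternatives from both sets (here, "unclassified") is an assumption, since the text does not spell it out.

```python
# Alternatives treated as instrumental or integrative, following the text.
INSTRUMENTAL = {1, 3, 5, 7}
INTEGRATIVE = {2, 4, 6, 8}  # assumption: Alternative 4 is integrative


def classify_orientation(choices, judged_other=None):
    """Classify one subject's Orientation Index response.

    choices: set of alternative numbers (1-9) the subject selected.
    judged_other: label both judges agreed on for Alternative 9
        ("instrumental" or "integrative"), or None if they disagreed.
    """
    labels = set()
    if choices & INSTRUMENTAL:
        labels.add("instrumental")
    if choices & INTEGRATIVE:
        labels.add("integrative")
    if 9 in choices and judged_other is not None:
        labels.add(judged_other)
    # Assumption: a mixed or empty set of labels is left unclassified.
    if len(labels) == 1:
        return labels.pop()
    return "unclassified"


print(classify_orientation({1, 7}))
print(classify_orientation({2, 5}))
print(classify_orientation({9}, judged_other="integrative"))
```

A three-way outcome is used here because later sections report a share of subjects who could not be classified unambiguously.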
Distinctiveness of English and French meanings, 
at the start of the course (E1F1). Seven English 
words and their translated equivalents in French 
were presented to subjects in the form of concepts 
to be rated on standard seven-point semantic dif- 
ferential scales (Osgood, Suci, & Tannenbaum, 1957). 
The concepts were: friend, literature, truth, 
church, elegance, family, and me, and their French 
translated equivalents. The eight semantic scales 
were in English for English concepts and in French 
for French concepts. The semantic differential was 
developed to measure the connotative meaning of 
concepts. The technique permits one to compare 
the degree of similarity of connotative meaning of 
any two concepts by utilizing the “general distance 
formula” (D) presented in Osgood et al. In the 
present case the similarity of semantic profiles of 
each pair of concepts rated at the start of the 
course (e.g., friend and ami), calculated on eight 
different semantic scales, was determined for each 
subject and then an average similarity of meaning 
was calculated for all seven pairs of concepts. 
Distinctiveness of English and French meanings, 
at the end of the course (E2F2). This score refers 
to the separation of French and English meanings 
after the 6-week course of study in French. 
Relative change in French meanings during the 
course of study when compared to English meanings 
(F1F2 - E1E2). In this case the difference 
in French semantic patterns at the start and end 
of the course are compared with the changes in 
the English semantic patterns for the same concepts. 
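
As an illustration of how the three meaning scores could be computed, the following Python sketch applies the generalized distance formula, D = sqrt(sum of squared scale differences), to each concept pair and averages over concepts. The function names and the ratings are hypothetical (two concepts rather than seven, for brevity), and keying every profile by the English concept name is a convenience assumed here.

```python
import math


def d_score(profile_a, profile_b):
    """Generalized distance D between two profiles rated on the same scales."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must be rated on the same scales")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


def mean_d(ratings_a, ratings_b):
    """Average D over matched concepts (each dict maps concept -> ratings)."""
    concepts = list(ratings_a)
    return sum(d_score(ratings_a[c], ratings_b[c]) for c in concepts) / len(concepts)


# Hypothetical seven-point ratings on eight scales for one subject.
e1 = {"friend": [6, 6, 5, 7, 6, 5, 6, 6], "truth": [5, 4, 5, 6, 5, 5, 4, 5]}
f1 = {"friend": [5, 6, 4, 6, 6, 5, 5, 6], "truth": [5, 4, 4, 6, 5, 4, 4, 5]}
e2 = {"friend": [6, 6, 5, 6, 6, 5, 6, 6], "truth": [5, 4, 5, 6, 5, 5, 4, 5]}
f2 = {"friend": [6, 6, 5, 6, 6, 5, 5, 6], "truth": [5, 4, 5, 6, 5, 4, 4, 5]}

e1f1 = mean_d(e1, f1)  # distinctiveness at the start of the course
e2f2 = mean_d(e2, f2)  # distinctiveness at the end of the course
relative_change = mean_d(f1, f2) - mean_d(e1, e2)  # F1F2 - E1E2

print(e1f1, e2f2, relative_change)
```

A larger D means a greater difference in connotative meaning, so a drop from E1F1 to E2F2 corresponds to French and English meanings moving closer together.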
Achievement in French. Only those students who 
registered for at least the average number of 
courses in the school and took examinations in 
each were retained in the samples. Our measure of 
achievement was the average grade each student 
was assigned in his four or five courses. The 
typical set of courses for all students would involve 
training in speaking, reading, and writing 
as well as literature or French civilization courses. 
Because each instructor undoubtedly used quite 
different criteria for grading, all grades were transformed 
to z scores and averaged to determine each 
subject’s final achievement score. Unfortunately 
there was not enough time available to us to 
obtain measures of achievement based on standardized 
examinations of French skills. We are aware 
that the final average includes various literary and 
academic skills other than purely linguistic ones. 
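
A minimal sketch of this standardization step is given below. It assumes that grades were standardized within each course (one instructor per course) and that the population standard deviation was used; neither detail is stated in the text, and the function name and sample grades are hypothetical.

```python
from statistics import mean, pstdev


def achievement_scores(grades_by_course):
    """Map {course: {student: grade}} to {student: mean z score}."""
    z_lists = {}
    for course, grades in grades_by_course.items():
        values = list(grades.values())
        m = mean(values)
        sd = pstdev(values)  # assumption: population SD within the course
        if sd == 0:
            continue  # a course with no spread in grades carries no information
        for student, grade in grades.items():
            z_lists.setdefault(student, []).append((grade - m) / sd)
    # Average each student's z scores across the courses he took.
    return {student: mean(zs) for student, zs in z_lists.items()}


# Hypothetical grades from two instructors with different grading scales.
grades = {
    "conversation": {"s1": 60, "s2": 80},
    "literature": {"s1": 70, "s2": 90},
}
print(achievement_scores(grades))
```

Standardizing before averaging keeps a lenient or severe instructor from dominating a student's final score.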
In addition to these eight variables, several other 
combination or difference scores derived from 
these eight variables will be examined. For example, 
at one point we will discuss changes in attitudinal 
scores from the start to the end of the course, 
that is, changes in Francophilia or in anomie, and 
the changes in English meanings from the start 
to the end of the course (E1E2) or in French 
(F1F2) meanings will be discussed. 
Two questions were asked of those students attending 
the 1960 Summer School. These dealt with 
the extent to which the student spoke only French 
outside the classroom and the extent to which 
translation was made from English before speaking 
French (as opposed to thinking exclusively in 
French). The analysis of the responses to these 
questions will be discussed separately below. 

Finally, two control groups of subjects were used 
to determine the test-retest stability of certain 
attitude and meaning measures. These are introduced 
and described at appropriate points in the dis- 
cussion section. 


RESULTS AND DISCUSSION 
Pattern Analyses 


Attention is directed first to analyses of 
patterns of interrelations among the variables 
measured. The intercorrelations of the eight 
main measures for both the elementary and 
the advanced groups are presented in Table 
1.⁴ All coefficients listed are product-moment 
correlations except for those involving the 
Orientation Index. This index is scored 
dichotomously for integrative or instrumental 
orientations; point-biserial correlations were 
computed between this and all other vari- 
ables. The correlations are positive if 
integratively oriented subjects obtained a 
higher mean score on the continuous vari- 
ables than did those who were instrumentally 


⁴ The correlations given are small in magnitude, 
but two comments should be made about them. 
First, the same correlational trends as presented 
here were found in both the 1959 and 1960 samples 
and thus they appear reliable. Furthermore, in the 
1958 pilot study with French Summer School 
students, we were able to administer an objective 
French achievement test (Cooperative French Test) 
at the start of the course. The correlation between 
a preliminary form of the Orientation Index and 
the achievement test scores was significant for the 
elementary group (r = .44, p < .01), but not for 
the advanced group (r = .03). In the 1958 study, 
attitudes toward Europeans were measured instead 
of attitudes toward the French and, as is the case 
here, these attitudes were significantly related to 
anomie for the advanced group (r = .33, p < .01), 
but not for the elementary group (r = .16, ns). 
However, the relation of orientation to actual 
grades was much lower and not significant (r = .18 
for the elementary and .14 for the advanced groups). 
The correlation of French grades with French 
achievement test scores was .70 and .59 for the 
elementary and advanced groups, respectively. Thus 
some variables other than French skills are involved 
in the grades assigned. We feel therefore that the 
relations presented here are reliable and their smallness 
is attributable in part to the poor estimate of 
pure language skills derived from teachers’ grades. 

The multiple correlation of the Orientation and 
Francophilia scales with grades in the present case 
is .33, significant at the .01 level. 

TABLE 1 
CORRELATION MATRICES 

Test                      1      2      3      4      5      6      7      8

Elementary groupᵃ
1. F Scale                     .44    .31   -.25    .19    .02   -.09   -.05
2. Anomie                             [?]   -.21    .00    .05   -.20   -.09
3. Francophilia                              .08    .09    .02    .26    .23
4. Orientation                                      .13   -.05    .44    .25
5. E1F1                                                    [?]    .07   -.12
6. E2F2                                                          -.12   -.16
7. F1F2 - E1E2                                                           .48
8. Achievement

Advanced groupᵇ
1. F Scale                     .40    .58   -.08    .03    .04    .29   -.04
2. Anomie                             .40    [?]    .05    .04    .09    [?]
3. Francophilia                              .10    .06    [?]    .47   -.21
4. Orientation                                      .08   -.06    .01   -.09
5. E1F1                                                    .45    .36   -.19
6. E2F2                                                           .05   -.23
7. F1F2 - E1E2                                                          -.41
8. Achievement

ᵃ Total N varied around 89, slightly less for cases where the data were incomplete. Correlations of .21 are significant at the .05 level, .25 at the .02 level, and .27 at the .01 level. 

ᵇ Total N varied around 103. Correlations of .20 are significant at the .05 level, .24 at the .02 level, and .26 at the .01 level. 


oriented. The tables of intercorrelations were 
subjected to factor analyses using a centroid 
solution. The rotated factor matrices are 
displayed in Table 2. The factor analysis 
reflects attitudinal differences between the 
elementary and advanced groups as well as 
group similarities in cognitive reactions to 
the Summer School experience. For the 
elementary students, one pattern of attitudes 
appears to be important for achievement in 


French: students with favorable attitudes 
TABLE 2 
ROTATED FACTOR MATRICES 

Test                       I      II     III      h²

Elementary group
1. F Scale               .00     .68     .22     .51
2. Anomie                [?]     .62     [?]     [?]
3. Francophilia          .56     .34     .15     .45
4. Orientation           .38    -.32     .03     .25
5. E1F1                  .06    -.10     .63     .41
6. E2F2                 -.17    -.04     .42     .20
7. F1F2 - E1E2           .45    -.17    -.04     .23
8. Achievement           .49     .00    -.22     .29

Advanced group
1. F Scale               [?]     .76     [?]     .61
2. Anomie                .53     .34     [?]     .41
3. Francophilia          .46     .63     .00     .60
4. Orientation           .27    -.06     .12     .09
5. E1F1                 -.11     .19     [?]     .53
6. E2F2                  .02     .06     .59     .36
7. F1F2 - E1E2           [?]     .46     .23     .29
8. Achievement           [?]    -.14    -.36     .16

Note. Orthogonal rotations. 


toward France, a willingness to identify with 
French people, and a capacity to modify 
meanings of French concepts are more likely 
to do well in the French course. This pat- 
tern is very similar to the “social motiva- 
tional” factor found by Gardner and Lambert 
(1959) for English-Canadian high school 
students who were questioned about their 
attitudes toward French-Canadian people and 
culture. The fact that a group of American 
students displays the same pattern of at- 
titudes toward a quite foreign cultural group, 
one which is related to achievement in the 
other group’s language, is of particular in- 
terest. A second attitudinal pattern is not 
associated with achievement: an authoritarian 
disposition, feelings of anomie coupled with 
both favorable attitudes toward France and 
instrumental orientations toward the study 
of French. The measures of meaning indicate 
that elementary students do poorly in the 
course if they maintain semantic distinctiveness 
between English and French translated-equivalent 
concepts. 
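
For readers unfamiliar with the point-biserial coefficient used for the Orientation Index, a minimal Python sketch follows. It uses the standard formula r = (M1 - M0) / s × sqrt(n1 n0 / n²), with s the standard deviation of all scores computed with n in the denominator. The coding (1 for integrative, 0 for instrumental) mirrors the sign convention described in the text; the function name and the data are hypothetical.

```python
import math


def point_biserial(group, scores):
    """Point-biserial r between a 0/1 grouping and a continuous score.

    group: 1 for integratively oriented, 0 for instrumentally oriented.
    The result is positive when the group coded 1 has the higher mean.
    """
    ones = [s for g, s in zip(group, scores) if g == 1]
    zeros = [s for g, s in zip(group, scores) if g == 0]
    n1, n0 = len(ones), len(zeros)
    n = n1 + n0
    mean1 = sum(ones) / n1
    mean0 = sum(zeros) / n0
    all_scores = ones + zeros
    grand_mean = sum(all_scores) / n
    # Standard deviation of all scores, with n in the denominator.
    sd = math.sqrt(sum((s - grand_mean) ** 2 for s in all_scores) / n)
    return (mean1 - mean0) / sd * math.sqrt(n1 * n0 / n ** 2)


# Hypothetical data: orientation codes and achievement z scores.
orientation = [1, 1, 1, 0, 0, 0]
achievement = [0.9, 0.2, 0.4, -0.5, 0.1, -0.8]
print(round(point_biserial(orientation, achievement), 2))
```

The value is numerically the same as an ordinary product-moment correlation computed on the 0/1 codes, which is why it can sit in the same matrix as the other coefficients.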
The factors for the advanced students indicate 
that the attitudinal variables show no 
relation to French achievement. The orientations 
toward the study of French are not 
related to attitudes as was the case for the 
elementary students. Two factors for the 
advanced group indicate that favorable attitudes 
toward France are associated with 
anomie but not with integrative orientations 
toward learning the language. In fact, 
Francophile attitudes are positively related 
to anomie for the advanced group but not 
for the elementary. The patterning of at- 
titudes suggests that favorable attitudes to- 
ward the other culture which are linked to 
anomie bear no relation to achievement in 
the language. As was the case with the 
elementary group, maintenance of separate 
or distinctive French and English meaning 
patterns also hampers achievement in the 
Summer School course for the advanced 
students. 
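
Because the rotations in Table 2 are orthogonal, two simple identities can be used to read the factor matrices: a variable's communality is the sum of its squared loadings, and the correlation between two variables is approximately reproduced by the sum of the products of their loadings. The Python sketch below checks both on three elementary-group rows as printed; it assumes the last column of Table 2 reports communalities (h²), and the function names are hypothetical.

```python
def communality(loadings):
    """Sum of squared loadings of one variable on orthogonal factors."""
    return sum(a * a for a in loadings)


def reproduced_r(loadings_i, loadings_j):
    """Correlation between two variables implied by orthogonal factors."""
    return sum(a * b for a, b in zip(loadings_i, loadings_j))


# Elementary-group loadings (Factors I, II, III) and printed h2 from Table 2.
rows = {
    "Francophilia": ((0.56, 0.34, 0.15), 0.45),
    "Orientation": ((0.38, -0.32, 0.03), 0.25),
    "Achievement": ((0.49, 0.00, -0.22), 0.29),
}

for name, (loadings, printed_h2) in rows.items():
    print(name, round(communality(loadings), 2), printed_h2)

# Implied Francophilia-achievement correlation, for comparison with Table 1.
print(round(reproduced_r(rows["Francophilia"][0], rows["Achievement"][0]), 2))
```

The same arithmetic shows why Factor I carries the achievement relation for the elementary group: it is the only factor on which both Francophilia and achievement load appreciably.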


Attitudinal and Meaning Changes during the 
Course of Study 


The data on attitude and meaning changes 
from the beginning to the end of the course 
reveal mainly similarities between the elemen- 
tary and advanced groups (Tables 3 and 4). 
The two groups manifest the same degrees 
of authoritarianism, anomie, attitudes toward 
France, and orientations toward the study of 
French at the start of the course. On the last 
point, approximately 25% of both groups 
give integrative reasons, 60% give instrumental 
reasons, and 15% cannot be classified 
unambiguously. Both groups become clearly 
more anomic (actually, less nonanomic) during 
the course and the elementary group 
also becomes reliably more authoritarian. 
Neither group changes its attitudes toward 
France. The advanced subjects become somewhat 
less instrumental in their orientations 
to language study during the course (29% 
are integrative, 46% instrumental, and 25% 
cannot be classified). These findings strongly 
support the notion that students in a concentrated 
course of foreign language study 
become psychologically marginal between 
two linguistic-cultural groups. 
Correlations of the attitude changes during 
the course with achievement ratings are 
given in Table 5. (Because the Orientation 
Index is a dichotomous variable, difference 
scores could not be assigned.) The only 
change in attitudes related to achievement is 
the increase in pro-French attitudes among 
the advanced students, which is positively 
related to achievement. Since the increases 

TABLE 3 
COMPARISONS OF ELEMENTARY AND ADVANCED GROUPS 

                         Elementary           Advanced
Variable         df      M     Variance      M     Variance      t
F Scale         190    36.74    94.30      35.95    85.79       .57
Anomie          190    23.16    59.93      22.17    49.03       .93
Francophilia    184    32.04    42.77      32.04    58.06       .00
E1F1            183     2.75     1.31       2.32     0.96      2.74*
E2F2            183     2.19     0.91       2.00     0.50      1.54
E1E2            183     3.35     1.28       2.84     0.76      3.45*
F1F2            183     3.48     1.16       3.06     1.03      2.73*
F1F2 - E1E2     183     0.13     0.43       0.22     0.38       .93

Note. Scores of 40, 36, and 32 indicate neutrality of attitudes 
on the F, Anomie, and Francophilia scales, respectively. 
The higher the respective scores, the more the attitudes are 
authoritarian, anomic, and favorable toward French people and 
culture. 

The semantic differences between English and translated 
French equivalents (E1F1) are given in the form of group means. 
In the first case, the elementary group shows a reliably greater 
semantic separation between English and French words than 
does the advanced group, 2.75 units compared to 2.32. The 
larger the difference score, the greater the differences in connotative 
meanings. These units are D scores based on differences 
of semantic ratings on eight scales for all of the seven French-English 
pairs. In the case of English (or French) words rated 
at the start and the end of the course (E1E2 or F1F2) the D 
scores are based on the comparisons of semantic ratings of the 
same seven words made at the different times. 



in anomie and authoritarianism are not as- 
sociated with doing poorly in French, one 
cannot argue that such changes are a product 
of frustration generated by poor achievement 
in the course. Instead, the findings suggest 
that concentrated immersion in the study of 
a second language possibly evokes invidious 
comparisons between the other culture and 
one's own social group, thereby increasing 
feelings of anomie. (It should be noted that 
the mean anomie scores are below the neutral 
point of the scale, and consequently our 
statements about increases in anomie actually 
reflect a reduction in social satisfaction—the 
opposite of anomie.) 

There are reliable increases in the simi- 
larities of English and French meanings dur- 
ing the course of study for both elementary 
and advanced groups. This rapprochement 
of meaning systems of the two languages is 
of interest in view of the fact that all in- 
struction and a large part of outside-class 
conversations were carried out in French. It 
appears that students at this age and level 
of training in French do make use of their 
first language and permit it to interact with 
the French that is being learned. We have 

TABLE 4 
ATTITUDINAL AND MEANING CHANGES DURING THE COURSE OF STUDY 

Variable             df   M (Before)  Variance  M (After)  Variance       t

Elementary
F Scale              88     36.74      94.30     38.31      87.70      -2.20*
Anomie               88     23.16      59.93     24.88      65.21      -2.51[?]
Francophilia         85     32.04      42.77     32.63      47.86       -.96
E1F1 versus E2F2    [?]      2.75       1.31      2.19       0.91        [?]

Variable             df   M (F1F2)    Variance  M (E1E2)   Variance       t
F1F2 versus E1E2     84      3.48       1.16      3.35       1.28       1.83

Advanced
Variable             df   M (Before)  Variance  M (After)  Variance       t
F Scale             102     35.95      85.79     36.95      71.23      -1.74
Anomie              102     22.17      49.03     24.21      54.69      -3.21**
Francophilia         97     32.04      58.06     31.55      70.66        .86
E1F1 versus E2F2     99      2.32       0.96      2.00       0.50       3.54[?]

Variable             df   M (F1F2)    Variance  M (E1E2)   Variance       t
F1F2 versus E1E2     99      3.06       1.03      2.84       0.76       3.55

* p ≤ .03. 
** [?] 


already commented on the fact that the more 
the two languages do interact the better are 
the achievement ratings in French. There 
is further evidence of language interaction 
in the correlations between changes in English 
meanings (E1E2) and changes in French 
meanings (F1F2) during the study period: 
the correlations of E1E2 with F1F2 are 
.82 and .80 for the elementary and advanced 
groups, both highly significant. 

A control group of 32 senior students at 
McGill who were not taking French instruc- 
tion was administered the same English and 
French concepts as used here to be rated on 
the same semantic scales twice during a 


TABLE 5 

CORRELATIONS OF ATTITUDE CHANGES (DURING COURSE) 
WITH ACHIEVEMENT 

                                  Elementary groupᵃ        Advanced groupᵇ
Test                              1     2     3     4      1     2     3     4
1. F Scale (After-Before)              .37   .34   .12          .26   .22   .06
2. Anomie (After-Before)                     .27   .05                .25   .06
3. Francophilia (After-Before)                    -.14                      .20
4. Achievement

ᵃ Total N varied around 89, slightly less for cases where the 
data were incomplete. Correlations of .21 are significant at the 
.05 level, .25 at the .02 level, and .27 at the .01 level. 

ᵇ Total N varied around 103. Correlations of .20 are significant 
at the .05 level, .24 at the .02 level, and .26 at the .01 
level. 


6-week period. Since nearly all these students 
were from Montreal, they would have had 
about 6 or 7 years of high school training 
in French and possibly a year at the uni- 
versity level. When one examines the separa- 
tion between meanings for English and 
French concepts for the control group there 
is no rapprochement of French and English 
meanings from one testing period to the 
other; the t test for the difference between 
E1F1 and E2F2 is -.48 which is clearly 
not significant. 

For the advanced group there are reliably 
greater changes in French than in English 
meanings (F1F2 - E1E2) but this pattern 
is not quite significant for the elementary 
group. Apparently the summer school experience 
does effect greater changes in French 
than English meanings, particularly for the 
more advanced students. 

One reason for the less pronounced change 
in French relative to English meaning changes 
for the elementary subjects is the fact that 
they show more change in both their English 
(E1E2) and French meanings (F1F2) than 
do the advanced students. (These findings 
indicate that relative to their English meaning 
changes, the elementary students show less 
French meaning change than the advanced 
students.) Thus elementary students manifest 
a general semantic malleability or instability 
in comparison to the advanced group. 

The elementary students show more dis- 
tinctiveness or separation of English and 
French meanings at the start of the summer 
course than the advanced, but this difference 
does not appear at the end of the course, 
suggesting that as one progresses from the 
elementary to the advanced level of skill he 
reduces the degree of linguistic separation 
between his two languages. 
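
The group comparisons of Table 3, such as the one just discussed, can be approximately rechecked from the printed means and variances alone. The Python sketch below computes a pooled-variance t; that this is the test actually used is an assumption (the printed df of 190 is at least consistent with n1 + n2 - 2), as are the group sizes of 89 and 103, which the table notes give only as approximate. The function name is hypothetical.

```python
import math


def pooled_t(mean1, var1, n1, mean2, var2, n2):
    """Independent-groups t from summary statistics, pooling the variances.

    Assumes var1 and var2 are unbiased sample variances.
    Returns (t, degrees of freedom).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / se, df


# F Scale row of Table 3, with assumed Ns of 89 (elementary) and 103 (advanced).
t, df = pooled_t(36.74, 94.30, 89, 35.95, 85.79, 103)
print(round(t, 2), df)
```

With these assumed group sizes the result falls within rounding distance of the .57 printed for the F Scale; the meaning scores cannot be rechecked as closely because their group sizes are not reported.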


Personal Estimates of Concentration on 
French during the Course 


Students of the 1960 session were asked 
how often they had spoken in English out- 
side of class and how much they had thought 
in French when speaking French. The elemen- 
tary students used English more frequently 
than did the advanced students (t = 2.98, 
p < .01). However, there is no reliable difference 
between groups in the extent of thinking 
in French when speaking French (t = .93, 
ns). Apparently, the summer course has a 
common effect on students at both levels 
with respect to their own evaluations of the 
amount of thinking done in French. This 
variable was measured on a five-point scale 
ranging from “always thought in English 
before speaking French” (rated as 1) to 
“always thought in French before speaking 
French” (rated as 5). Both groups fall about 
at Point 3 which was defined as “thought 
primarily in English at the start of the course 
but more in French at the end of the course.” 
The mean scores were 2.95 for the advanced 
students and 2.71 for the elementary. These 
results are quite consistent with the previously 
mentioned finding that students use 
both their languages for cognitive activities. 

The correlates of speaking English outside 
of class are presented in Table 6. For the 
elementary group, there is a marked relation 
between increases in feelings of anomie during 
the course period and the use of English 
outside of class. A similar finding is noted 
for the advanced group: those who were 
more anomic, especially at the end of the 
course, are prone to use English. These findings 
give strong support to the hypothesis 
that students revert to English (even though 

TABLE 6 

CORRELATES OF USE OF ENGLISH AND THINKING IN 
FRENCH FOR ELEMENTARY AND ADVANCED GROUPSᵃ 

                                 Elementary group          Advanced group
                               Degree of   Degree of    Degree of   Degree of
                               use of      thinking     use of      thinking
Variable                       English     in French    English     in French
F Scale (Before)                  .04        -.04         -.02         .09
F Scale (After)                   [?]         [?]          [?]         [?]
Changes in F Scale                .33         .32          [?]        -.04
Anomie (Before)                  -.29        -.33          .28         .05
Anomie (After)                    .17         [?]          .37         [?]
Changes in Anomie                 .54         .40          .43         [?]
Francophilia (Before)            -.05        -.43          .46         .42
Francophilia (After)              .43        -.09          .43         [?]
Changes in Francophilia           .21         [?]          [?]         .07
Orientation Index (Before)       -.09        -.09          [?]        -.41
Orientation Index (After)        -.06        -.25          .44        -.42
E1F1                              .27         .07         -.23         .09
E2F2                              .23        -.40         -.10        -.04
F1F2 - E1E2                      -.22        -.44          .06         .31
Achievement                      -.33         .35          .07        -.02
Degree of using English                       [?]                     -.40

ᵃ These data are for the 1960 Summer School samples only. 
The N for the elementary group varies around 32 and for the 
advanced group 46. For the elementary group, an r of .36 
is significant at the .05 level. For the advanced group, an r of 
.31 is significant at the .05 level. 


they thereby break their pledges to speak 
only French) because of a conflict based 
on anomie. One cannot be certain of the 
causal nature of this relation with these data. 
However, it is logically clearer to suggest 
that reverting to English follows from an 
uncomfortable feeling of anomie rather than 
the converse. 

The two groups differ when consideration 
is given to the correlates of thinking in 
French. For the elementary group, thinking 
in French is also associated with feelings of 
anomie, but because there is an insignificant 
correlation between using English and think- 
ing in French, the pattern suggests that 
elementary subjects become anomic as they 
commence to think in French and then use 
English more frequently when anomie reaches 
a certain strength. Furthermore, there is a 
trend for those elementary subjects who use 
English to do poorly in the course and for 
those who think more frequently in French 
to do well in the course. 

For the advanced subjects, neither using 
English nor thinking in French is related to 
achievement in French. Thinking in French 
does not appear to involve the advanced 
students in a conflict based on feelings of 
anomie as did the use of English. Rather, 
those advanced subjects who think most 
regularly in French are those who are clearly 
instrumentally oriented toward learning 
French (r = -.41 between the Orientation 
Index on the first testing and thinking in 
French, and r = -.42 for the second test 
results of the Orientation Index). 

The first interpretive task is to account 
for the differences in attitudes of elementary 
and advanced groups. The advanced students 
are older, more advanced in the study of 
French, and are concentrating in French at 
college or university levels of training or 
actually teaching French at high school or 
college levels. A careful examination of the 
responses to the Orientation Index indicates 
that the advanced subjects are more profes- 
sionally oriented in their approach to French 
study. Table 7 presents a more complete 
analysis of the Orientation Index scores for 
both groups. The advanced students ap- 
parently want to learn about the French way 
of life more so than do the elementary but 
not because they are more eager to live in 
France, to study or visit in France, to gain 
French friends, or to communicate with the 
language, but because they clearly are more 
interested in teaching or improving their 
teaching of the language, a response which 
would be categorized as “instrumental.” This 
more professional orientation indicates that 
the advanced students have a definite and 
immediately accessible goal which may likely 
maintain a strong degree of motivation in 
language study. Our previous studies in- 
dicated that favorable attitudes and orienta- 
tions toward the other language contributed 
to a strong motivation to learn the other 
group's language and consequently correlated 
with achievement measures. In the present 
case, even the instrumentally oriented ad- 
vanced students have strong motivations to 
learn the language, thereby reducing the 
relation of Orientation Index scores to 
achievement. 

The data also indicate that the advanced 
students do not exhibit the particular syn- 
drome of attitudes which has been found 
to be related to achievement in language 
training. That is, their Francophilia is ac- 
companied by anomie and authoritarianism 
to a much more marked degree than is the 
case with the elementary students. This 
finding could mean that the advanced stu- 
dents are unable to exhibit a patterning of 
attitudes which will relate to achievement 


TABLE 7 
GROUP COMPARISONS OF ORIENTATIONS TOWARD LEARNING FRENCH 

                                  Elementary students    Advanced students
Category                          Before      After      Before      After

Useful in work situations 
To gain friends more easily among French people 
No one really educated without French skill 
To better know French way of life 
Need French to merit social recognition 
I hope to live in France some day 
I need it for college credits 
To begin to think and behave as French people do 
Personal reasonsᵃ 
  Love, enjoy, fascinated by (beauty of) the French language 
  General usefulness 
  General knowledge about literature, theatre, history of France 
  Desire to use (communicate with) the language 
  To study in or visit France 
  Research purposes or to major in French 
  To teach (or improve teaching of) the language 
  Personal satisfaction with learning languages 
No answer or multiple response 
[?]                                 23          21         18          12

ᵃ The “personal reasons” responses were categorized as instrumental or integrative in terms of other responses which were given 
along with these responses if no clear placement was indicated in the personal reason itself. When no certain placement could be 
made the response was called “uncertain.” 




because of the strong relation of anomie and 
Francophilia. Several lines of evidence support 
the contention that both groups experience 
a conflict based on anomie. Both groups 
increase their anomie during the course, and 
for both groups the extent of use of English 
outside of class is related to anomie. The 
nature of the conflict may well differ for the 
two groups. For elementary students, the use 
of English and the tendency to think in 
French, although unrelated themselves, are 
both related to anomie. This is not the case 
for the advanced students for whom thinking 
in French is not related to anomie but 
rather to instrumental orientations toward 
the learning of the language. It is only with 
the advanced students that favorable attitudes 
toward France correlate highly with 
anomie. One can interpret these findings to 
mean that as a student progresses in specialization 
of French, as an avocation in a non-French 
culture, he develops invidious comparisons 
of American and French cultures 
such that the more appreciation he develops 
for the French language and culture the 
more socially dissatisfied he is, or conversely, 
the more anomic he becomes, the more he 
appreciates the French culture. The elementary 
student is not committed to the study of 
French as a professional specialization and 
can appreciate a new culture without the 
involvement of anomic feelings. It is clear 
that he, too, will encounter a conflict centering 
around feelings of anomie, however, as 
he concentrates in the study of the new 
language and culture. The evidence is too 
sparse to extend the discussion to strategies 
employed to manage anomie, but the overall 
findings point to the value of further research 
on this matter. 

We were interested in checking on the 
test-retest stability of the attitude scales 
used, even though published studies, such as 
Smith’s (1955), have demonstrated that these 
or related scales are reliable. We administered 
the F Scale, anomie, and Francophilia items 
to a group of 30 McGill students 5 weeks 
before and at the end of the school year, to 
approximate the “end of school” feature of 
the second test for the French Summer 
School students. The respective first and 
second test means for the control group are: 
authoritarianism, 41.41 and 39.66; anomie, 
28.21 and 27.58; and Francophilia, 28.90 
and 30.20; none of these changes is 
significant. 

However, the control students are highly 
significantly more anomic, significance levels 
being .01 and .001 when compared with 
elementary and advanced groups, respec- 
tively, at the start of the course (cf. Table 
4). There are no reliable differences between 
control and either of the student groups at 
the end of the course. These findings suggest 
that serious students of a second language 
as a group are less socially dissatisfied at the 
start of the course than control students who 
are not specializing in language training. 
The present findings indicate that the Sum- 
mer School population of students, on the 
average, apparently find dual cultural mem- 
bership satisfying, as though their experience 
with a second cultural-linguistic group has 
been enriching for them. The conflict around 
feelings of anomie which has been mentioned 
actually involves attitudinal changes from 
relatively nonanomic towards more anomic 
feelings during the course of the Summer 
School. 

The second major trend is the rapproche- 
ment of meanings of concepts in the students' 
two languages during the course of study. 
It is assumed that the large majority of 
both the elementary and advanced students 
have learned their French mainly in an Amer- 
ican setting and that French concepts have 
been learned through the use of English 
concepts, i.e., that they enter the Summer 
School with compound French and English 
linguistic systems. Furthermore, it would be 
more difficult to develop a coordination of 
French concepts in such a short period as 
the Summer School than to make efficient 
use of a well-established compound system. 
The Montreal setting would not likely be 
distinctively French enough in any case to 
facilitate the development of a separate lan- 
guage learning context for French especially 
since most of the students' time is spent 
with other American students. 

The findings indicate that both groups of 
students manifest an increase in semantic 
similarity of French and English concepts 
during the summer course. Furthermore, 
maintenance of semantic distinctiveness of 
French and English concepts apparently 
leads to poor achievement in the course. In 
addition, the more that semantic change oc- 
curs in one language, the more change is 
also noted in the other language. Modifica- 
tions occur in both French and English con- 
cepts, indicating that training in French has 
affected both languages. These findings are 
of interest because they suggest that these 
American students, taught by the direct 
method, do not keep their two languages 
functionally separated. Rather, they tend to 
permit the two languages to interact, and 
this is to their advantage in the learning of 
the second language. It is hypothesized that 
these students develop generalized super- 
linguistic concepts which incorporate the 
unique semantic features of the concepts in 
each of their two languages. Rather than 
encountering difficulty because of the inter- 
action of their languages, these students are 
better able to acquire French by making use 
of interacting or compound linguistic systems. 

The vastly different type of life chosen by novice nuns and college females 
suggested personality differences which were thought likely to be reflected in 
response style. Accordingly, 40 Ss in each group were asked to rate each of 
the Rorschach inkblots on 21 semantic differential scales. It was found that 
the nuns generally rated the inkblots toward the more positive extreme of 
the evaluation scales and at times toward the more potent extreme of the 
potency scales. The generality of this tendency was assessed by comparing the 
social desirability (SD) ratings of 28 nuns to those of 28 college females. It 
was again found that the nuns tended toward more extreme ratings. The 
introduction of the SD dimension resulted not only in more positively extreme 
ratings on adjectives ordinarily rated positively, but also more negatively 
extreme ratings on adjectives ordinarily rated as negative. Nuns were more 
variable than college females in rating nearly neutral adjectives. It was 
concluded that these response tendency differences might be used as a basis 
for later prediction studies. 


During the early years of its application 
as a projective device many users of the 
Rorschach test speculated on the stimulus 
value of the standard blots and Rorschach 
(1952) himself discussed the qualities of each 
plate. Although this seemed an important 
aspect of the test because of its bearing on 
interpretation, relatively little research was 
done to establish empirical bases for the many 
intuitive impressions which were rife. A few 
studies were done primarily in an effort to 
establish the existence of a “mother” and 
“father” card (Levy, 1958; Meer & Singer, 
1950; Rosen, 1951) and others investigated 
the sexual significance of the blots (Charny, 
1959; Pascal, Reusch, Devine, & Suttell, 
1950; Shaw, 1948). As Baughman (1958) 
indicated in his thorough review of the role 
of the stimulus in Rorschach responding this 
paucity of empirical studies of symbolic 
meaning in the inkblots probably was the 
result of methodological obstacles. 

1 The authors acknowledge with gratitude the 
assistance of Carol Braun and Sue Vollenweider. 
The cooperation of the superiors of the Rochester 
Sisters of Mercy made it possible for sister-students 
at Catherine McAuley College to act as subjects 
in this study; the authors are grateful for this 
permission. 

In recent years some of these obstacles seem 
to have been overcome by the introduction by 
Osgood, Suci, and Tannenbaum (1957) of a 
technique for assessing word meaning which 
they recommended as being of use with pro- 
jective materials. Several applications of the 
Semantic Differential technique to the study 
of the connotative meaning of the Rorschach 
materials have followed in the wake of their 
work (Little, 1959; Rabin, 1959; Rosen, 
1960; Zax & Benham, 1961; Zax & Loiselle, 
1960a, 1960b; Zax, Loiselle, & Karras, 1960). 
These studies have been directed largely to 
an understanding of the impression conveyed 
by the blots to various types of subjects with 
the prime focus being the stimulus materials 
themselves. 

In the course of a few of these Rorschach 
studies (Zax & Benham, 1961; Zax et al., 
1960) and in the work of Osgood et al. 
(1957) it has become apparent, as is the case 
in most human activities, that responses on 
semantic differential scales are a function of 
the personality of the responder as well as 
the objective qualities of the stimulus. Both 
Osgood et al. and Zax et al., for example, 
have noted the response tendency among 
schizophrenics to make a larger number of 
neutral ratings (Point 4 on a 7-point 
scale) than do control subjects. These find- 
ings suggest that semantic differential scales 
may have potential as an objective tool for 
studying personality. 

Recently Couch and Keniston (1960) re- 
ported an impressive series of studies aimed 
at characterizing the personality of subjects 
predisposed to responding positively or nega- 
tively on Likert-type scales regardless of the 
content rated. In their investigation various 
personality indices of subjects manifesting an 
already established response set were explored. 
Another approach to the problem of relating 
response set to personality factors could in- 
volve the comparison of groups equated in 
all important respects but one which would 
suggest some important personality differ- 
ences. A study of the stimulus value of 
Rorschach inkblots for a group of novices 
who have elected to live a cloistered life 
dedicated to religious work provided an 
opportunity for such a comparison. Since this 
group was well matched for age and education 
with a group of female college students 
already studied (Zax & Loiselle, 1960b) it 
was possible to compare ratings of the two 
groups to explore potential differences in re- 
sponse style. If personality differences are 
reflected in response styles, two groups so 
seemingly disparate behaviorally can be 
expected to manifest response dissimilarities. 
The paucity of empirical data descriptive of 
the personality of those choosing to renounce 
many of society’s values to enter a convent 
stands as a drawback to the formulation of 
formal hypotheses regarding expected response 
tendencies. Should such characteristic tend- 
encies be uncovered they would contribute to 
the development of more specific hypotheses 
in an area about which we are presently 
lacking in information. 


METHOD 


Subjects for this study were a group of 40 female 
undergraduate students of psychology at the Uni- 
versity of Rochester and a group of 40 novices from 
Catherine McAuley College in Rochester, none of 
whom had previous familiarity with the Rorschach 
inkblots. The mean age for the college students was 
19.62, ranging from 18.25 to 20.75 years. For the 
novices the mean age was 21.13 years, ranging from 
18.33 years to 23.33 years. 

Seven scales each from the evaluation, potency, 
and activity factors of the semantic differential were 
selected on the basis of their loadings and judged 
relevance to the inkblots (the scales used may be 
found in Table 1). These scales were randomly 
arranged in 10 different orders and booklets were 
assembled in such a way that the 10 orders fell in 
random positions providing a control for any 
order effects which might operate in the making 
of successive ratings. 

Subjects were given these booklets which con- 
tained the standard Semantic Differential instructions 
(Osgood et al., 1957) with minor modifications since 
inkblots and not verbal concepts were being rated. 
After reading the instructions subjects were given 
an opportunity to ask questions. In general both 
groups required only minor clarification of what 
the task entailed. Most of the novices (henceforth 
referred to as Group N) and the majority of the 
college females (henceforth referred to as Group C) 
were tested in small groups. All subjects made 
ratings of the standard series of blots taken in the 
usual order from I to X. The time required to 
complete the series of ratings ranged from 30 to 45 
minutes. 


RESULTS AND DISCUSSION 


In analyzing these data to uncover the 
stimulus characteristics of the inkblots, it was 
first necessary to determine whether each of 
the scales was applicable to each of the blots. 
In cases where a scale was considered in- 
applicable or where the blot was seen to be 
neutral with respect to that scale, an exces- 
sive number of ratings at Point 4 would be 
made. Excessive in this case was defined as 
any number significantly above what might 
be expected purely by chance. Since 40 sub- 
jects were used and random ratings can fall 
on any of seven points, approximately six 4 
ratings might occur by chance. Using the 
chi square technique it was found that 11 or 
more 4 ratings out of 40 represented a sig- 
nificant deviation (p = .05) from this chance 
base. Therefore, further analyses were re- 
stricted to scales where fewer than 11 neutral 
ratings were made. For the remaining scales 
ratings of 4 were eliminated from further 
analyses. Ratings for each card on each of the 
21 scales were then tabulated separately for 
both groups and an analysis of directional 
trends was made. Ratings in Positions 1, 2, 
and 3 were combined as were those in 5, 6, 
and 7. To determine whether a blot conveyed an 
TABLE 1 

RATING SCALES BY NOVICES YIELDING SIGNIFICANT CHI SQUARES 

Scale    I    II    III    IV    V    VI    VII    VIII    IX    X 

Beautiful-ugly n 
Clean-dirty 
Fair-unfair 
Good-bad 
Happy-sad 
Kind-cruel 
Wise-foolish 

Brave-cowardly 
Hard-soft 
Heavy-light 
Large-small 
Masculine-feminine 
Rough-smooth 
Strong-weak 

Active-passive 
Angular-rounded 
Fast-slow 
Hot-cold 
Reckless-cautious 
Sharp-dull 
Tense-relaxed 
impression characterized by the term at 
one or the other end of the continuum, ob- 
served frequencies were compared to the 
theoretical probability that, by chance, half 
the ratings would be in one direction and 
half in the other. The results of these analyses 
are reported in Table 1. 
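
The screening rule for neutral ratings can be checked with a short calculation. The sketch below assumes a two-category, uncorrected chi square (Point 4 versus all other points) tested against the chance expectation of 40/7 ratings, and uses the tabled critical value of 3.841 for 1 df at the .05 level. The function names are illustrative only and are not taken from the original analysis.

```python
# Tabled critical value of chi square for 1 df at p = .05.
CHI2_CRIT_05_DF1 = 3.841


def neutral_chi_square(n_neutral, n_subjects=40, n_points=7):
    """Chi square for observed Point 4 ratings against the chance expectation."""
    expected_neutral = n_subjects / n_points          # about 5.7 of 40
    expected_other = n_subjects - expected_neutral
    observed_other = n_subjects - n_neutral
    return ((n_neutral - expected_neutral) ** 2 / expected_neutral
            + (observed_other - expected_other) ** 2 / expected_other)


def smallest_excessive_count(n_subjects=40, n_points=7):
    """Smallest count of Point 4 ratings that exceeds chance at the .05 level."""
    expected = n_subjects / n_points
    for k in range(int(expected) + 1, n_subjects + 1):
        if neutral_chi_square(k, n_subjects, n_points) >= CHI2_CRIT_05_DF1:
            return k
    return None


cutoff = smallest_excessive_count()
print(cutoff, round(neutral_chi_square(10), 2), round(neutral_chi_square(11), 2))
```

Under these assumptions the cutoff comes out at 11 ratings, which agrees with the criterion stated in the text; a continuity-corrected test would be a different assumption and would need to be checked separately.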

A superficial examination of this table 
indicates that in common with most subjects 
(Zax & Loiselle, 1960b; Zax et al., 1960) 
who have rated the blots on these scales the 
novices view I as a potent, active stimulus 
which is negative in value. On the other hand 
IV which is also viewed as possessing potent 
qualities is not nearly so negatively assessed 
as by other groups. The last four blots as 
well as III are valued rather positively as has 
been done by other female subjects (Zax & 
Loiselle, 1960b). 

Comparisons between the ratings of college 
subjects and the novices were made in two 
separate analyses. In one the groups were 
compared on the number of ratings toward 
the left end of the continuum (Positions 1, 2, 
and 3), the number of neutral ratings (Posi- 
tion 4), and the number toward the right 
extreme (Positions 5, 6, and 7). The chi 
square technique was applied to the 210 com- 
parisons (21 ratings for each of 10 inkblots). 
The results of these analyses are presented 
in Table 2. 

Note to Table 1.—n denotes that a significant number of the ratings were toward the right side of the continuum; N denotes that a sig- 
nificant number of the ratings were toward the left side of the continuum. 

It will be noted that significant differences 
were found in a total of 41 of the scales. This 
significantly exceeds (p < .001) the observed 
total of 13 such differences in the comparison 
of males and females in the original study 
(Zax & Loiselle, 1960a). In interpreting the 
results in Table 2 it should be noted that 
significant differences may have occurred in 
the absence of a complete reversal of rating 
pattern, i.e., a predominance of ratings toward 
the left side of the continuum in one group 


TABLE 2 

COMPARISON ON RATINGS BETWEEN COLLEGE 
SUBJECTS AND NOVICES 

Card    Scale 

I       Angular-rounded 
II      Fair-unfair 
III     None 
IV      Fair-unfair, good-bad, happy-sad, kind-cruel, rough- 
        smooth, strong-weak, angular-rounded 
V       Wise-foolish, heavy-light, reckless-cautious, sharp-dull 
VI      Fair-unfair, happy-sad, wise-foolish, rough-smooth 
VII     Good-bad, happy-sad, kind-cruel, hard-soft, large- 
        small, hot-cold, tense-relaxed 
VIII    Clean-dirty, fair-unfair, kind-cruel, brave-cowardly, 
        hard-soft, angular-rounded, tense-relaxed 
IX      Fair-unfair, kind-cruel, heavy-light, fast-slow 
X       Fair-unfair, good-bad, happy-sad, brave-cowardly, 
        large-small 

Note.—The italicized terms denote the direction in which 
Group N exceeded Group C. 
TABLE 3 

COMPARISONS BETWEEN EVALUATION SCALE 
SCORES OF GROUP N AND GROUP C 

            Group N            Group C 
Card     Mean     s²        Mean     s²         t 
I        30.63    58.82     33.25    68.46     1.47 
II       24.45    91.54     27.88    50.79     1.82 
III      21.48    49.18     23.80    82.10     1.28 
IV       31.33    82.54     38.18    48.33     3.76** 
V        23.88    96.26     27.95    66.05     2.02* 
VI       24.35    64.90     28.65    71.49     2.32* 
VII      18.70    38.26     24.03    84.28     3.05** 
VIII     20.35    41.77     26.78    68.79     3.87** 
IX       24.18    89.08     28.38    95.85     1.95 
X        18.90    38.69     24.90   120.87     3.99* 


and a predominance of ratings toward the 
right side in the other. Thus, it would be 
most accurate to say that Group N perceived 
IV as more fair, good, happy, kind, smooth, 
weak, and rounded than subjects in Group C. 
This does not imply that those in Group N 
viewed IV as fair, good, happy, etc. in an 
absolute sense. When examined in this light, 
there is a clear tendency for Group N to 
be more positive than Group C in their 
evaluations of IV, VI, VII, VIII, IX, and X. 

A further check on Group N's trend toward 
setting a more positive value on these stimuli 
was made by developing and comparing evalu- 
ation scale scores of both groups. This was 
done by simply assigning values from 1 to 7 
to each point on a given scale and summing 
the seven evaluative scale ratings on each 
blot. Thus, a low score (the minimum being 
7) would indicate a very positive evaluation 
and a high score (the maximum being 49) 
would denote a very negative evaluation. 
Card by card comparisons between subjects 
were made by using t tests. The results of 
these are found in Table 3. 
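
The scoring and comparison just described can be sketched as follows. The scale score is the sum of the seven evaluative ratings; the t value is computed from group means and variances, assuming the standard formula for two independent groups of equal size (n = 40). The original computing formula is not stated, so this is an assumption, and the function names are illustrative. The Card I figures are the ones printed in Table 3.

```python
import math


def scale_score(ratings):
    """Sum seven 1-7 ratings: 7 is the most positive score, 49 the most negative."""
    if len(ratings) != 7 or any(not 1 <= r <= 7 for r in ratings):
        raise ValueError("expected seven ratings, each from 1 to 7")
    return sum(ratings)


def t_from_summary(mean_a, var_a, mean_b, var_b, n):
    """t for two independent groups of equal size n, from means and variances."""
    standard_error = math.sqrt((var_a + var_b) / n)
    return (mean_b - mean_a) / standard_error


# Card I in Table 3: Group N mean 30.63, variance 58.82;
# Group C mean 33.25, variance 68.46; 40 subjects per group.
t_card_i = t_from_summary(30.63, 58.82, 33.25, 68.46, 40)
print(round(t_card_i, 2))
```

Under this assumption the Card I value rounds to 1.47, the figure printed in Table 3; the same call can be repeated for the other cards as a consistency check on the table.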

The results in Table 3 indicate that 
Group N's evaluation scores were on the 
average lower (in the more positive direction) 
on each of the 10 blots. Significant t values 
were achieved on six of these and two others 
(on II and IX) were just short of the .05 
level. 

A similar analysis was done of the ratings 
on the so-called potency scales. Potency scale 
scores were determined in the same way that 
evaluation scale scores were computed and 
card by card comparisons were made using 
t tests. The results of these are found in 
Table 4. 

The results in Table 4 indicate that 
Group N potency scores were significantly 
lower (in the more potent direction) on three 
of the blots (II, V, and X). None of the other 
t's approached significance. This finding is an 
interesting one since most groups which have 
rated the blots with these scales (Zax & 
Benham, 1961; Zax & Loiselle, 1960b; Zax 
et al., 1960) agree in not viewing these par- 
ticular blots as very potent stimuli while the 
novice nuns view them as relatively more 
potent and, as can be seen in Table 1, see II 
as high in potency as I and IV, blots which 
are consistently seen as highly potent. This 
result is interpreted as supporting the earlier 
finding that the nuns placed a more positive 
value on these unstructured stimuli than did 
college females. Potent qualities (particularly 
brave versus cowardly and strong versus 
weak) are generally more highly valued than 
are ones implying impotence. 

When activity scale scores for the two 
groups were computed and compared, the 
results were inconclusive. Significant differ- 
ences between groups were found in two blots; 
in one case the college subjects 
rated the blot as possessing more active quali- 
ties and in the other case the novice nuns 
rated the blot (X) as the more active. 

These results indicated a clear rating tend- 
ency on the part of the novices to evaluate the 

TABLE 4 

COMPARISONS BETWEEN POTENCY SCALE SCORES 
OF GROUP N AND GROUP C 

            Group N            Group C 
Card     Mean     s²        Mean     s²         t 
I        20.03    52.38     21.43    46.77     0.89 
II       21.28    33.03     24.68    40.64     2.[illegible]* 
III      28.65    44.49     28.88    45.33     [illegible] 
IV       19.88    68.13     18.55    50.44     0.79 
V        28.68    59.46     32.30    54.87     2.49* 
VI       24.45    37.18     25.28    43.69     0.58 
VII      33.85    24.13     32.83    57.08     [illegible] 
VIII     26.58    50.51     25.23    71.92     [illegible] 
IX       25.58    34.05     24.63    45.00     [illegible] 
X        27.50    38.25     31.05    62.46     2.[illegible]* 

*p < .05 (t = 2.00). 
unstructured stimuli used in this study more 
positively than college subjects matched for 
age and education. As such, they gave rise to 
speculation that the specific findings might 
well exemplify a more generalized tendency 
to respond in a homogeneously positive way 
to test stimuli which allowed for such a re- 
sponse characteristic to be manifested. Ac- 
cordingly, it was decided to probe the 
extensibility of the critical findings by collect- 
ing data on the social desirability (SD) 
stereotypes (Edwards, 1957) of this same 
group. In contrast to the semantic differential 
ratings of ambiguous (inkblot) stimuli, an 
SD rating task is one which taps attitudes 
to very concrete and meaningful content. To 
the extent that parallel response character- 
istics might be observed across these two 
contentually dissimilar situations one might 
have greater confidence in the pervasiveness 
and content-free nature of the response 
characteristic in question. 

From a prior investigation of the SD stereo- 
types of college students (Cowen, 1961) 
norms for a sample including 28 college 
females were available for 209 trait descrip- 
tive adjectives. On this basis, 28 novices 
selected at random from the original sample 
of 40, and comparable to them as well as to 
the college females in the demographic at- 
tributes of age and education, were given the 
trait descriptive adjectives for rating on a 
seven-point SD scale. The list of adjectives, 
the instructional set (i.e., the frame of refer- 
ence of people in general), and the conditions 
of administration were the same as those used 
in the prior study by Cowen. 

Upon completion of the data tabulation, 
means and variances were computed for each 
adjective for the novitiate group. Mean SD 
ratings for the college females and the nuns 
were correlated with a resulting Pearson r 
of .967. This finding accords well with those 
of several other studies (Cowen, Davol, 
Reimanis, & Stiller, in press; Cowen, Staiman, 
& Wolitzky, 1961; Cowen & Stricker, in 
press; Zax, Cowen, Budin, & Biggs, 1962) 
indicating a high degree of comparability 
in the relative SD ordering of trait descriptive 
adjectives across ostensibly diverse groups. 
However, since in prior studies such over- 
riding constancies in relative ordering tended 
to obscure absolute differences between 
groups, several different types of absolute 
cross-group comparisons were undertaken. 
Accordingly t ratios were computed sepa- 
rately for each adjective comparing abso- 
lute differences in mean SD ratings across 
groups, as well as F ratios comparing group 
differences in variability.2 Significant group 
differences in mean SD level at p = .05 or 
beyond were found for 60 of the 209 adjec- 
tives rated. Although restricted in the determi- 
nation of whether or not this is a nonchance 
event by the nonindependence of the adjective 
ratings, it is of interest to note that the 
total of 60 differences significantly exceeds 
(p = .001) the observed total of 23 such sig- 
nificant differences in the comparison of males 
and females in the original study (Cowen, 
1961). Moreover, these differences fell into 
a very distinct formal pattern so that of 38 
positive adjectives, the novices had 34 sig- 
nificantly higher means, whereas, of the 22 
negative adjectives the novitiates had 18 
significantly lower means. This very sharp 
dichotomization prompted a further non- 
parametric analysis, similar to one used in an 
earlier study on the direction of mean differ- 
ences between groups, independent of the 
significance of such differences. Combined 
group means were used to fix overall SD level, 
and on this basis a tally was made for each 
adjective to indicate which group had the 
higher mean SD score. Table 5 presents the 
results of this trichotomous classification of 
positive adjectives (mean SD 5-7), neutral adjec- 
tives (mean SD 3-5), and negative adjectives 
(mean SD 1-3). It may be seen that there is a 
striking tendency for the novices to rate 
positive adjectives more positively and nega- 
tive adjectives more negatively than do the 
college females. This rather obvious fact is 


2 A 9-page table, presenting means and variances 
for the 209 trait descriptive adjectives for both C 
and N groups, as well as indicating those adjectives 
with significant differences in group means (t) and 
variances (F), has been deposited with the American 
Documentation Institute. Order Document No. 7449 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
ton 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 
TABLE 5 

RELATIONSHIP BETWEEN MEAN SD LEVEL AND THE 
DIRECTION OF MEAN GROUP DIFFERENCES 

                                 Mean SD level 
Group                        5-7     3-5     1-3     Total 
Novitiates higher             87      17      14      118 
College subjects higher       21      25      39       85 
Total                        108      42      53      203a 

Note.—χ² = 49.5, p = .001. 
a Mean SD levels were identical for six adjectives. 


borne out by a chi square analysis which 
yields an overall chi square value of 49.5 
(p = .001). This same trend may also be 
observed at the individual category level, 
when the novices are significantly higher for 
6-7 and 5-6 adjectives and significantly lower 
for the 1-2 and 2-3 adjectives. 
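
The overall chi square can be recomputed from the cell counts in Table 5. The sketch below assumes the ordinary Pearson chi square for a 2 x 3 contingency table, with expected frequencies taken from the marginal totals; the function name is illustrative and not part of the original analysis.

```python
def chi_square(table):
    """Pearson chi square for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Rows: novitiates higher, college subjects higher.
# Columns: mean SD level 5-7, 3-5, 1-3 (counts from Table 5).
table_5 = [[87, 17, 14],
           [21, 25, 39]]
statistic = chi_square(table_5)
print(round(statistic, 1))
```

Computed this way, the statistic falls within about a tenth of a point of the reported 49.5, with 2 degrees of freedom.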

Briefly, with respect to group differences in 
variability, again examined in terms of a non- 
parametric test of directionality, no differences 
are observed for negative adjectives; college 
females are more variable on 71 of the 
111 positive adjectives (χ² = 8.66, p = .02), 
while the novices are more variable on 31 of 
the 42 neutral adjectives (χ² = 9.52, p = .01). 
While it is conceivable that the greater vari- 
ability of the college subjects on the positive 
adjectives is merely a reflection of the already 
demonstrated more extreme ratings of the 
novices at this end of the SD continuum, 
there are no such artifactual constraints in 
the middle range of the scale when the noviti- 
ates evidence significantly greater variability. 
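
The nonparametric test of directionality reported above can be illustrated with a one-sample chi square against an even split. This is a sketch under the assumption that each adjective is tallied once for whichever group is more variable and that chance predicts half of the tallies in each direction; the function name is illustrative.

```python
def direction_chi_square(count_a, count_b):
    """Chi square (1 df) testing two directional tallies against a 50/50 split."""
    expected = (count_a + count_b) / 2
    return ((count_a - expected) ** 2 + (count_b - expected) ** 2) / expected


# Positive adjectives: college females more variable on 71 of 111.
positive = direction_chi_square(71, 111 - 71)
# Neutral adjectives: novices more variable on 31 of 42.
neutral = direction_chi_square(31, 42 - 31)
print(round(positive, 2), round(neutral, 2))
```

Under this assumption the two values round to 8.66 and 9.52, matching the chi squares given in the text.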

To provide a context for the evaluation of 
this second set of data, it is important to note 
that prior to this investigation the college 
female subjects represented the most extreme 
and least variable SD rating group yet 
studied. This is in comparison to a wide 
variety of other groups: college males, alco- 
holics, domiciliated geriatric patients, com- 
munity “senior citizens,” sexual psychopaths, 
“relatively intact” schizophrenics, regressed 
schizophrenics, and others. Hence we must 
regard the novices, and this finding need not 
be surprising, as an extremely homogeneous 
group, with very strong and highly uniform 
attitudes toward the acceptability of charac- 
teristics upon which our society places either 
positive or negative value. In a sense, for such 
characteristics they provide what we might 
think of as an extreme of social evaluation, 
stronger than that yet observed in any other 
group. Perhaps this implies a sharpening of 
the ordinary base rates for social morality. 
At the same time, it is of some interest to 
observe that the pattern of sharp and uniform 
reaction breaks down, at least relatively, in 
the neutral SD range, where a clear-cut frame 
of reference is lacking. On such adjectives 
the novices are found to be more variable 
than the college females. 

On a somewhat more integrative level, the 
prime communality which may be extracted 
from the two sets of data is the consistent 
and very strong tendency of the novices to 
react in a highly positive way to stimuli 
which allow for such a response. In the 
semantic differential task, although the ma- 
terial to be rated is ambiguous, positive, 
evaluational concepts are provided in the re- 
sponse matrix; in the case of the SD ratings, 
the trait descriptive adjectives are rated 
directly. The cross-task consistency of re- 
sponse shown by the novices points strongly 
to the likelihood that we are observing a 
specific manifestation of a more generalized 
tendency to respond in an extreme manner, 
once given the possibility of positive or nega- 
tive response. In the absence of such a frame 
of reference more variable behaviors may be 
expected to ensue. 

As a further check on this proposition the 
semantic differential data were re-examined 
with primary focus on the 24 scales (in some 
instances the same scale was involved on sev- 
eral different blots) which received in excess 
of a significant number of neutral (Point 4) 
ratings by both groups. Presumably these 
scales offered the least clear-cut choices for 
certain of the blots. It was found that the 
ratings of the novices showed higher vari- 
ability on 16 of the scales with college 
students being more variable on 8. This dif- 
ference falls just short of exceeding a chance 
expectancy at the .05 level. Of the six of 
these variance differences which yielded sig- 
nificant F ratios, five found the novices more 
variable. These differences, while somewhat 
short of acceptable significance limits, seem 
to reflect a rating tendency similar to that 
seen in connection with the SD rating of 
relatively neutral adjectives. 

This combination of studies has uncovered 
response tendencies which have differentiated 
two groups of females who, by virtue of their 
very different life goals, were expected to 
differ despite similarity in age and education. 
In so doing it has made us aware of specific 
differences between the groups where pre- 
viously logic and intuition had suggested only 
that some differences should exist. The merit 
of these findings, in what has essentially been 
an exploratory effort, is felt to lie in the fact 
that they provide a beginning for the develop- 
ment of a deeper understanding of a little- 
studied group of people who elect to live such 
an idiosyncratic way of life as novice nuns. 
At a surface level one can, using the con- 
structs uncovered herein, begin to predict 
attitudes, values, and, often, probable be- 
havior of nuns as a group as has been done 
by Marlowe and Crowne (1961) working with 
different type subjects and a different response 
set. This could lead to an elaboration and 
refinement of the constructs proposed thus 
providing the type of validity described by 
Cronbach and Meehl (1955). When such 
constructs are defined well and reliably they 
themselves may provide the hypotheses for a 
depth approach aimed at understanding the 
development of the surface picture. 

STABILITIES UNDERLYING CHANGES IN INTER- 
PERSONAL ATTRACTION 


THEODORE M. NEWCOMB 


University of Michigan 


Week-to-week changes in interpersonal attraction on the part of 2 17-man 
populations of initial strangers were observed over a 4-month period. These 
and 2 other kinds of responses (Ss’ attitudes toward various objects, and 
Ss’ estimates of one anothers’ attitudes) showed individual changes during this 
period. From first to last, however, there persisted a preference for psy- 
chologically balanced relationships among the 3 kinds of "elements." This 
psychological constancy helps to account for the nature of the observed 
changes. Since accuracy of estimates increased with acquaintance, balance 
increasingly characterized actual as well as judged relationships. Subgroup 
formation is interpreted in terms of psychological and objective balance, as 
constancies. 


It is a safe prediction that individuals who 
are initially strangers to one another will, 
under conditions assuring that they will 
become well acquainted, experience many 
changes in the degree of their attraction 
toward one another. Such changes, like any 
others that scientists investigate, presumably 
occur in orderly ways, and the principles 
governing both change and nonchange cor- 
respond to constancies. Lewin (1947), 
paraphrasing Cassirer (1923), notes that 
"throughout the history of mathematics and 
physics problems of constancy of relations 
rather than of constancy of elements have 
gained importance and have gradually 
changed the picture of what is essential” 
(p. 5). The present report points to a few 
such constancies of relations that have been 
observed on the part of two populations 
of initial strangers over a 4-month period, 
while their attitudes (elements involved in 
the relations) toward one another were char- 
acterized by a good deal of inconstancy. 

As reported more fully elsewhere (New- 
comb, 1961), two sets of 17 male students 
served in two successive years as subjects in 
an investigation of the phenomena of getting 
acquainted. They had been successfully se- 
lected as total strangers to one another, and 
lived and took their meals together in a house 
reserved for them. During each of 16 weeks 
they responded to a selected set of question- 
naires, attitude Scales, or other instruments, 
many of which were repeated from time to 


time. In particular, they rated or ranked 
each other as to favorability of interpersonal 
attitudes (henceforth referred to as attrac- 
tion) during almost every week. In addition, 
they frequently estimated one anothers’ at- 
titudes of various kinds. The present paper 
partially summarizes and also supplements 
findings reported in the original monograph, 
for the specific purpose of noting constancies 
that underlie inconstancies.¹

The theoretical considerations from which 
the investigation stemmed were direct de- 
scendants from Heider’s (1958) theory of 
“balanced states.” For the purposes of this 
study, the elements among which a balanced 
relationship may exist for an individual are 
his degree of attraction, positive or negative 
toward another individual; his attitude, favor- 
able or unfavorable, toward some object (in 
the inclusive sense, referring to persons, 
issues, and abstractions like general values); 
and the second individual's attitude, as perceived
by the first individual, toward the
same object. A balanced state exists among
these elements insofar as attraction is positive
and the individual perceives that his own
and the other's attitudes are similar. Perceived
dissimilarity together with positive
attraction represents an imbalanced state;
negative attraction (with which this paper
does not specifically deal) together with



1 There were, of course, individual instances of
nonchange, but as population variables most of the
attitudes here considered were highly inconstant.


TABLE 1

MEANS OF 17 INDIVIDUAL CORRELATIONS (rho's)
FOR PAIRS OF ADJACENT WEEKS

Weekly intervalᵃ           Year I    Year II
1-2 through 4-5ᵇ            .82      [illegible]
5-6 through 9-10            .86      .91
10-11 through 14-15ᵇ        .88      .90

Note. In rank ordering attraction toward other 16 subjects.
ᵃ Week numbers refer to the number of preceding weeks of
acquaintance.
ᵇ Variations within these sets of adjacent weeks are so slight
that values for the pairs of weeks have been averaged, rather
than presenting each one.


TABLE 2

MEANS OF 17 INDIVIDUAL CORRELATIONS (rho's)
OVER VARYING INTERVALS OF TIME

Weekly interval    Year I         Year II
[illegible]        [illegible]    .65
[illegible]        [illegible]    .35
0-5                .38            .43
5-15               .66            [illegible]
5-10               [illegible]    [illegible]
10-15              .83            .85



perceived similarity of attitudes may be 
either imbalanced or merely nonbalanced (a 
matter of indifference); together with per- 
ceived dissimilarity, negative attraction may 
be either balanced or merely nonbalanced.²
These rules of balanced relationships include
the specification of certain conditions, most
important of which are that the attitude objects
be of relatively high importance and
be considered to have common impact upon
self and others, in similar ways.³
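
The rules just stated can be written out as a small lookup. The following is only a sketch of the four combinations as the text describes them; the names `State` and `classify` are illustrative and do not come from the original report, and the conditions about object importance and common impact are assumed to hold.

```python
from enum import Enum


class State(Enum):
    BALANCED = "balanced"
    IMBALANCED = "imbalanced"
    IMBALANCED_OR_NONBALANCED = "imbalanced or merely nonbalanced"
    BALANCED_OR_NONBALANCED = "balanced or merely nonbalanced"


def classify(attraction_positive: bool, perceived_similar: bool) -> State:
    """Map attraction and perceived similarity onto the states named in the text.

    Assumes the attitude object is important and of common impact,
    as the text requires; those conditions are not modeled here.
    """
    if attraction_positive:
        return State.BALANCED if perceived_similar else State.IMBALANCED
    if perceived_similar:
        return State.IMBALANCED_OR_NONBALANCED
    return State.BALANCED_OR_NONBALANCED


for attraction in (True, False):
    for similar in (True, False):
        print(attraction, similar, classify(attraction, similar).value)
```

Only the two positive-attraction cases are determinate; the negative-attraction cases keep the ambiguity that the text itself leaves open.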

The significant feature, for present pur- 
poses, of balanced and imbalanced states is, 
in Heider's (1958) words, that “if a balanced
state does not exist, then forces toward this
state will arise. If a change is not possible,
the state of imbalance will produce tension”
(p. 201). Thus balanced states tend to be
stable and imbalanced ones unstable. In 
either case we are dealing with relations, in 
Lewin's terms, and not merely with elements
(attitudes).


INDIVIDUALS' ATTITUDES


It is to be expected that individuals' attraction
to the remaining group members will at


2 This description of balance as associated with
negative attraction differs from Heider's position,
according to which negative attraction together
with dissimilarity is balanced and imbalanced together
with perceived similarity. Theoretical considerations
suggest that the former combination
need not be rewarding nor the latter distressing;
moreover, findings from the investigation here
reported indicate that the former combination
often does not have the stability that is characteristic
of balanced relationships, while the
latter does not necessarily have the instability
characteristic of imbalanced relationships.

3 For a fuller statement concerning this theoretical
approach see Newcomb (1953, 1959).


Note.—In rank ordering attraction toward other 16 subjects. 


first be unstable, because initial attraction 
responses (made on the third day) are neces- 
sarily based upon first impressions only; and 
that week-to-week changes should be in the 
direction of increased stability—that is, that 
the rate of change will be a declining one, 
because in successive weeks the amount of 
“new” information that individuals receive 
about one another will decline. The kinds of in- 
formation about another person that are rele- 
vant to attraction toward him are, in general, 
those that result in the attribution to him 
of properties that are regarded as rewarding. 
These are not necessarily persistent or “in- 
herent” personal properties; they may equally 
well include properties that are elicited only 
in interaction with specific other persons and 
they may, of course, be idiosyncratically at- 
tributed. Changes in attraction result not 
only from new discoveries of what char- 
acteristics another person already has, but 
also from observing qualities that, whether 
one knows it or not, one has oneself helped 
to elicit in him.⁴

Table 1 presents means of week-to-week 
“reliability coefficients”; each subject's rank
ordering of the other 16 subjects in attraction 
at each week was correlated with his rank 
ordering for the following week. Table 2 
shows similar coefficients, computed over 
longer intervals. The two tables together 
provide strong support for both predictions: 
initial responses have little predictive value 
even for so short a period as 5 weeks, whereas 
Week 5 responses predict almost as well to 
Week 15 as to Week 10 (p< .001 in either 


case). Change continues throughout the entire
period, but the rate of change declines hardly
at all after the first 5 or 6 weeks. Except
for very unpopular subjects, whose high
attraction choices are not reciprocated and
continue to be relatively erratic, attraction
choices show comparatively little change
after the first 6 weeks.


4 Certain distinguishable sources of interpersonal
reward, and thus of attraction, have been elsewhere
described by Newcomb (1960).


TABLE 3

RELATIONSHIP BETWEEN GIVING HIGH ATTRACTION
AND PERCEPTION OF RECEIVING HIGH ATTRACTION
FROM SAME PERSONS
(Year I)

                            Number of subjects estimating
                            reciprocation from their Rank 1
Estimated rank of           choices at level indicated
reciprocated attraction     Week 1    Week 5    Week 15
1-2 (very high)               14        14        12
3-4 (high)                     3         2         2
5-8 (second quarter)           0         1         3
9-16 (lower half)              0         0         0
Total                         17        17        17
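
The week-to-week coefficients described above are rank-order correlations between a subject's ranking of the other 16 subjects in one week and his ranking in another, averaged over subjects. A minimal sketch of that computation follows. It assumes untied ranks, so that the standard formula 1 - 6Σd²/(n(n² - 1)) applies; the function names and the two-subject data are invented for illustration and are not the study's data.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank-order correlation for two untied rankings of the same items."""
    n = len(ranks_a)
    if n != len(ranks_b) or n < 2:
        raise ValueError("rankings must have equal length of at least 2")
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * sum_d2 / (n * (n * n - 1))


def mean_rho(rankings_week_a, rankings_week_b):
    """Mean of the per-subject rhos between two weeks.

    Each argument maps a (hypothetical) subject id to that subject's
    ranking of the other house members in the given week.
    """
    rhos = [
        spearman_rho(rankings_week_a[s], rankings_week_b[s])
        for s in rankings_week_a
    ]
    return sum(rhos) / len(rhos)


# Made-up rankings of 16 others by two subjects, for illustration only.
week_0 = {"s1": list(range(1, 17)), "s2": list(range(1, 17))}
week_1 = {"s1": list(range(1, 17)), "s2": list(range(16, 0, -1))}

# s1 keeps the same ordering (rho = 1); s2 reverses it (rho = -1).
print(mean_rho(week_0, week_1))
```

With real data each week's dictionary would hold 17 subjects, and the mean would be taken over their 17 individual coefficients, as in Tables 1 and 2.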

Such changes should occur, hypothetically,
in spite of the individual's tendency to
maintain a constant relationship between
degree of perceived agreement and attraction
to others concerning objects of importance
to himself. If it may be assumed that the
self is such an object, and in general a
positively valued one, then it is to be expected
that high attraction toward others will be
associated with the perception of reciprocation
of high attraction toward oneself. Table
3 supports this prediction for Year I, and
results for Year II are almost identical.
All estimates of reciprocation by Rank 1
choices are at all times in the upper half
of the distribution, and most of them in
the upper eighth. There is a very strong
tendency (not necessarily warranted) to assume
that one's highest ranked associates
return the compliment.


TABLE 4

SUMMARY OF RELATIONSHIPS FOUND BETWEEN LEVEL
OF ATTRACTION TO OTHER SUBJECTS AND PERCEIVED
AGREEMENT WITH THEM ABOUT ATTRACTIVENESS OF
REMAINING SUBJECTS

Time of response        χ²          df
Year I, Week 1          55.81***    2
Year I, Week 5          31.13***    2
Year I, Week 14         38.94***    2
Year II, Week 2ᵃ        17.54**     1
Year II, Week 5ᵃ         6.73*      1
Year II, Week 12ᵃ        9.95**     1

ᵃ In Year II only 5% of all possible estimates, based on a
[illegible] The somewhat lower [illegible] in Year II
result, in part at least, from the smaller [illegible]


It is to be expected, on similar grounds,
that attraction to other individuals will be
paralleled by perceived agreement with them
as to the relative attractiveness of the remaining
House members. As shown in Table 4,
which summarizes relationships between level
of attraction to others and degree of perceived
agreement with them about the
relative attractiveness of other House members,
there is in both populations, at all
stages of acquaintance, a significant relationship
between these two variables.


TABLE 5

SUMMARY OF SIGNIFICANCE LEVELS AT WHICH TWO
HIGHEST RANKING CHOICES BY ALL SUBJECTS
ARE JUDGED TO BE HIGHLY ATTRACTED
TO EACH OTHER
(Year I, Week 5)

Category of             Number of estimates
estimated attraction    Obtained    Expected    χ²        df
Highest quarter           22          8.5       26.51*    1
Upper half                33         17.0       28.30*    1
“Favorable”               34         23.6       13.68*    1

* p < .001.
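
As a check on how the obtained and expected frequencies in Table 5 relate to its test statistic, the sketch below computes a one-degree-of-freedom chi-square for "in the category" versus "not in the category." Two things are assumptions rather than statements from the text: that the total number of estimates is 34 (inferred from the expected 17.0 for the upper half), and that a continuity correction of 0.5 was applied. The function name is illustrative.

```python
def corrected_chi_square(obtained, expected, total):
    """One-df chi-square for a single category against its complement.

    Applies a 0.5 continuity correction to the deviation; whether the
    original analysis did so is an assumption.
    """
    deviation = max(abs(obtained - expected) - 0.5, 0.0)
    complement_expected = total - expected
    return deviation ** 2 / expected + deviation ** 2 / complement_expected


# "Highest quarter" row of Table 5: 22 obtained, 8.5 expected, 34 estimates assumed.
print(round(corrected_chi_square(22, 8.5, 34), 2))
```

Under these assumptions the first row reproduces the printed 26.51. The same calculation comes close to, but does not exactly match, the printed values for the other two rows, so the correction should be read as a plausible guess at the procedure rather than a confirmed one.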

A special instance of this tendency is to be
found in the almost universal tendency to
assume that one's two most preferred sociometric
choices are highly attracted toward
each other. (In view of the fact that reciprocated
attraction from Rank 1 and Rank 2
choices is also perceived as very high, this
set of phenomena may be labeled “the perception
of perfect triads.”) According to the
Year I data, which for this purpose are more
complete than in Year II but which are well
supported by the latter, the relationships
shown in Table 5 are typical of all stages
of acquaintance. It seemed to be
unthinkable to these subjects that their
most preferred choices should be hostile to
each other, and almost so that they should
be merely “neutral.” Early estimates to
this effect were in several cases quite inaccurate;
lack of information invites autistic
judgments. Later ones were highly accurate;
as earlier perceptions of perfect triads were
discovered to be erroneous, preferences
shifted in such manner as to justify the perception
of perfect triads.

Balance inducing forces should also result, 
at all times, in the perception of closest 
agreement with most attractive others with 
respect to objects other than the self and 
House members. The data most suitable for 
testing this prediction are subjects’ rankings 
of the six Spranger values in Year II, together 
with their estimates of how each other subject 
would rank them. Both at Week 2 and at 
Week 14 the relationships between attraction 
toward other subjects and estimates of agree- 
ment with them were highly significant; 
χ² values are 17.19 and 11.63, respectively,
corresponding p values being < .001 and
< .005, df = 2. The slight decline in this
relation, from Week 2 to Week 14, is also
found in other tests of the same prediction;
it reflects in part the countereffects of
greater accuracy with increasing acquaintance.

Thus the data show a continuing increase, 
though at a rapidly declining rate, in the 
stability of attraction toward others. They
also show that at all times, to about the
same degree, attraction toward others is related
to perceived agreement with them concerning
a variety of things.


DYAD RELATIONSHIPS:
MUTUAL ATTRACTION AND ACTUAL 
AGREEMENT 


Insofar as subjects were alert to increments
of information about one another with continued
interaction among them, it is predictable
that estimates of others' attitudes will
become increasingly accurate with continued
acquaintance; and that actual relationships
between mutual attraction and agreement
will increasingly approach the perceived
relationships. The latter prediction presumes
that, with increasing accuracy, subjects will
discover that some of their assumptions about
agreement with attractive others are not
justified, and will tend either to modify
their own attitudes or to shift their attraction
preferences to individuals with whom they
are in fact more closely in agreement.

With respect to the self as an object, the 
data do not support the first prediction: 
estimates of others’ attraction toward oneself 
do not become more accurate with increasing 
acquaintance, and this is true at all levels of 
expressed attraction. Frequencies and mag- 
nitudes of inaccuracies are quite constant, 
although they are at all times predominantly 
in the direction of overestimating the true 
level of reciprocated attraction. Estimates 
are in general fairly accurate, especially at the 
extremes of expressed attraction. Most sub- 
jects, apparently, are rather sensitive to 
others’ indications of attraction toward them- 
selves, at all times, and at all times there is 
a constant tendency to exaggerate the degree 
to which one’s own attraction toward another 
person is reciprocated at about the same
level.

The accuracy with which subjects estimate 
each others’ attraction preferences toward 
other House members does increase. During 
the early period (Weeks 0-5) this increase 
is significant only for the estimator’s highest 
attraction ranks, representing individuals with 
whom he is likely to have associated fre- 
quently enough after 5 weeks to estimate 
their preferences reasonably well. By Week 
15 the trend is unmistakable at all attraction 
ranks: in Year I (the population in which 
these data are most nearly complete) 15 of 
17 subjects were more accurate than at 
Week 1—the binomial probability of which 
is beyond .001. 
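
The binomial probability cited for 15 of 17 subjects can be approximated with a one-sided sign test. The sketch below assumes that, by chance, each subject is equally likely to be more or less accurate at Week 15 than at Week 1 (p = .5) and that there are no ties; the text states neither assumption, and the function name is illustrative.

```python
from math import comb


def binomial_upper_tail(successes, trials, p=0.5):
    """Probability of at least `successes` successes in `trials` independent trials."""
    return sum(
        comb(trials, k) * p ** k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# 15 or more of 17 subjects more accurate, under an assumed chance rate of .5.
tail = binomial_upper_tail(15, 17)
print(tail)
```

Under these assumptions the tail probability is 154/131072, or roughly .0012, which is in the neighborhood of the .001 level cited in the text.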

Subjects’ accuracy in estimating others’ 
rank ordering of Spranger values increases 
at a high level of significance, as shown in 
Table 6, in which the indices of accuracy 
represent rank-order correlations between 
each subject’s estimated rank ordering of 
each other subject’s responses and the latter 
responses as actually made. The mean accuracy
of 272 estimates, according to this
index, is .25 at Week 2 and .49 at Week 14. 

TABLE 6

RELATIVE ACCURACY OF ESTIMATING OTHERS' RANK
ORDERING OF SPRANGER VALUES, EARLY
AND LATE

                    Number of estimates at indicated
                    levels of accuracy
Accuracy levelsᵃ    Week 2    Week 14    Total
>.60                  66        110       176
<.60, >.14           103         95       198
<.14                 103         67       170
Total                272        272       544

Note. χ² = 17.92, p = .001, df = 2.
ᵃ Rho between each estimated rank with actual rank.


Turning now to actual dyadic relationships,
it is to be expected that sensitivity to others'
responses to oneself will increasingly result
in similar levels of attraction on the part of
dyad members, whatever that level may be.
Insofar as forces toward balance with regard 
to the self are tempered with considerations 
of reality, dyad members should come to 
assign about the same degree of attraction 
to one another. Table 7 shows that this is 
indeed the case. A large proportion of the 
dyads whose members accord very different 
levels of attraction to each other include a 
very popular or a very unpopular individual, 
or both. Apart from this consideration, the 
tendency toward increasing reciprocation of 
attraction by dyad members at closely 
similar levels is almost universal. This fact 
about objective dyadic relationships, com- 
bined with the unchanging tendency to per- 
ceive favorably reciprocated attraction, re- 
flects shifts in actual attraction preferences: 
changes are such that increasingly accurate 
judgments of others’ attraction toward oneself 
result in increasingly close reciprocation. 
With respect to Spranger values, also, the
effects of increased accuracy in judging others,
together with constant forces toward balance,
are that actual agreement is increasingly associated
with high mutual attraction. At
Week 2, when these responses were first obtained,
there was no relationship between pair
agreement and mutual pair attraction (χ²
= 1.17, p < .50, df = 1). At Week 14, however
(when responses were last obtained), the
relationship had become highly significant,
as shown in Table 8. The χ² value of this
distribution, with both variables equally
dichotomized, is 9.52, df = 1, and p < .001
by a one-tailed test.


TABLE 7

Ns OF DYADS WHOSE MEMBERS' ATTRACTIONS TO EACH
OTHER DIFFER BY 3 (of 16) RANKS OR LESS

Time of response     Obtained    Expected    χ²         df
Year I, Week 0         51          50          .00      1
Year II, Week 0        59          53          .61      1
Year I, Week 15        77          53        10.61**    1
Year II, Week 15       72          53         6.46*     1

* p < .02.
** p < .002.

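
The phrase "with both variables equally dichotomized" can be made concrete by collapsing the quarters of Table 8 into upper and lower halves and computing a 2 x 2 chi-square. The sketch below uses the cell counts as printed in Table 8. The continuity correction is an assumption, since the text does not say whether one was used, and the function and variable names are illustrative.

```python
# Cell counts from Table 8: rows are agreement quarters (highest to lowest),
# columns are mutual attraction quarters (highest to lowest).
TABLE_8 = [
    [12, 6, 9, 6],
    [10, 16, 6, 4],
    [9, 4, 8, 11],
    [7, 4, 10, 14],
]


def dichotomize(table):
    """Collapse a 4 x 4 table into a 2 x 2 table of upper and lower halves."""
    half = len(table) // 2
    upper, lower = range(half), range(half, len(table))
    a = sum(table[i][j] for i in upper for j in upper)
    b = sum(table[i][j] for i in upper for j in lower)
    c = sum(table[i][j] for i in lower for j in upper)
    d = sum(table[i][j] for i in lower for j in lower)
    return a, b, c, d


def corrected_chi_square_2x2(a, b, c, d):
    """2 x 2 chi-square with a continuity correction (assumed, not stated in the text)."""
    n = a + b + c + d
    numerator = n * max(abs(a * d - b * c) - n / 2, 0) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


cells = dichotomize(TABLE_8)
print(cells)
print(round(corrected_chi_square_2x2(*cells), 2))
```

With the correction the statistic comes to about 9.53, which is very close to the 9.52 reported; without it the value would be about 10.62. The small remaining gap may reflect rounding or the categorization procedure mentioned in the table's footnote.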

It happened that there were almost no
changes in subjects' ranking of the six
Spranger values between early and late
acquaintance; it was therefore possible to
predict later dyad attraction nearly as well
from early as from late agreement; early
agreement did not, however, predict to early
attraction. Similar results were obtained with
two other sets of attitude items, each quite
wide-ranging in content, from which indices
of dyad agreement were computed; first
responses to these items were made on the
third day of acquaintance in Year I, and by
mail one month before subjects arrived at the
university in Year II. As shown in Table 9,
in neither case did preacquaintance agreement
bear any relationship to early scores of mutual
attraction, but high preacquaintance agreement
in each year predicted significantly to high
mutual attraction 4 or 5 months later.


TABLE 8

RELATIONSHIP BETWEEN DEGREE OF AGREEMENT
ABOUT SPRANGER VALUES AND MUTUAL
PAIR ATTRACTION
(Week 14, Year II)

                   Number of dyads at attraction level indicated
Level of           Highest     Second      Third       Lowest
agreement          quarterᵃ    quarterᵃ    quarterᵃ    quarterᵃ    Total
Highest quarter      12           6           9           6         33
Second quarter       10          16           6           4         36
Third quarter         9           4           8          11         32
Lowest quarter        7           4          10          14         35
Total                38          30          33          35        136

ᵃ For reasons described in the full report, dyad scores are
routinely categorized according to proportions of expected, not
obtained, frequencies, which typically show slight differences.

In view of the general increase in accuracy
of estimating others' attitudes and in view of
the constant tendency to prefer balanced to
unbalanced states, it follows that with increasing
acquaintance there should be an
increasing tendency toward relationships that 
are balanced not merely phenomenologically 
but also in fact. This means that, except in 
the case of attitudes that show little or no 
change, change in mutual attraction between 
dyad members should be accompanied by 
change in their actual agreement. This pre- 
diction is best tested with respect to agree- 
ment about the relative attractiveness of 
House members. As shown in Table 10, the 
early relationship between mutual dyad at- 
traction and agreement about House members 
approached zero in both populations, and 
increased to a significant level in the later 
weeks. Typically, there was a good deal of 
shifting about during the earlier weeks, both 
with respect to mutual attraction and to 
preferences among other House members, 
with the result that the relationship between 
attraction and actual agreement becomes a 
highly significant one. It can also be shown, 
by more detailed analysis, that among dyads 
at a high level of mutual attraction in early 
weeks there is significantly higher agreement 
3 months later on the part of dyads whose 
mutual attraction remains at the same high
level than on the part of dyads whose mutual 
attraction has decreased. Change or non- 
change in these two respects proceeds to- 
gether and interdependently. 


HIGH ATTRACTION STRUCTURING
WITHIN POPULATIONS 


Both a rationale and supporting evidence
have been presented for the expectation that
with increasing acquaintance subjects will
shift their attraction preferences in such manner
as to satisfy constant preferences for
balance (that is, agreement with attractive
others about objects of importance), and to
take into account considerations of reality,
with continuing increments of information
about one another. With specific reference to
dyad members' agreement about the attractiveness
of other House members, this expectation
leads to the prediction of increasing
numbers of high attraction triads and larger
subgroups, for the following reasons. Dyad
members will tend to find one or more other
individuals toward whom each of them is
strongly attracted; dyads in which this does
not occur will tend to be unstable. It is likely,
as has been shown, that this high attraction
will be reciprocated in kind by a third individual;
if so, a high attraction triad has
been formed, composed of the three mutually
attracted pairs; if not, in the case of some
particular person, it is likely to occur with a
different one. Once a high attraction triad
has been formed, similar processes tend
toward further accretion, with the attendant
formation of high attraction tetrads and
larger subgroups.


TABLE 9

PREDICTIVE VALUE OF HIGH ATTITUDINAL AGREEMENT
AT EARLY PERIODS FOR HIGH MUTUAL
ATTRACTION AT EARLY AND
LATE PERIODS

                           Week of      Week of
Nature of                  attitude     attraction
response           Year    response     response      χ²         df
Miscellaneous      I          0            0            .46      1
Miscellaneous      II        -4            0            .00      1
Spranger values    II         2            2           1.47      1
Miscellaneous      I          0           13           8.78*     1
Miscellaneous      II        -4           13          11.68**    1
Spranger values    II         2           14           9.52**    1

* p < .005.
** p < .001.

With respect to triads, such processes did
occur, in both populations. Since our principal
interest lies in stable triads we took,
as a criterion of stability, triads all three of
whose component dyadic relationships maintained
a high level of mutual attraction for
3 consecutive weeks. During the first 3 weeks
of acquaintance only 2 of 17 possible triads
remained stable by this criterion in Year I
and 3 of 13 in Year II. During the last 3
weeks, however, 7 of 13 in Year I and 14
of 19 in Year II remained stable; the respective
p values, by exact test, based on
Ns of early and late triads that were stable
and that were unstable, were .036 and .017.
Numbers of stable triads do increase with
acquaintance; there were too few high attraction
tetrads and pentads at any time to
be studied.


TABLE 10

SUMMARY OF MEAN CORRELATIONS BETWEEN MUTUAL
ATTRACTION AND ACTUAL AGREEMENT ABOUT
OTHER HOUSE MEMBERS ON THE PART OF
136 DYADS

Week           Year I         Year II
[illegible]    [illegible]    .18
[illegible]    [illegible]    .37
6-9            .52            .40
10-13          .50            .43
14-15          .46            .58

The attraction structuring of the total 
populations can be described in various ways, 
which might include numbers of high attrac- 
tion subgroups of various sizes, the attraction 
relationships among them, and the number of 
isolates (individuals who have no mutually 
high attraction relationships at all). The 
two population structures did not differ in 
important ways at first, but interesting dif- 
ferences later appeared, as suggested by the 
sociograms presented in Figure 1, which show 
all subgroups of two or more whose members'
levels of mutual attraction reached a rather 
high criterion. The visual appearance of these 
two sociograms, together with certain other 
evidence not here reported, suggests differences 
along a dimension that might be called 
centrality versus divisiveness—a difference 
not predicted on the basis of any theoretical 
considerations. A rather simple index of 
divisiveness confirmed the appearance, and a 


post hoc hypothesis was formulated concerning
the sources of this difference. This hypothesis
rests upon the contribution of two
variables to centrality: the degree of intercon- 
nectedness among high, mutual attraction 
dyads on the part of the individuals having 
the highest attraction power (“popularity”), 
and the amount of attraction power con- 
centrated in these individuals. It was found 
that among the most popular six individuals 
in each population, who together accounted 
for somewhat more than half of the attraction 
power in each population, 20% of all such 
dyads met the criterion of interconnectedness 
in Year I as compared with 53% in Year II.
Moreover, there was more attraction power
concentrated in the six most popular individuals
in Year II than in Year I. Thus each
variable contributes to the greater centrality
in Year II. It is, in fact, an artifactual
necessity that whichever of two populations 
of the same size has these characteristics will 
be the more centrally structured: greater 
interconnectedness at high levels of mutual 
attraction among individuals having most at- 
traction power; and greater attraction power 
on the part of the same number of popular 
individuals. These are not necessarily the 
only variables that contribute to centrality, 


but they do necessarily contribute to it, for
the following reasons. Popular individuals
who are multiply connected by high mutual
attraction constitute a triad or larger subgroup
that serves as a core substructure, each of
whose members brings with him several
“hangers-on” who are thus added onto the
core structure. And the more “hangers-on”
thus added, the larger the substructure
becomes.


FIG. 1. High attraction structure of two populations on final acquaintance
(Year I, Week 15, and Year II, Week 15). (Circles represent individuals;
connecting lines represent mutual attraction at high levels.)

Our post hoc hypothesis as to why so few 
popular subjects were multiply intercon- 
nected in Year I and so many in Year II 
is simply that there was more agreement 
among the six popular individuals in the 
latter than in the former year; the evidence 
abundantly supports it. 

1. The most nearly comparable attitude 
responses made by both populations were to 
miscellaneous batteries of items; in Year I 
the 15 dyads among the six popular subjects
agreed with each other no more closely than
did all other pairs, whereas in Year II their 
agreement was significantly closer than that 
of other pairs (p < .05). 

2. In authoritarianism (F score), five of
the six popular subjects in Year II were 
extremely low, ranking 1, 2, 3, 4, and 5 in 
a low scoring population; in Year I, on the 
other hand, the six popular subjects covered 
nearly the entire range, from 2 to 17. 

3. Although no comparable data are available
for Year I, five of the six popular subjects
in Year II were in extraordinarily
close agreement in Spranger values, and they
knew it: the mean of their 10 intercorrelations
in actual response was .72, and the mean of
their estimated agreements with each other
was .75. (The sixth, who disagreed with all
of the others both in F score and in Spranger
values, was Number 39, the only one of the
six not having high mutual attraction with
more than one of the others.)

Thus the central proposition concerning
balance accounts for the close interconnectedness
of six popular subjects in one year and for the
absence of it in the other year, and thus
accounts for the different attraction structurings
in the two populations. It does not, of
course, account for the fact that in Year II
but not in Year I popularity was pretty much
concentrated among a set of five closely agreeing
individuals. Our general theory has
nothing to say about the characteristics 
which led to popularity in these populations, 
and we can only say that our random methods 
of selecting subjects had this consequence. 


INDIVIDUAL CHARACTERISTICS OF SUBJECTS 


Several theoretically derived expectations 
have been shown to be supported by empirical 
findings. Our tests, however, have been statis- 
tical ones, pertaining to populations or sub- 
populations and not to individuals. There 
are two relevant questions concerning the 
manner in which individuals’ characteristics 
contribute to the findings. 

First, it is possible that a comparatively 
few individuals contribute all or most of the 
variances that account for the statistically 
significant findings; if so, the generalizability 
of the findings is severely limited. The find- 
ings depend rather heavily on assumptions 
concerning two kinds of individual tend- 
encies: to prefer balanced to imbalanced 
relationships; and at the same time to take 
account of “reality” in the form of accretions 
of information that may disturb existing 
states of balance. Individual indices of both 
of these tendencies (sensitivity to balance, 
and accuracy in judging others) were there- 
fore constructed. Results of intensive analyses 
of all 34 subjects may be summarized as 
follows: 

1. Individual differences are clearly ap- 
parent, with respect to both balance and 
accuracy. 

2. In estimating others’ attitudes toward 
varying kinds of objects there are in each 
population typically one or two individuals 
who show little or no sensitivity to balance, 
or no greater than chance accuracy in making 
estimates. 

3. The subjects who are deviant in these 
ways with respect to one attitude object at 
a particular stage of acquaintance are not 
necessarily deviant with respect to other 
attitude objects, or at other times. 

In sum, the tendencies to be sensitive to
balanced relationships and to judge others'
attitudes more accurately with increasing
acquaintance, which underlie our theoretically
derived predictions, appear to be present, at
least in some degree, in all of our subjects.

As to measured individual characteristics,
authoritarianism (as measured by the F 
Scale) seemed most likely to be relevant to 
the problems of this investigation. Our ex- 
pectations concerning authoritarianism stem 
from several studies suggesting that F Scale 
scores (Adorno, Frenkel-Brunswik, Levinson,
& Sanford, 1950) are related to “perceptiveness
of others” (Christie & Cook,
1958, in a review of the relevant evidence 
on this point, pp. 180-183). Such evidence 
leads to the expectation that low F scorers 
should be relatively accurate in estimating 
others' attitudes. With one exception (esti- 
mates of others' ordering of attraction toward 
House members in Year I), the prediction 
is supported with respect to various attitude 
objects, and according to different indices
of accuracy, in both populations. The data 
that best lend themselves to detailed analysis 
of accuracy consist of estimates of others’ 
rank ordering of Spranger values, made in 
identical manner at Weeks 2 and 14 of Year 
II. These data show that on the rather difficult
task of ranking other subjects according
to agreement with the estimator, the non- 
authoritarians excel the authoritarians in ac- 
curacy on late but not on early acquaintance; 
the correlation of .56 between F score and 
accuracy at the later time is significant at
p < [value illegible].

Inaccurate estimates of others’ attitudes 
may represent distortions either in the balance 
promoting direction or in the opposite direc- 
tion; the former may be considered autistic, 
and if we assume that imbalanced relation- 
ships represent a form of ambiguity, of which 
authoritarians are relatively intolerant (cf. 
Adorno et al., 1950), then it is to be expected 
that high F scorers’ inaccuracies will be in 
the autistic direction, relative to low F 
scorers. No difference in this respect appears
at Week 2, when autistic errors are relatively 
frequent at all F score levels; but the predic- 
tion is well supported at Week 14: compari- 
sons of autistic, accurate, and contra-autistic 
estimates for low, intermediate, and high F 
scorers yield a χ² of 13.02 in the predicted
direction, significant at < .01, df = 4, by a
one-tailed test.⁵


5 If the “accurate” estimates are ignored, in order
to compare autistic and contra-autistic responses

These and other findings are consistent
with the following interpretation. The greater 
sensitivity of the very low F scorers enables 
them to select as most attractive those with 
whom they are in fact most closely in agreement
about a rather wide range of values. 
The nonauthoritarians’ characteristic solution 
to imbalance is nonautistic: they tend to 
achieve balance not by exaggerating actual 
agreement with those to whom they are at- 
tracted (on other grounds), but by judging 
rather accurately who is in agreement with 
them and letting their highest attractions be 
determined accordingly. The characteristic 
solution of the more authoritarian subjects 
tends to be just the reverse: instead of letting 
their personal preferences be determined by 
accurate perceptions of agreement, they tend 
to perceive more agreement than actually 
exists with those toward whom they are 
already attracted. 


NATURE OF CONSTANT RELATIONS 


Three kinds of elements, in Lewin's (1947) 
sense, have been considered: an individual's 
attraction toward another person; his at- 
titude toward some object other than that 
person; and that person's attitude, as he 
perceives it, toward the same object. Under 
the conditions of the investigation here 
reported, the stability curves of these three 
kinds of elements were quite different: at- 
titudes toward nonperson objects (especially 
toward general values) showed little change 
from first to last acquaintance; attraction 
toward other House members, on the part 
of most subjects, became relatively stable by 
the end of the first 6 weeks or so; and esti- 
mates of others’ attitudes were relatively 
slow in stabilizing, though with individual 
differences. If the study had been concerned 
only with subjects’ own attitudes and attractions,
it might well have been terminated
after 6 rather than 16 weeks. But in view 
of the crucial place, in the present formulation,
of perceptions of others’ attitudes, and
in view of the relatively slow and continuing 
changes in estimates of others’ attitudes, it
might be argued that the study should have 
been continued for another several weeks. 


only, the inverse relationship between autism and
authoritarianism is still significant at the .05 level. 

What does remain relatively constant, in 
spite of these differential rates of stability 
and change, is the second-order relationship 
between the relationship of two of them (own 
and other's perceived attitude) and attraction.
With regard to such diverse attitude 
objects as the individual subject himself, 
other group members, and a range of non- 
person objects, such a relationship, described 
as a balanced one, is found at all stages of 
acquaintance. This constant relationship is 
maintained despite the fact that all of the 
related elements are changing, or some of 
them are changing while others are not. 
Eventually, the single elements tend to be- 
come stable, but the level at which they do 
so is governed by the same constancy of 
relationships that prevailed throughout the 
earlier periods of change. 

The psychological processes by which intra- 
personal states of balance are maintained 
may be described as follows. As group mem- 
bers interact with one another, each of them 
selects and processes information (about objects
of common interest, about one another 
as sources of attitudes toward those objects, 
and about one another as objects of attraction)
in such ways that the inconsistencies 
and conflicts involved in imbalanced relationships
tend to be avoided. Both autistic 
processes ("balance at all costs") and realistic 
ones ("the truth, whatever it costs") are 
involved, their respective weightings being 
determined both by individual differences 
and by the strength of the attitudes involved. 

When interaction begins with total strangership,
increments of information are inevitable;
attitude change results from the necessity
to adapt simultaneously to increments of 
information and to constant preferences for 
balanced relationships. 

Interacting members of dyads and larger 
groups necessarily make such adaptations to 
one another simultaneously. Insofar as they 
do so realistically, the consequence of reciprocal
adaptation is a mutual relationship that 
is in fact maximally satisfying to both or all 
of them (that is, maximally within the limits 
of what is possible). Realism tends to increase 
with acquaintance and, combined with constant
tendencies toward balance, the inevitable
trend is toward mutuality of attraction.
Stable relationships tend to persist, 
and relationships that are in fact balanced 
tend to be stable because they are mutually 
rewarding and not likely to be disturbed by 
increments of information with continued 
interaction. 

Viewed intrapersonally, the generalizable 
constancies underlying changes in interpersonal
attraction that apply to all individuals
(regardless of their differences in preferring
some personal properties to others 
and regardless of the personal traits that 
others present) are preferences for balanced 
relationships, and tendencies to adapt to 
information regarded as valid. Viewed inter- 
personally, the generalizable constancies are 
the necessities (which may become inter- 
nalized as preferences) confronting each of 
a set of interacting persons to make successive 
adaptations to one another, simultaneously 
and reciprocally, in the direction of establish- 
ing relationships that are both realistic and 
balance promoting for each. It is relationships 
that are simultaneously rewarding to each 
and realistically apprehended by each that 
tend to be stable. Such relationships are both 
psychologically (intrapersonally) balanced 
and objectively (interpersonally) balanced. 

Attitude changes that are governed by 
these constancies stem from a triple confrontation
that is characteristic of la condition
humaine. Each of us must somehow 
come to terms, simultaneously, with the 
other individuals and groups of which our 
interpersonal environment is constituted; with 
the world that we have in common with those 
persons and groups; and with our own, intra- 
personal demands, including the preference 
for balanced states. Insofar as the individual’s 
confrontation is characterized by changing 
input of information, the elements that cor- 
respond to his attitudes are subject to in- 
constancy, but the lawfulness with which 
they change corresponds to certain con- 
stancies in relationships among the elements. 
It is such constancies that make possible 
viable adaptations, simultaneously, to mul- 
tiple confrontations. 


REFERENCES 


Adorno, T. W., Frenkel-Brunswik, Else, Levinson, D. J., & Sanford, R. N. The authoritarian 
personality. New York: Harper, 1950. 

Cassirer, E. Substance and function. Chicago: 
Open Court, 1923. 

Christie, R., & Cook, Peggy. A guide to published 
literature relating to the authoritarian personality 
through 1956. J. Psychol., 1958, 45, 171-199. 

Heider, F. The psychology of interpersonal relations.
New York: Wiley, 1958. 

Lewin, K. Frontiers in group dynamics: Concept, 
method and reality in social science; social equilibria
and social change. Hum. Relat., 1947, 1, 5-41. 

Newcomb, T. M. An approach to the study of communicative
acts. Psychol. Rev., 1953, 60, 393-404. 


THEODORE M. NEWCOMB 


Newcomb, T. M. Individual systems of orientation. 
In S. Koch (Ed.), Psychology: A study of a 
science. Vol. 3. New York: McGraw-Hill, 1959. 
Pp. 384-422. 

Newcomb, T. M. Varieties of interpersonal attraction.
In D. Cartwright & A. Zander (Eds.), 
Group dynamics: Research and theory. (2nd 
ed.) Evanston, Ill.: Row, Peterson, 1960. Pp. 
104-119. 

Newcomb, T. M. The acquaintance process. New 
York: Holt, Rinehart, & Winston, 1961. 


(Early publication received October 9, 1962) 


Journal of Abnormal and Social Psychology 
1963, Vol. 66, No. 4, 387-389 


CRITIQUE AND NOTES 


INTERACTION OF SET AND AWARENESS AS DETERMINANTS 
OF RESPONSE TO VERBAL CONDITIONING 

Stanford University and Veterans Administration Hospital, Palo Alto, California 


The utility of an operant conditioning model to psychotherapy was evaluated 
by studying the definition of the situation given S and S’s focus on E's 
behavior. Instructions induced either a positive or negative set, identifying a 
story telling task as a test of empathy or personal problems. Awareness was 
induced in ½ of the Ss by calling attention to E’s reinforcement “mm-hmm.” 
12 undergraduate students served as Ss in each of the 4 experimental groups. 
Positive set-Aware Ss increased use of emotional words, while Negative set- 
Aware Ss decreased use of emotional words. The results were interpreted as 
evidence that awareness can either facilitate or inhibit conditioning, depending 
upon S's set. 


Bandura (1961), Dollard and Miller (1950), 
Frank (1961), Kanfer (1961), Krasner (1961), 
and Marmor (1961), among others, have recog- 
nized that verbal operant conditioning offers a 
model situation for studying interpersonal vari- 
ables relevant to psychotherapy. The present 
paper reports on the interaction of two variables 
which are central to a determination of the 
utility of this model: the definition of the situation
given the subject (set) and the subject's 
focus on the experimenter's behavior (awareness).
The experimental procedure within which 
these variables were studied was made to re- 
semble psychotherapy in four ways: First, 
emitted (free operant) verbal behavior was con- 
ditioned rather than elicited (sentence comple- 
tion) verbal behavior (Taffel, 1955). Second, 
the “sets” experimentally created were relevant 
to psychotherapy: “personal problems” versus 
“empathy” or health. Next, the situation was a 
TAT-like one, and finally, the reinforced verbal 
class was one which had been developed and 
documented in a clinical setting. 


METHOD 


Set and awareness were manipulated by alterations 
in the instructions given to the subjects prior to the 
conditioning task. A positive set was induced by 
telling the subject that the procedure was a test of 
empathy, or warmth and feeling towards people. A 
negative set was induced by characterizing the task 
as a test of personal problems and difficulties in 
getting along with people. Awareness was induced 
in half the subjects by telling them that the experi- 
menter would indicate that they were either revealing 
personal problems or showing empathy by going 
“mm-hmm.” There were four experimental groups all 
of whom were given the following instructions: 


This is a new personality test that we are trying 
out. I have here a set of cards with drawings of 
people. I want you to make up an imaginative 
story about each card. In telling your story 
mention something about the present: what is 
going on now; the past; what led up to or other- 
wise explains the present; and the future: the 
outcome of the story. In other words, a complete 
imaginative story. Feel free to tell any kind of 
story you wish. Make it interesting to yourself. 

The only restriction on your stories is the time. 
Each story must last for a full 4 minutes. [Demon- 
strate clock] Frequently you will still be talking 
when the clock goes off. When that happens, just 
stop and I'll give you another card. To save my 
having to write, we'll tape record it. 


In addition to the above instructions each of the 
subgroups was given instructions as follows: 


[Positive set-Nonaware] We are hoping that in 
telling these stories you will show how much 
empathy you have for others. Your stories will 
be scored for the amount of warmth and feeling 
you have towards other people. 

[Positive set-Aware] We are hoping that in 
telling these stories you will show how much 
empathy you have for others. Your stories will 
be scored for the amount of warmth and feeling 
you have towards other people. After your first 
few stories I will let you know that you are 
showing warmth by going “mm-hmm” whenever 
you do this. 

[Negative set-Nonaware] We are hoping that in 
telling these stories you will show your own personal
problems and difficulties in getting along with 
others. Your stories will be scored for the amount 
of personal problems you have in dealing with 
other people. 

[Negative set-Aware] We are hoping that in 
telling these stories you will show your own personal
problems and difficulties in getting along with 
others. Your stories will be scored for the amount 
of personal problems you have in dealing with 
other people. After your first few stories I will let 
you know that you are revealing your own personal
problems by going “mm-hmm” whenever 
you do this. 


TABLE 1 

EFFECT OF SET AND EXPERIMENTALLY MANIPULATED 
“AWARENESS” ON INCREASED USE OF EMOTIONAL 
WORDS DURING REINFORCED TRIALS 

                        Nonaware            Aware 
Set                     M       SD        M       SD 
Empathy               +5.92   12.33    +11.50    8.92 
Personal problems     +3.33   10.83     -.42     7.75 


Task. Subjects were presented cards with simple 
line drawings of people engaged in commonplace 
activities, such as buying ties, fishing, or getting a 
haircut (Weiss, Krasner, & Ullmann, 1960). The TAT- 
like instructions were to tell imaginative 4-minute 
stories. Each 4-minute story defined a trial and all 
stories were tape recorded for later scoring. There 
were four trials. The first and second trials were 
used to obtain “operant level” and were not rein- 
forced. During the third and fourth trials, emotional 
words (EW), as defined by Ullmann and McFarland 
(1957), were reinforced by the experimenter who 
nodded his head and said “mm-hmm” as if in 
agreement each time the subject used an emotional 
word. 

Subjects. Forty-eight undergraduate males and 
females served as subjects. The proportion of males 
and females was balanced across the four groups. 

Procedure. Subjects were seen individually in the 
same experimental room. The variable of social 
deprivation-social enhancement (Gewirtz & Baer, 
1958; Kanfer & Karas, 1959; Walters & Karal, 1960) 
was attenuated by all subjects first completing a 
picture identification task (Ekman, 1961). Subjects 
were then introduced to the conditioning task by 
the above instructions. Subjects were assigned to 
experimental groups in terms of order of their appearance.
All stories were tape recorded, and a clock 
was used to time each story. 


RESULTS 


Scoring reliability by three raters of single 
nonreinforced trials of college undergraduate 
protocols had in previous research (Weiss et al., 
1960) yielded a coefficient of concordance at 
the .001 level. Since two trials were used for 
establishing both operant and reinforced levels, 
reliability was at least, and probably greater 
than, .90. 

The four experimental groups did not differ 
from each other on number of EW used during 
operant trials (all the F ratios were less than 
unity). Because there were no systematic differences
during operant trials, a 2 × 2 analysis of 
variance was completed on the differences be- 
tween number of EW used during operant and 
reinforced trials. Table 1 presents the means and 
standard deviations for the four groups. The F 
ratio for rows (Empathy-Personal problems) 
was significant past the .05 level (F = 5.66, 
df = 1/44), while the F ratio for columns 
(Aware-Nonaware) was insignificant (F = .09) 
and the interaction approached, but did not 
reach, statistical significance (F = 2.34). The 
most important difference was between groups of 
"aware" subjects: aware subjects who had been 
told that the situation was a measure of empathy 
increased an average of 11.5 EWs while aware 
subjects who had been told that the situation 
was a measure of personal problems on the 
average decreased .5 an EW. The difference be- 
tween these two groups was significant beyond 
the .01 level (t = 3.56). 
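
As a check on the arithmetic, the three F ratios of the 2 × 2 analysis can be recovered from the cell means and standard deviations in Table 1 alone. The sketch below is illustrative and is not a record of the original computation: the function name is hypothetical, and it assumes equal cells of 12 subjects and standard deviations computed with N (not N - 1) in the denominator, a convention the text does not state. Under that assumption the F ratios round to 5.66, .09, and 2.34, in agreement with the reported values; with N - 1 standard deviations each F would be larger by a factor of 12/11.

```python
def anova_2x2_from_summary(means, sds, n):
    """Equal-n 2 x 2 between-subjects ANOVA from cell summaries.

    means, sds: dicts keyed by (row, col), each index 0 or 1.
    Assumption (not stated in the source): each SD uses N in the
    denominator, so the within-cell sum of squares is n * sd**2.
    """
    a, b = means[(0, 0)], means[(0, 1)]
    c, d = means[(1, 0)], means[(1, 1)]
    # With equal n, every 1-df effect has SS = n * contrast**2 / 4.
    ss_rows = n * ((a + b) - (c + d)) ** 2 / 4
    ss_cols = n * ((a + c) - (b + d)) ** 2 / 4
    ss_inter = n * (a - b - c + d) ** 2 / 4
    ss_within = sum(n * sd ** 2 for sd in sds.values())
    df_within = 4 * (n - 1)
    ms_within = ss_within / df_within
    return {
        "F_rows": ss_rows / ms_within,
        "F_cols": ss_cols / ms_within,
        "F_interaction": ss_inter / ms_within,
        "df_within": df_within,
    }


# Cell values copied from Table 1; rows = set, columns = awareness.
table1_means = {(0, 0): 5.92, (0, 1): 11.50, (1, 0): 3.33, (1, 1): -0.42}
table1_sds = {(0, 0): 12.33, (0, 1): 8.92, (1, 0): 10.83, (1, 1): 7.75}

result = anova_2x2_from_summary(table1_means, table1_sds, n=12)
for name, value in result.items():
    print(name, round(value, 2))
```

The contrast form is used because, with equal cell sizes, each main effect and the interaction reduce to a single comparison among the four cell means, so no raw scores are needed.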


DISCUSSION 


Although many of the early studies in verbal 
conditioning reported evidence for “learning 
without awareness” (Adams, 1957; Krasner, 
1958), later studies have focused on a number 
of methodological issues which have brought into 
doubt the interpretations of these early findings 
(Dulany, 1961; Eriksen, 1960; Levin, 1961; 
Matarazzo, Saslow, & Pareis, 1960; Spielberger, 
1961; Spielberger, Levin, & Shepard, 1962). A 
previous study (Krasner, Weiss, & Ullmann, 
1961) has demonstrated that “awareness” can 
be manipulated by instructional set in such a 
way as to differentially affect responsivity. The 
problem has also been approached by varying 
the amount of information given to the subject 
(Tatz, 1960), varying the “task relevant in- 
formation” (Kanfer & Marston, 1961, 1962), and 
using instructional set to create “high threat” and 
“low threat” experimental situations (Sarason & 
Ganzer, 1962). The manipulation of “awareness” 
by various instructional sets avoids the pitfalls of 
ascertaining “awareness” in an interview after 
the completion of the conditioning task. 

The present results show that set and awareness
cannot be considered separately, but that 
induced awareness will differentially affect conditioning
depending upon the subject's orientation 
towards the task. Thus heightening the subject’s 
attention or alertness to the reinforcement con- 
tingency does not in itself predict whether con- 
ditioning will be facilitated or inhibited. If the 
subject believes that there is something un- 
pleasant or undesirable about the response being 
reinforced, then increased awareness will lead him 
to inhibit or suppress his use of the response 
class the experimenter is “reinforcing.” On the 
other hand, increased awareness may facilitate 
behavior change if the subject’s set has been 
positive, leading him to view the experimenter’s 
“reinforcing” behavior as an indication of favor. 


SCHIZOPHRENIC DEFICIT AND ITS RELATION TO 
SOCIAL MOTIVATION 1 

A study investigating the effects of praise and encouragement on the performance
of schizophrenics was carried out with 48 normal and 48 hospitalized 
schizophrenics, the latter being drawn equally from open and closed wards. 
All Ss performed a simple motor task, those in the experimental groups 
receiving verbal praise and encouragement from E after each trial and those 
in the control groups no evaluative comments. The results indicated that all 
experimental groups performed better than their controls but that the amount 
of improvement did not differ among the diagnostic groups. In absolute 
performance level, normals were significantly superior to the schizophrenics 
but the 2 schizophrenic groups did not differ significantly from each other. 


Psychologists have been increasingly concerned
in recent years with testing the implications
of various hypotheses that attempt to account
for psychological deficit among schizophrenics
by postulating that they respond differently
to social cues than do normal individuals.
One group of studies falling into this 
general classification has been specifically con- 
cerned with the influence of social rewards and 
positively toned motivators on the performance 
of laboratory tasks. The hypothesis most fre- 
quently tested in these investigations has been 
derived from the suggestion of Hunt and Cofer 
(1944) that the inferior performance of schizo- 
phrenics in a variety of situations reflects in 
part an extinction of responsiveness to social 
stimuli. Thus, by implication, schizophrenics 
have been hypothesized to be less responsive 
to positive rewards and motivation than are 
normals and hence to show less improvement 
in performance when these are introduced into 
an experimental situation. 

The purpose of the present study was to 
provide further evidence bearing on this latter 
hypothesis. Since the type of social motivation 
to which Hunt and Cofer (1944) referred seems 
to be a general desire to please the experimenter, 
to conform to social expectations, etc., the 
manner in which motivation was introduced 
into the present situation was also general. 
Rather than employing a procedure which combined
information about the correctness of the 
subject's response with positive reinforcement 
(e.g., the experimenter's “Right”), as has been 
done in several prior investigations (e.g., Atkinson
& Robinson, 1961; Cohen & Cohen, 1960),
the subject was given a speed task and administered
praise and encouragement for his 
performance. To determine whether severity of 
illness, as reflected by ward status, would be 
related to magnitude of performance change 
with praise, two groups of schizophrenics were 
tested: patients on open wards and on closed 
wards. 


1 Based on a master's thesis of the senior author 
performed under the direction of the junior author 
while both were at Northwestern University. The 
junior author is responsible for the preparation of 
the manuscript in its present form. 


METHOD 


Subjects 


The schizophrenic group consisted of 48 male, 
chronic patients from Downey Veterans Administra- 
tion Hospital, 24 drawn from open wards and 24 
from closed wards. Mean length of current hospitalization
was 4.2 years and 4.7 years for the open 
and closed ward subjects, respectively. Forty-eight 
male nursing aides served as normal control subjects. 
All subjects were 20-40 years of age, had no history 
of alcoholism, drug addiction, or of CNS damage. 

Half of the subjects from the normal group and 
from the two schizophrenic groups served in the 
experimental condition and half in the control 
condition. All subgroups were equated for age and 
education, the mean age for the total sample being 
28.9 years and mean education 11.6 years. 


Apparatus 


The apparatus consisted of a board, 8 × 21 inches, 
containing 11 rows of 60 holes. The subjects’ task 
was to place small brass pins into the holes as rapidly 
as possible. The task was selected for two reasons: 
pilot work indicated that it is predominantly a 
performance task, subjects exhibiting little improvement
over trials; thus performance differences between
control and experimental groups could be 
expected to be minimally contaminated by possible 
differences in rate of learning; and it can easily be performed
by chronic schizophrenics, thus minimizing 
fear of failure. 


Fig. 1. Mean number of pins placed per trial for the six groups. 


Procedure 


After receiving preliminary orienting instruction 
and practice in placing the pins, subjects were given 
10 45-second trials, during which they were instructed 
to fill as many holes as possible. On each trial, the 
subjects started and stopped at the experimenter’s 
verbal command. The intertrial interval was 30 
seconds, during which time all subjects were reminded
of the instructions and if necessary of the 
proper procedure for placing the pins. Only comments 
of this nature were made to control subjects. Experimental
subjects received, in addition, verbal praise 
and encouragement immediately after each trial, e.g., 
“… very good”; “That’s the fastest I’ve seen”; 
“… still fine. See if you can’t go even faster the 
next time.” In addition, the experimental subjects 
were treated in a warmer and friendlier manner prior 
to the actual testing. 


RESULTS 


As may be seen from Figure 1 and Table 1, 
differences in performance were related to both 
the major variables. Each experimental group 
consistently performed at a faster rate than its 
respective control group, while within diagnostic 
category, normal subjects in both conditions were 
clearly superior to all of the schizophrenic subgroups.
Although the differences were not as 
marked, each of the open ward (OW) schizophrenic
groups was also superior to its parallel 
closed ward (CW) group. 


TABLE 1 

MEAN NUMBER OF PINS PLACED PER TRIAL FOR THE 
THREE DIAGNOSTIC CATEGORIES UNDER CONTROL 
AND EXPERIMENTAL CONDITIONS 

                    Experimental        Control 
Group                M       σ        M       σ 
Schizophrenics 
  Open Ward         15.1    .95      13.6    .62 
  Closed Ward       14.1    .94      12.6    .36 
Normals             18.0    .82      17.3    .43 


Two analyses of variance for repeated measures 
(Kogan, 1948) were performed on these data. 
The first compared all six subgroups and resulted 
in significant Fs for all main effects (p = .01); 
i.e., there was significant improvement in performance
over trials, performance levels differed 
significantly among the three groups (normals, 
OW, and CW patients) and between the experimental
and control conditions. None of the interaction
terms, however, approached significance. 
Thus it may be concluded that while praise and 
encouragement facilitated the performance of all 
groups to a significant degree, there was no 
indication that the amount of improvement in 
the performance of normal subjects was any 
greater or any less than in either of the schizo- 
phrenic groups. 

The second analysis of variance omitted the 
data from the normal groups. Although the Fs 
for Trials and Conditions continued to be sig- 
nificant (p= .05), that for the OW-CW com- 
parison was not, suggesting that the significant 
term for groups in the first analysis was due to 
the superiority of the normal subjects. Neither 
the Group × Condition nor the triple interaction 
term approached significance, however, again sug- 
gesting that relative to control performance, 
praise did not lead to any greater improvement 
in one group than the other. 

Since neither of the raw score analyses reported 
above took initial differences in performance level 
among the groups directly into account, a second 
type of analysis based on gain scores was under- 
taken. A gain score was computed for each 
subject by subtracting the total number of pins 
placed on the first two trials from those placed 
on the last two. Analyses of variance of these 
data, one based on all six subgroups and the 
second on only the schizophrenic subgroups, indi- 
cated that subjects in the experimental condition 
gained significantly more than their controls 
(p < .01). In neither analysis, however, did the 
Fs for Groups or the Group × Condition interaction
approach significance. 
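
The gain score defined above reduces to a short computation. The following is a minimal sketch only; the function name and the trial counts are invented for illustration and are not data from the study.

```python
def gain_score(pins_per_trial):
    """Pins placed on the last two trials minus pins on the first two."""
    if len(pins_per_trial) < 4:
        raise ValueError("need at least four trials to form a gain score")
    return sum(pins_per_trial[-2:]) - sum(pins_per_trial[:2])


# Hypothetical subject with ten 45-second trials (values are made up).
example_trials = [12, 13, 13, 14, 14, 15, 15, 15, 16, 16]
example_gain = gain_score(example_trials)
print(example_gain)  # (16 + 16) - (12 + 13) = 7
```

Because each subject's own early performance is subtracted out, groups that start at different absolute levels can be compared on improvement alone.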


DISCUSSION 


The results of the present investigation indicate 
that the absolute performance levels of the 
normals were significantly higher than those of the 
schizophrenics, with the two schizophrenic groups not 
differing significantly from each other. With 
respect to the effects of the experimental condition,
it was found, first, that praise and encouragement 
facilitated performance in all groups and, second, 
that the magnitude of performance change under 
these conditions was no different in schizophrenics 
than in normals. 

The same kinds of findings have also been 
reported in three studies of simple reaction time 
(Benton, Jentsch, & Wahler, 1960; Goodstein, 
Guertin, & Blackburn, 1961; Stotsky, 1957): 
schizophrenic performance was facilitated by the 
experimental conditions but to the same degree 
as normals. All of these studies were procedurally
similar to the present one in the use of praise 
and encouragement as the experimental variable 
and in studying the effects of this variable on 
the performance of a relatively simple, repetitive 
speed task. However, several other investigators, 
testing the effects of praise and encouragement 
on the performance of somewhat more complex 
tasks, have found significant differences between 
schizophrenics and normals. Olson (1958) and 
Lair (1954) both reported that the amount of 
improvement exhibited by praised schizophrenics, 
in comparison to their controls, was greater than 
for the parallel normal groups, while Stotsky 
reported significantly less improvement. The 
implications of the Hunt and Cofer (1944) hy- 
pothesis thus receive little support from this 
group of investigations; with only one exception, 
praise and encouragement either had no differential
effects on the performance of normals and 
schizophrenics or had effects opposite to those 
postulated by an underresponsiveness notion, i.e., 
schizophrenics improving relatively more than 
normals rather than less. 

As was mentioned earlier, introducing praise 
and encouragement into an experimental situation
might be described as a general evaluation 
of the subject’s performance level. Another 
method that has been used to study the effects 
of positive social incentives might be described 
as a procedure in which information concerning 
the correctness of individual responses is com- 
bined with positive reinforcement. Investigating 
verbal conditioning by means of the Taffel (1955) 
sentence construction technique and without 
giving the subjects instructions to learn, Cohen 
and Cohen (1960) and Leventhal (1959) found 
that in contrast to normals and neurotics, schizophrenics
showed no tendency to increase their 
usage frequency of pronouns reinforced by the 
experimenter’s “good.”2 Several investigators 
(e.g., Cohen & Cohen, 1960) have interpreted 
such results as indicating the underresponsiveness 
of schizophrenics to positive reinforcers. Recent 
evidence suggests that “verbal conditioning” may 
not be at all a simple form of learning that can 
be accomplished without subjects’ awareness 
(e.g., Levin, 1961). Rather it may involve a complicated
problem solving process in which one 
of the problems is to discover that something is 
to be learned. In the absence of suitable control 
conditions, it is therefore difficult to attribute 
the failure of schizophrenics to learn, or even to 
equal normal performance in a verbal conditioning
situation, to specific underresponsiveness to 
the positive reinforcer rather than to a more 
general performance deficit that is typically found 
in verbal learning and problem solving tasks. 


2 Other investigators, using somewhat specialized 
procedures, have found that schizophrenics condition 
(e.g., Krasner & Ullmann, 1958). In the one study 
in which nonschizophrenics were also tested, 
Salzinger and Pisoni (1960) reinforced self-referred 
affect statements during an interview and found no 
significant differences in the number of such statements
given by schizophrenic and physically ill 
patients. 

A study by Atkinson and Robinson (1961) did 
provide such controls. Employing a paired- 
associate learning task, these investigators gave 
positive reinforcement (“Right,” “Good,” etc.) 
following each correct anticipation. Contrasting 
the performance of this reward group with con- 
trol subjects given no reinforcement (but able 
to assess their correctness from the drum) 
Atkinson and Robinson found that in normal 
subjects, the addition of social reward facilitated 
performance. The performance of the rewarded 
schizophrenics, however, was worse than the 
performance of their nonreinforced controls. 
Such results do not suggest that the schizo- 
phrenics were underresponsive to social reward 
in comparison to normals, but that the effects of 
the reward on performance were in different 
directions. 

Assuming that these results could be repli- 
cated, the Atkinson-Robinson study raises the 
interesting possibility that specific reinforcers, 
evaluating the accuracy of individual responses, 
may have quite different effects on the perform- 
ance of schizophrenics than general positive 
evaluation of subjects’ performance level. Other 
variables could, of course, also contribute to the 
discrepancy between these results and those obtained
with praise and encouragement: the 
nature of the task, characteristics of the schizophrenic
sample, etc. One conclusion seems fairly 
clear, however. Lumping together all types of 
schizophrenics, all types of laboratory tasks, and 
the total class of manipulations characterized as 
social incentives and motivations, has not produced
a simple empirical generalization about 
the effect that social incentives have on the performance
of schizophrenics, let alone a generalization
about the differences in reaction to social 
incentives between schizophrenics and normals. If 
the seeming contradictions among empirical findings
are to be resolved, more refined analyses of 
the relevant experimental variables and their 
interaction with schizophrenic characteristics 
appear to be demanded.

DERIVED FROM BALANCE THEORY

The study explored the interpersonal attraction factors relating to roommate
compatibility among college women. The sample consisted of 121 university
women who requested roommate changes during the middle of the academic 
year. Using an adjective check list, each S described: (a) herself, (b) her 
originally assigned roommate, (c) her new roommate, (d & e) her perception 
of her roommates’ description of her, and (f) her perception of how most 
people viewed her. A comparison of the responses of the satisfied (new) 
roommates with those of the dissatisfied (original) roommates supported the
following 2 hypotheses at the .01 level: (a) Interpersonal attraction varies with 
attitudinal similarity in regard to the self as an object of importance. (b) 
Interpersonal attraction varies more directly with perceived similarity than 
with objective similarity of self-other descriptions. 


The purpose of the present study was to 
explore interpersonal attraction factors pertaining 
to roommate satisfaction among college women. 
It was specifically designed to investigate two 
of the aspects of Heider’s (1958) earlier bal- 
ance theory which were recently examined by 
Theodore Newcomb (1961) in his Acquaintance
Process. 

Reciprocal attraction may be analyzed either 
in terms of objective similarity or of perceived 
similarity of attitudes. Objective similarity refers
to the actual agreement expressed by two persons
toward an evaluated object. Perceived similarity 
has reference to an individual’s estimate of the 
extent of consensus between his and another 
person's evaluation of a relevant object. 

One of the valued objects, if not the most 
important object, in one's attitude world is 
himself. Interpersonal attraction is thus directly 
related to agreement in self-other evaluations of 
oneself (Newcomb, 1961). An individual is at- 
tracted to others whom he perceives as agreeing 
with his perception of both favorable and unfavorable
traits within himself. This thesis seems
to imply that it is psychologically comfortable 
for one to feel that he is completely understood 
by a person who is aware of his desirable and 
undesirable qualities and yet continues to value 
his friendship. While it is rewarding to perceive 
consensus in the appraisal of all traits, it is 
especially gratifying to have one's positive 
characteristics appreciated by others. 

In the absence of perceived consensus, Heider 
(1958) proposes that the relationship between 
communicators is characterized by imbalance. 
This same nonconsensual condition is described 
later by Festinger (1957) as cognitive dissonance
and by Newcomb (1961) as a “strain of perceived
discrepancy.”

Newcomb's work was not concerned with personality
similarity, but with attitudinal similarity.
While his writings deny that roommate compatibility
requires similar dispositions, Heider’s
more elaborate theory of interpersonal attraction
suggests that both of these homogeneity elements
contribute to congeniality.


Hypotheses 


1. It is hypothesized that interpersonal attraction
among college roommates varies with
attitudinal similarity in regard to the self, as
measured by a revised Gough Adjective Check
List.

2. It is further proposed that interpersonal
attraction of freshmen roommates is more
directly related to perceived similarity than to
objective similarity of self-other descriptions on
the revised Gough Adjective Check List.


METHOD
Subjects 


The sample consisted of 121 freshmen women at
the University of Kentucky. In analyzing the
data, these subjects were divided into two
classifications, a group of satisfied roommates and
a group of dissatisfied roommates. The satisfied group
was composed of new roommate pairs who had
selected one another as roommates, while rejecting
their originally assigned roommates. The responses
of this group were then contrasted to the information
reported by the dissatisfied subjects, who chose
not to remain with one another as roommates. The
odd number of subjects was produced by the inclusion
of a few triple rooms, which contained three roommates.
Students from three residence units were used
as subjects.

Procedure

All three residence groups were informed at the
beginning of the academic year that roommate 
changes would be allowed after approximately 5 
weeks. Five weeks were selected as an appropriate 
time interval in view of Newcomb’s (1961) earlier 
research findings that most of the initial interpersonal 
exploration among college students was completed 
within 5 or 6 weeks. The 35 students who requested 
new roommates at the end of this time and the 
168 students who remained with their originally 
assigned roommates throughout the entire year were 
excluded from the present study. 

At a group meeting in December, all of the 
subjects were encouraged to change roommates at 
the end of the first semester. They were reminded 
that the opportunity of living with two different 
students greatly enriched a freshman's college experience.
The groups were assured, however, that the
staff would not discountenance any roommates who, 
notwithstanding this suggestion, decided to remain 
together. This approach was taken with the assump- 
tion that roommates who were very satisfied would 
remain with one another in spite of the appeal, 
and that other less satisfied students would take 
advantage of the opportunity to select new roommates.

Students requesting a roommate change were asked 
to move during the first week of the second semester. 
All subjects were then asked to fill out six copies 
of the revised Gough Adjective Check List describing
themselves, their originally assigned roommates,
their newly selected roommates, their perception
of how they were viewed by their former
roommates, their perception of how they were
viewed by their new roommates, and their perception
of how most people see them. 

Five months after the original research data were 
collected, the subjects from one of the three residence 
halls were re-examined with the same test instru- 
ments. The purpose of this second testing was to 
determine to what extent perception is altered when
living arrangements change. Do negative feelings
toward an originally assigned incompatible roommate
lessen when students no longer live together?
Similarly, do positive feelings toward a newly selected
roommate decrease or increase after sharing the same
room?


Testing Instrument 


A revision of the Gough Adjective Check List
was used as an instrument for obtaining the subjects'
perception of themselves and of their roommates.
The Check List in its original form consisted
of 300 adjectives. The modification of the test,
designed specifically for use in this study, was composed
of only 100 items, 50 positive and 50 negative
true and false items.


RESULTS
Analysis Procedure


Cross-comparisons were made of the
revised Adjective Check List measures, each
of which contained 100 items. The measures are
as follows: 


1. How the subject sees herself. 

2. How the subject thinks her originally as- 
signed roommate sees her. 

3. How the subject thinks her newly assigned 
roommate sees her. 

4. How the subject thinks most people see her. 

5. How the originally assigned roommate sees 
the subject. 

6. How the newly selected roommate sees the 
subject. 


Each subject’s description of herself (Meas- 
ure 1) was compared with how she was seen 
by her original roommate (Measure 5). The same 
self-image (Measure 1) was matched with the new 
roommate’s description of the subject (Measure 
6). The number of correct predictions made by
the subject was tabulated and a t test was applied
to determine if the difference between the
two groups of roommates was statistically significant.
The results provided a measure of objective
similarity and were a test of the first
hypothesis, that interpersonal attraction varies 
with attitudinal similarity in regard to objects 
of importance. 
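
The item-matching step described above can be illustrated with a short sketch. This is not the author's scoring procedure. It assumes each checklist is stored as a list of booleans (adjective checked or not checked) and that two descriptions "agree" on an item when they treat it the same way. The function name `agreement_count` and the five-item data are hypothetical; the actual instrument had 100 items.

```python
from typing import Sequence


def agreement_count(a: Sequence[bool], b: Sequence[bool]) -> int:
    """Count the checklist items on which two descriptions agree.

    Assumption: agreement means both descriptions check the adjective
    or both leave it unchecked.
    """
    if len(a) != len(b):
        raise ValueError("both descriptions must cover the same items")
    return sum(x == y for x, y in zip(a, b))


# Hypothetical five-item excerpt, for illustration only.
self_view = [True, False, True, True, False]          # Measure 1
original_roommate = [False, False, True, False, True]  # Measure 5
new_roommate = [True, False, True, True, True]         # Measure 6

objective_original = agreement_count(self_view, original_roommate)
objective_new = agreement_count(self_view, new_roommate)
print(objective_original, objective_new)
```

Per-subject counts of this kind, one for the original roommate and one for the new roommate, are the quantities the t test would then compare.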

The second hypothesis was tested by matching 
the 100 items used to describe how the subject 
thought she was seen by her originally assigned 
roommate with how she was actually perceived 
by that roommate. This analysis was followed by 
a comparison of the new roommate’s description 
of the subject with the subject’s prediction of 
the roommate’s perception. The difference in
correct predictions made by the two groups was
analyzed by means of a t test. This yielded a
measure of perceived similarity, testing the 
second hypothesis that “Interpersonal attraction 
varies more directly with perceived similarity 
than with objective similarity." 

A third t test was made after a correction
factor had been applied to the measures of per- 
ceived similarity. The correction factor was con- 
tained in Measure 4 (How I think Most People 
See Me). It was introduced in order to correct 
the estimate which each subject gave of how she 
was seen by both her original and her new room- 
mates. The adjustment was made by comparing 
the subject's prediction of how she thought her 
original roommate saw her (Measure 2), with 
how she thought most people saw her (Measure 
4), and by tabulating each item on which they did 
not agree. If the subject perceived her original 
roommate as viewing her in the same manner as 
she is appraised by most people, this perception 
was not particularly meaningful. If, for example,
the subject thought that most people regarded

TABLE 1
VALUES OF t FOR INTERPERSONAL ATTRACTION

Measures a                       | Group 1 | Group 2 | Group 3
(1 and 5) and (1 and 6)          | 10.08*  | 10.80*  | 15.58*
(2 and 5) and (3 and 6)          | .055    | .040    | .026
(2 - 4) and 5; and (3 - 4) and 6 | 7.26*   | 8.91*   | 11.15*

a Measure 1 refers to subject's view of herself, Measure 2
refers to the original roommate's description of the subject,
Measure 3 refers to the newly selected roommate's description of
the subject, Measure 4 refers to the subject's prediction of the
original roommate's description of the subject, Measure 5 refers
to the subject's prediction of the newly selected roommate's
description of the subject, Measure 6 refers to the subject's
prediction of how she is perceived by most people.
* p < .01.


her as being honest and also believed that the 
originally assigned roommate considered her 
honest, there was nothing unique in her prediction 
of her roommate's perception. If, however, the 
subject felt that most people pictured her as an 
unselfish individual, while her originally assigned 
roommate saw her as a selfish person, this lack 
of consensus was more meaningful. All of the 
items which were unique in the subject's estimate 
of the original roommate's description were then 
compared with how that roommate actually per- 
ceived her. In the above instance, if the subject 
had guessed correctly, and if the roommate had 
described her as selfish, while the subject thought 
other people considered her unselfish, this ad- 
jective was added to the frequency count of 
items depicting perceived similarity with the 
original roommate. 

A similar comparison was made of the con- 
sensus between the subject's estimate and the 
new roommate's description of the subject. The 
number of correct predictions for the original and 
new roommates were then compared. This
measure of perceived similarity, corrected for
factors which might produce accidental accuracy, 
was considered a more valid measure than the 
originally uncorrected measure of perceived 
similarity. 
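
The correction just described can be sketched as follows. This is an illustrative reading of the text, not the author's tabulation: an item counts toward corrected perceived similarity only when the subject's prediction for the roommate differs from her prediction for most people and also matches what the roommate actually reported. The function name, the boolean encoding, and the three adjectives are assumptions made for the example.

```python
from typing import Sequence


def corrected_perceived_similarity(
    predicted_roommate: Sequence[bool],
    predicted_most_people: Sequence[bool],
    roommate_actual: Sequence[bool],
) -> int:
    """Count items that are both unique to the roommate and predicted correctly."""
    if not (len(predicted_roommate) == len(predicted_most_people) == len(roommate_actual)):
        raise ValueError("all three descriptions must cover the same items")
    count = 0
    for pred, general, actual in zip(
        predicted_roommate, predicted_most_people, roommate_actual
    ):
        unique = pred != general   # prediction is specific to this roommate
        correct = pred == actual   # roommate really described her that way
        if unique and correct:
            count += 1
    return count


# Hypothetical items: "honest", "selfish", "moody".
predicted_original = [True, True, True]       # Measure 2
predicted_most_people = [True, False, False]  # Measure 4
original_actual = [True, True, False]         # Measure 5

corrected = corrected_perceived_similarity(
    predicted_original, predicted_most_people, original_actual
)
print(corrected)
```

In this toy case only "selfish" is counted: "honest" is not unique to the roommate, and "moody" is unique but predicted incorrectly.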


Findings 


According to the first hypothesis, interpersonal 
attraction varies with attitudinal similarity in 
regard to objects of importance. As reported in 
Table 1, the hypothesis was statistically supported 
at the .0005 level of confidence by all three residence
units, the obtained t values being 10.08,
10.80, and 15.58, respectively.

According to the second hypothesis, inter- 
personal attraction among freshman roommates 
is more directly related to perceived similarity 
than to objective similarity of self-other descriptions
on the revised Adjective Check List. The
results clearly supported the hypothesis, as indicated
in Table 1. The obtained difference between
the compatible and less compatible roommates
yielded t values of 7.26, 8.91, and 11.15
for the three residence groups. Estimates which
students made of the consensus existing between 
them and their roommates were thus strikingly 
more accurate for new roommates than for the 
original, uncongenial roommates. 

Without the correction factor, there was no dif- 
ference in the interpersonal perception processes 
of the satisfied and dissatisfied roommates, the 
t values being .055, .040, and .026 for the
three residence units. The striking difference between
the uncorrected figures and the results
obtained with the correction factor certainly suggests
some degree of accidental accuracy. It is
probable that the uncorrected responses reflected
predictions which related to all people and were
not specific to the roommates. 


Retest Results 


Further interesting findings were yielded by
a comparison of the original results with retest
findings yielded 4 months later by subjects from
one of the three residence units. These results
are recorded in Table 2. 

The difference between subjects’ self-images 
and the original roommates’ descriptions of the 
subjects decreased after the students were separated
for 4 months. In both January and May,
there was a significant difference in these objective
measures of the self. The amount of consensus
was somewhat greater, however, in the
retest results. The t value obtained in January
was 17.57, while only 12.66 in May. This increased
similarity suggests that students who were
incompatible, and who preferred new roommates,
probably grew less hostile in their feelings toward
one another after 4 months of separation. The
earlier estimates may have been complicated by 


TABLE 2
TEST-RETEST VALUES OF t FOR INTERPERSONAL ATTRACTION

Measures a                       | January | May
1 and 5                          | 17.57*  | 12.66*
(1 and 5) and (1 and 6)          | 15.58*  | 10.80*
(2 and 5)                        | 17.96*  | 11.13
(2 and 5) and (3 and 6)          | .03     | .05
(2 - 4) and 5                    | 11.64*  | 5.35
(2 - 4) and 5; and (3 - 4) and 6 | 11.15*  | 6.54

a See Footnote a, Table 1 for description of measures.
* p < .01.

various factors which decreased accuracy of
perception. 

A second t test provided a comparison of the
self-image with the descriptions given of the
subject by both the original and new roommates.
The t values of 15.58 in January and 10.80 in
May show that both roommates became more
realistic in their appraisal of the subject. As was
expected, however, in each testing period, the
new roommate’s description of the subject agreed
more with the subject’s view of herself than the
original roommate’s description.

Another similar finding related to the percep- 
tion of consensus for originally assigned room- 
mates. Both the January and May results reflect 
marked inaccuracies in the estimates of reciprocation.
As compared with the January t score of
17.96, the retest t value was only 11.13. Both
figures indicate a difference in the way in which 
the subjects perceived their originally assigned 
roommates and the way their original roommates 
thought they were viewed by the subjects. Ac- 
curacy of perception was considerably improved, 
however, after 4 months of living apart. 

When this measure was corrected against the 
subjects' perception of how they thought most
people saw them, the test-retest values were considerably
lessened, though in the same direction.
The corrected retest t value was then 11.64 as
compared with the January t score of 5.35. At
both times there was a significant amount of
error estimate. The errors were considerably reduced,
however, on the second testing. When the
ability of subjects to predict how they were perceived
by their original roommates was corrected
and compared with their corrected ability to
guess the description given of them by their new
roommate, accuracy was again improved after a
lapse of 4 months. The corrected January t value
was 11.15 and the corrected May t value was
6.54. Without the correction factor, there was no
difference in either January (t = .03) or May
(t = .05) in the perceived similarity of satisfied
and dissatisfied roommates.

VOLUNTEERING AND THE RECALL OF INTERRUPTED TASKS 1


DONALD ROSS GREEN 


Emory University 


It was hypothesized that the proportion of interrupted tasks recalled would
be greatest among volunteers serving by choice and least among draftees
not willing to volunteer, with draftees who had been willing to volunteer
falling between the 2. Within each of these 3 groups those given task orienting
instructions were expected to recall greater proportions of interrupted tasks
than those given ego orienting instructions. College students divided into 6
groups in a 3 X 2 factorial design were tested individually . . .
procedure. The results conformed to expectations although the differences among
the volunteering groups were small. It was argued that differences in degree of
task involvement and ego involvement account for these results.


The purpose of this study was to clarify the
circumstances which promote or inhibit the
Zeigarnik effect.

1 This article is adapted from a doctoral dissertation
submitted to the University of California,
Berkeley. The author is grateful for the assistance
provided by F. T. Tyler, G. T. Buswell, D. W.
MacKinnon, and J. A. Holmes.



One common explanation of the results ob- 
tained in studies using Zeigarnik’s technique is 
that “task involved” subjects tend to recall a 
larger proportion of interrupted than completed 
tasks (the Zeigarnik effect) whereas this tendency 
is reversed among “ego involved” subjects.
Taken at face value the data do not entirely support
this interpretation. For example, Zeigarnik
(1927), Marrow (1938), and Lewis and Franklin
(1944) each obtained a strong Zeigarnik effect
under circumstances that may be considered task
orienting, but Glixman (1949) and Gilmore
(1954) did not. However, following a suggestion 
made by Atkinson (1955) it may be that the 
differences in results stem from the differing 
ways in which the services of the subjects were 
obtained. Glixman and Gilmore both used stu- 
dents who were required to participate as part 
of a course. Marrow’s subjects were all volun- 
teers. Apparently some of Zeigarnik’s subjects 
were volunteers while others like those of Lewis 
and Franklin were unaware that they were sub- 
jects in an experiment. If volunteers in contrast 
to nonvolunteers tend to become task involved 
more readily and ego involved less readily the 
difficulties created by the data for such an 
explanation would be reduced. 

A task involved subject is generally described 
as one whose goal during performance is the 
solution of the problem or completion of the 
task, not primarily as a means to some other 
goal but as an end in itself. Conditions tending 
to increase the prominence of this goal in a 
subject’s hierarchy of goals may be called task 
orienting. It seems quite reasonable to assume 
that many volunteers are curious about and 
interested in the things that they are going to 
be asked to do; many of the tasks given experi- 
mental subjects are interesting and even the 
duller or more unpleasant ones frequently can 
be thought of as challenging. It is suggested 
that among volunteers this view of psychological 
experiments is common, pretty much regardless 
of the specific circumstances they envisage. If 
so, it is logical to assert that volunteers are 
particularly likely to respond to task orienting 
cues, i.e., they are particularly likely to become 
task involved. 

By the same token one can argue that some 
individuals do not volunteer because of a lack of
interest in such things, or an antipathy toward
them. To the extent this is important, nonvolunteers
should become task involved less
readily than volunteers.

A description of the ego involved subject as 
one whose dominant goals include defending,
maintaining, or enhancing his private self-esteem 
and/or his public prestige, encompasses most of 
the attributes of ego involvement cited in the 
relevant studies. Thus, one may call ego orient- 
ing those conditions which make it appear to a 
subject that a poor performance will be a reflection
on his standing in his own eyes or in
the eyes of others. Such concerns may be one
reason some people do not volunteer; one might
expect more nonvolunteers than volunteers to be
suspicious of the experimenter's motives and
perceive threat of this sort more readily. Therefore
it is probable that nonvolunteers are particularly
likely to respond to ego orienting cues.
It should be noted that when a whole group is
drafted to serve as subjects, some of these
“nonvolunteers” can be considered volunteers
since they would have volunteered if given a
chance. Hence these involuntary “volunteers”
may behave like actual volunteers.
However, a subject required to serve, even one
who would have volunteered if given a chance,
has more reason to suspect he is in a threatening
situation than a subject free to participate or . . .
some members of the class are not participating,
has the choice of assuming that the experimenter
. . . may still consider it unfair, but he has
much less reason to doubt the experimenter's
assertion and, in fact, may suspect some such
thing whether implied or not. In short, being
drafted can be an ego orienting cue and may
reinforce other cues of this sort.

Thus, three categories of subjects may be distinguished:
volunteers serving freely, volunteers
required to serve, and nonvolunteers; these
groups will be labeled F, R, and N, respectively.
It is proposed that these three groups arranged
in the order volunteers, required volunteers, and
nonvolunteers will exhibit decreasing responsiveness
to task orienting cues and increasing
responsiveness to ego orienting cues.

It is further claimed that task involvement
tends to induce the Zeigarnik effect and ego
involvement its reverse. Therefore, of the three
groups, the free volunteers should have the
highest recall ratios (i.e., number of interrupted
tasks recalled to total number of tasks recalled)
and the nonvolunteers the lowest ratios, regardless
of orientation; the mean for Group R would
then fall between those for F and N (first
hypothesis). Also, task orienting situations should
produce higher ratios than ego orienting situations
within each of these three groups (second
hypothesis). In general, task orientation should
lead to ratios greater than .5 and ego orientation
to ratios less than .5.


METHOD 
Subjects 


All subjects, 30 males and 66 females, were drawn
from students in eight sections of an undergraduate
course in Child Development at the University of
California, Berkeley. In two sections of the course in
the Fall semester and again in the Spring semester,
the experimenter, who was not known by the students,
appeared in class and asked for volunteers for
a psychological experiment which would take an 
hour at a time to be mutually determined. No further 
information was given. Sign-up sheets requesting 
name and choice of time were passed out. When 
they had all been collected a questionnaire asking 
for reasons for volunteering or not volunteering was 
distributed. These were collected, sign-up sheets were 
given to the nonvolunteers, and the class was told 
that participation was actually a course requirement. 
The whole procedure took about 20 minutes. When 
questioned about the volunteering routine the experi- 
menter said only that it had been necessary in order 
to find out who would have volunteered. Group R 
was drawn from those draftees willing to serve, and 
Group N from those not willing to volunteer. 

Those who had objection to participation were 
invited to see the experimenter privately. Five stu- 
dents in the Group N pool did so and were excused; 
the 15 from this same pool who missed their ap- 
pointments were not pursued. No Group R student 
protested and the 7 who missed appointments were 
rescheduled. Results from 32 students in each of
these two groups were used. 

The 32 volunteers serving freely (Group F) were 
drawn from the four remaining sections of the course. 
Nine of these missed appointments but were re- 
scheduled, some of them several times. Thus, although 
the subjects in this group were supposed to be left 
entirely free to participate or not as they chose, a 
number of them may have felt some pressure. 

Testing occurred during a 4-week period beginning 
5 days after the experimenter’s appearance in class. 
The average delay for each group was a little over 
2 weeks.


Materials 


Twenty heterogeneous paper-and-pencil tasks taken 
with minor modifications from MacKinnon and 
Henle (1948) were used. Each task was on a separate 
sheet of paper with ample space for the subject to 
Write or draw as directed. 

Three different task orders were used in each
experimental group. By dint of some juggling, equal
numbers of interruptions and completions for each
task in each group were achieved. Many subjects
did not complete certain tasks as scheduled
with the result that a perfect balance for each serial
position could not be maintained in four of the six
groups. A comparison with expected serial position
effects on recall suggests this may have reduced the
differences reported below very slightly. Alteration of
the tasks during trial runs to reduce this problem
proved unsuccessful.


Procedure 


Subjects were tested individually seated across a
table from the experimenter who questioned the
subject about age, academic interests, major subject,
and future plans; gave orienting instructions; presented
tasks and task instructions one at a time;
asked for verbal recall of the tasks; questioned the 
subject about his perceptions of situation, tasks, 
interruptions, and recall; and explained the experi- 
ment to the subject. The procedure took an average 
of 50 minutes, about 40 of which were spent on 
the tasks. 

For each subject, performance was interrupted on 
half the tasks scattered through the list by placing 
the next sheet over the one on which the subject 
was working and immediately giving the new directions.
After the last task all sheets were picked up
by the experimenter, stacked, and put in a drawer,
creating a brief pause after which recall began.

The subject could see that the experimenter was 
keeping a record but not what was being noted. 
Most subjects said they thought each task was being 
scored.


Orientation 


Two different general instructions were used.2
Both were modifications of the instructions used by 
Marrow (1938). 

The task orienting instructions were almost the 
same as Marrow’s Group I instructions but the word 
task was substituted for test and no mention of 
total score was made. These instructions emphasized 
that “We are interested in how rapidly and accu- 
rately each of these exercises can be completed . . .” 
and repeatedly mentioned speed, accuracy, and 
completion. 

Ego orientation was also accomplished with 
Marrow’s Group I instructions, but with some
additions. A probable relationship between test per- 
formance and success in various professions was 
asserted and the experimenter added that "Since you 
will probably be going into teaching or some pro- 
fessional field, you can see why we are interested in 
how successful you will be on these tests." A large 
majority of all subjects were, in fact, planning to
teach. For task oriented, but not ego oriented, sub- 
jects the initial questions about plans and interests 
were accompanied by the statement that they were 
asked “only because one never knows what may be 
important.” 

The groups given these instructions will be called 
Group T and Group E, respectively. Sixteen subjects 
from each of the three volunteering groups (F, R, 
and N) were arbitrarily assigned to each of these 
orientation groups. Thus, a 3 X 2 design was obtained 
and the six subgroups may be labeled FT, NE, and 
so forth according to volunteering status and
orientation.


Analysis of the Data 

If degree of task involvement versus degree of 
ego involvement is a major variable in the determina- 
tion of recall, the appropriate measure is the ratio 
of interrupted tasks recalled to total tasks recalled 


2 A third set of instructions was also used with an
additional 48 subjects. The data for these groups are
not directly relevant to the hypotheses being considered
here.

(Rosenzweig, 1952). Since for each subject 10 tasks
were interrupted and 10 tasks were completed, a
ratio greater than .5 indicates more interrupted tasks
recalled. Comparisons among mean recall ratios using
a variance analysis provide evidence relevant to the
hypotheses.
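
As a small illustration of the recall ratio defined above, the following sketch computes it for a single subject. The function name `recall_ratio` and the example counts are hypothetical; only the definition (interrupted tasks recalled divided by total tasks recalled) comes from the text.

```python
def recall_ratio(interrupted_recalled: int, completed_recalled: int) -> float:
    """Ratio of interrupted tasks recalled to total tasks recalled."""
    if interrupted_recalled < 0 or completed_recalled < 0:
        raise ValueError("counts must be non-negative")
    total = interrupted_recalled + completed_recalled
    if total == 0:
        raise ValueError("no tasks recalled; the ratio is undefined")
    return interrupted_recalled / total


# Hypothetical subject: recalls 6 of the 10 interrupted tasks
# and 4 of the 10 completed tasks.
ratio = recall_ratio(6, 4)
print(ratio)  # a value above .5 means more interrupted tasks were recalled
```

Because each subject had equal numbers of interrupted and completed tasks, a ratio above .5 corresponds to the Zeigarnik effect and a ratio below .5 to its reverse.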

The method of orthogonal comparisons, weighting 
Groups F, R, and N as +1, 0, -1, respectively,
provides a test of the first hypothesis. This procedure
assumes Group R is distinct from the other
two groups. A comparison treating R like the other
volunteers would use weights +1, +1, -2 and a
comparison treating Group R like the other draftees
would use weights +2, -1, -1. The test of the
orientation main effects is a test of the second 
hypothesis. No significant interaction is suggested 
by the hypotheses.


RESULTS 


Mean recall ratios are shown in Table 1 and 
the results of the analysis of variance in Table 2. 

The orientation differences are reasonably 
large, consistent, and as predicted. Task orienta- 
tion does tend to lead to the recall of more 
interrupted than completed tasks and ego orienta- 
tion reverses this tendency. Needless to say, these 
are only tendencies; all six distributions overlap 
a great deal. 

The differences among the volunteering groups 
are consistent, as predicted, and quite small. 
In fact, to claim significant variation due to 
volunteering differences involves a fair amount 


TABLE 1 
MEAN RECALL RATIOS 

                          Volunteering 
Orientation    Free    Required    Nonvolunteers    Total 
Task           .572    .554        .524             .550 
Ego            .493    .480        .437             .470 
Total          .533    .517        .480 

TABLE 2 
ANALYSIS OF VARIANCE OF RECALL RATIOS 

Source                  SS       df    MS       F 
Between groups          .1997     5    .0399     4.87*** 
Within groups           .7375    90    .0082 
  Total                 .9372    95 
Between groups 
  Orientation           .1530     1    .1530    18.66*** 
  Volunteering          .0460     2    .0230     2.80* 
  Interaction           .0007     2    .0004     0.05 
  Total                 .1997     5 
Volunteering 
  Predicted trend       .0436     1    .0436     5.32** 
  Deviations            .0024     1    .0024     0.29 
  Total                 .0460     2 

* p ≤ .10. 
** p ≤ .025. 
*** p ≤ .005. 
of risk. If one is willing to accept that risk then 
one can assert F > R > N with some confidence 
since Table 2 indicates that almost all the volun- 
teering variance is accounted for by the com- 
parison used. However, it may be added that 
the comparison treating F = R > N, i.e., using 
weights +1, +1, -2, would have been just about 
as good since the mean square for this comparison 
is .0421 (F = 5.13, p < .05), leaving only .0039 
for deviations. On the other hand the comparison 
treating the draftees alike in contrast to Group F, 
i.e., using weights +2, -1, -1, gives a mean 
square of .0245 (F = 2.99, p > .10) with a devia- 
tion mean square of .0215. 
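
The three comparisons discussed above can be approximated from the column totals of Table 1. The sketch below uses the standard single-degree-of-freedom contrast formula, SS = n(Σwm)² / Σw², and assumes 32 subjects per volunteering group, which is inferred here from the 95 total degrees of freedom rather than stated in this passage. Because the means are rounded to three decimals, the results only approximate the published mean squares.

```python
def contrast_ss(means, weights, n_per_group):
    """Sum of squares for a single-df contrast among equal-sized groups."""
    if abs(sum(weights)) > 1e-12:
        raise ValueError("contrast weights must sum to zero")
    estimate = sum(w * m for w, m in zip(weights, means))
    return n_per_group * estimate ** 2 / sum(w * w for w in weights)


means = [0.533, 0.517, 0.480]  # Groups F, R, N: column totals from Table 1
N_PER_GROUP = 32               # assumption: 96 subjects split evenly over 3 groups

comparisons = {
    "F > R > N (+1, 0, -1)": (1, 0, -1),
    "F = R > N (+1, +1, -2)": (1, 1, -2),
    "F > R = N (+2, -1, -1)": (2, -1, -1),
}

for label, weights in comparisons.items():
    print(label, round(contrast_ss(means, weights, N_PER_GROUP), 4))
```

Under these assumptions the three values come out near .045, .043, and .025, in the same order and of roughly the same size as the published .0436, .0421, and .0245.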


Discussion 


That orientation is an important variable has 
been demonstrated repeatedly (for example, 
Eriksen, 1952; Glixman, 1949; Lewis & Franklin, 
1944; Rosenzweig, 1943). That the degree of a 
subject’s task involvement or ego involvement 
is the major factor affecting his recall has not 
been demonstrated by these data, however sug- 
gestive they may be. None of these studies, 
including this one, has used any direct measure 
of this involvement. What has been needed is 
some demonstration that task involvement and 
ego involvement result from the orienting 
instructions. 

The present study offers considerable support 
for this idea. In addition to the claim that task 
orientation led to task involvement and ego 
orientation to ego involvement which in turn led 
to the orientation differences in recall, the dif- 
ferences in recall among the volunteering groups 
can be explained using these same constructs. If, 
as seems reasonable, volunteers are more likely 
to be interested in the tasks and less likely 
to be afraid of experimental and test situations 
than is the case among nonvolunteers, then 
volunteers in contrast to nonvolunteers will prob- 
ably respond more readily to task orientation and 
less readily to ego orientation. Thus, the dif- 
ferences in recall among volunteering groups can 
be attributed to differences in degree of task 
involvement and ego involvement. The fact that 
these differences are small is not surprising since 
the relevant variables were not directly manipu- 
lated. The major strength of this view is that 
the psychological forces responsible for the recall 
differences among the volunteering groups are 
asserted to be the same as those acting in 
the orientation groups. This interpretation of the 
data also permits a reconciliation of the differ- 
ences in results among the studies cited in the 
introduction. 

It should be added that people almost certainly 
do or do not volunteer for a variety of reasons 
not included in the preceding discussion. 


data indicate that volunteers differ in 
m nonvolunteers but do not indicate 
if any, are factors in this 


Actually, an attempt to determine the reasons 
for choosing or refusing to volunteer was made 
with a badly designed questionnaire completed 
by subjects immediately after the volunteer 
sheets were collected. The nonvolunteers 
referred to "lack of time" frequently and lack of 
interest occasionally, along with a heterogeneous 
set of other reasons. Fear of being manipulated 
was acknowledged only once and fear of adverse 
judgment of performance and the like not at all. 
The volunteers responded in an even more hetero- 
geneous manner with references to service and 
[...] being most frequent. Responses loosely 
classified as indicating curiosity, interest, and 
anticipated pleasure ran second and can be 
attributed to about one in six of the volunteers. 
Since the responses were anonymous all one can 
say is that the questionnaire data do not prevent 
the interpretations offered above. 

le explanations of the data using the 
s of task involvement and ego involvement 


is are rather vaguely defined. For example, 
it is not clear whether ego involvement refers to 
fears, hopes, or both, and to concerns about 
private self-esteem or about public prestige. As 
long as such large uncertainties remain the use 
of recall ratios as indications of personality does 
not seem justified. 
CRITIQUE AND NOTES 


RELATIONS BETWEEN NEEDS OF FRIENDS AND FIANCÉS 


THOMAS J. BANTA AND MAVIS HETHERINGTON 


University of Wisconsin 


This study was undertaken to investigate complementarity and similarity of 
needs in mate and friendship selection. The Edwards Personal Preference 
Schedule (EPPS) was administered to 29 clusters of 6 Ss each. A cluster 
consisted of an engaged couple and a male and female friend of each fiancé. 
There was evidence for similarity of needs in mate and friendship selection, 
but no consistent evidence for complementarity was found. Engaged couples were 
significantly alike on 8 of the 15 EPPS needs. In friendship pairings, females 
select female friends who are like themselves, males select male friends with 
few similar needs. The male fiancé selects the same type of woman as a 
friend and as a future spouse, but the female fiancée picks a male friend and 
a fiancé who are dissimilar. These findings indicate that different processes are 
involved in the formation of like-sex and opposite-sex pairs fulfilling dif- 
ferent interpersonal roles. 


Factors influencing friendship and mate selec- 
tion have been the subject of considerable 
speculation and research. Complementarity of 
needs and personality characteristics were ad- 
vanced by Murray (1938) and Rogers (1951) 
as the basis for friendship choice. Newcomb 
(1956) found perceived personality similarity 
to be an important factor in the selection of 
friends by male college students. Izard (1960) 
using both male and female subjects reported 
that personality profile similarity on the Ed- 
wards Personal Preference Schedule (EPPS) 
was a correlate of friendship choice. He found 
no evidence for selection on the basis of com- 
plementarity of needs. Reilly, Commins, and 
Stefic (1960), also using the EPPS with a group 
of female Catholic college students, obtained no 
evidence for a relationship of either similarity 
or complementarity in the personality needs of 
friends. Thus no evidence is available to support 
the role of complementarity of needs in friend- 
ships and evidence supporting similarity is not 
always found. None of these studies attempted 
to evaluate sex differences in friendship choice. 

The results of studies on the relationships of 
needs in mate selection have also been in- 
consistent. Winch, Ktsanes, and Ktsanes (1955) 
found evidence of complementarity in choice of 
a mate. Bowerman and Day (1956) found no 
consistent support for either similarity or 
complementarity of needs in courting couples, 
nor did Katz, Glucksberg, and Krauss (1960) 
in ranking the needs of married couples. 

The present study involves more complex 
clusters of social relationships than have been 
previously used in studies of mate and friend- 
ship choice. An attempt is also made to dif- 
ferentiate between the selective processes in- 
volved in like-sex and different-sex pairs, and 
in pairs of subjects having clearly defined 
interpersonal roles. 


METHOD 

The EPPS was administered to 29 clusters of 
six subjects each, a total of 174 subjects. A cluster 
consisted of an engaged couple and a male and 
female friend of each fiancé. Contact with the 
subjects in a cluster was made through the 
female fiancée. No duplication of subjects was 
permitted within a cluster or across clusters. 


RESULTS 


Ninety scores were computed for each of 
the 29 clusters of subjects. The 90 scores were 
based on the 15 EPPS need scores for each 
of the six subjects in a cluster. All Pearson 
product-moment intercorrelations of these 90 
scores were computed. The resulting correla- 
tions are not of equal importance for the 
present analysis. These correlations were evalu- 
ated in terms of the relations between the same 
needs and the relations between different needs. 

Table 1 provides only the correlations describ- 
ing the degree of similarity between the 15 
possible pairs of subjects for the same need 
scales. 

The significant relationships are not dis- 
tributed equally among the columns of the table. 
Three of the 15 columns account for over half 
of the significant values. For the engaged 
couples (Column 1) 8 of the 15 needs are sig- 
nificantly correlated beyond the .05 level. All 
these correlations are positive, indicating a gen- 
eral preference for a person with similar rather 
than complementary needs among engaged 
couples. The other two relationships character- 
ized by highest need similarity are those of the 
female fiancée and her female friend (Column 7), 
and the female fiancée and the male fiancé's 
female friend (Column 8). Other role relation- 
ships have only zero to five significant need 
correlations. 

The 15 needs did not differ greatly in con- 
tributing to similarities of the pairs of sub- 
jects. At least one significant correlation, but 
no more than six, was observed for each row. 

In addition to examining similarity between 
pairs of subjects with respect to the same need, 
similarity between pairs of subjects with re- 
spect to different needs was analyzed. 

The pairs used in this analysis were: the 
engaged couple, the male fiancé and his male 
friend, the male fiancé and his female friend, 
the female fiancée and her male friend, and 
the female fiancée and her female friend. Each 
of these cross-need matrices included n(n-1) or 
210 correlations. Only 3-10% of the correlations 
in these matrices were significant at the .05 
level. For the engaged couples, only 7% of the 
cross-need correlations were significant. Most 
of the need pairings which were significant could 
not be interpreted in a meaningful pattern on 
the basis of logic, past research, or theory. 
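
The bookkeeping behind these correlation counts can be sketched as follows. The role labels, the random placeholder scores, and the helper names are illustrative assumptions; no actual EPPS data are reproduced, so the printed correlation has no substantive meaning.

```python
import math
import random
from itertools import combinations, permutations

ROLES = [
    "male fiancé", "female fiancée",
    "his male friend", "his female friend",
    "her male friend", "her female friend",
]
N_NEEDS = 15
N_CLUSTERS = 29

# 6 roles taken two at a time: the 15 possible pairs of subjects.
role_pairs = list(combinations(range(len(ROLES)), 2))
# Ordered pairs of different needs: n(n-1) = 210 cross-need correlations.
cross_need_pairs = list(permutations(range(N_NEEDS), 2))


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Placeholder scores only: scores[cluster][role][need], filled with random numbers.
random.seed(0)
scores = [[[random.gauss(0, 1) for _ in range(N_NEEDS)]
           for _ in ROLES] for _ in range(N_CLUSTERS)]

# Same-need correlation for the engaged couple on need 0, computed across clusters.
his = [cluster[0][0] for cluster in scores]
hers = [cluster[1][0] for cluster in scores]
r = pearson(his, hers)

print(len(role_pairs), len(cross_need_pairs), len(ROLES) * N_NEEDS)  # 15 210 90
```

Each correlation in such a design is computed across the 29 clusters, so every coefficient rests on 29 paired observations.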


DISCUSSION 


The results of this study indicate that engaged 
couples have similar needs and yield no con- 
sistent evidence for complementarity of needs 
(Column 1). Cross-need correlations did not 
occur in any meaningful or systematic pattern 
to support complementarity of needs. 

In friendship pairings there is again evidence 
for similarity of needs and none for complemen- 
tarity. The significant cross-need correlations 
for friendship pairs did not occur on variables 
which would yield significant support for com- 
plementarity of needs. Interesting differences in 
the relationships of needs within the various 
friendship pairings are found in Table 1. 



Engaged males like the same type of women 
both as a friend and fiancée (Column 8), but 
there is little similarity between the male 
friend and the fiancé of engaged females 
(Column 5). In selecting friends of the op- 
posite sex, males selected women who were like 
themselves on autonomy, deference, and hetero- 
sexuality (Column 4). These needs were also sig- 
nificantly related in the engaged couple. The 
female fiancée is only similar to her male friend 
in the need for autonomy (Column 9). This 
suggests that roles fulfilled in an engagement 
and a friendship by members of the opposite 
sex, are more differentiated for females than 
for males. 

In choosing same-sex friends women pick 
other females whose needs are very like their 
own (Column 7); males select same-sex friends 
who are significantly like themselves only in 
succorance and abasement (Column 2). These 
results are not in agreement with those of 
Newcomb (1956) or of Reilly et al. (1960). 
The difference between the present results and 
Newcomb’s finding of similarity of personality 
in male friends is probably due to Newcomb’s 
use of perceived personality similarity and this 
study’s use of actual similarity of needs as 
measured by the EPPS. The lack of agreement 
with the Reilly et al. study, which found no 
similarity in the needs of female friends as meas- 
ured by the EPPS is difficult to explain although 
it may be due to differences in the population 
sampled, i.e., an all Catholic sample from 
Catholic University of America versus the 
present heterogeneous sample from a large state 
university. Since Izard (1960) who did find 
profile similarity on the EPPS did not dif- 
ferentiate between male and female subjects, his 
results are not directly comparable with the 
findings of this study. 

The engaged couple tends to pick male 
friends who are alike and female friends who are 
alike (Columns 12 and 13). However there is 
little relation between the needs of the male 
friends and the needs of the female friends 
chosen by the engaged couple (Columns 10, 
11, 14, and 15)—only 4 correlations out of 
these 60 are significant at the .05 level and 1 
of these, the need for change, is negative. 
ERRATUM 


On page 379 of the article “Individual 
Differences, Situational Variables, and Per- 
sonality Research,” by Irwin G. Sarason 
(J. abnorm. soc. Psychol., 1962, 65, 376- 
380), the second line of Paragraph 1 should read: 


(P < .05) was that involving H scores for 
EFFECTS OF FILM VIOLENCE ON INHIBITIONS 
AGAINST SUBSEQUENT AGGRESSION ¹ 


LEONARD BERKOWITZ AND EDNA RAWLINGS 


University of Wisconsin 


As a test of the possibility that recently-angered people can “purge” their 
anger through viewing filmed scenes of aggression, 160 college men and 
women were shown a 7-minute prize fight scene after having been either 
deliberately insulted or treated in a neutral fashion by a male graduate student. 
Just before the Ss saw the movie, E, a female graduate student, provided 
them with 1 of 2 synopses of the movie plot. In half of the cases E told the 
Ss the film protagonist (who took a bad beating in the fight) was a downright 
scoundrel. The fantasy aggression witnessed by this group was presumably 
regarded as justified. The remaining Ss were told that the protagonist was 
not really bad, and they presumably came to regard the filmed aggression as 
less justified. All Ss rated the male graduate student after seeing the movie. 
In opposition to the thesis of vicarious hostility reduction, the authors had 
predicted that the justified fantasy aggression would produce heightened 
overt hostility toward the insulting male graduate student by lowering in- 
hibitions against aggression. The questionnaire ratings support the prediction. 


The frequency, and popularity, of tele- 
vision programs featuring violence and aggres- 
sion have provoked considerable debate in 
recent years (Berkowitz, 1962; Klapper, 
1960). What effects do aggressive scenes 
in TV and the movies have? There is no 
question but that the mass media can teach 
certain forms of behavior. Newspapers, books, 
radio, TV, and movies may influence attitudes 
and values in a substantial proportion of their 
audience (Hovland, 1954). But the contro- 
versy centers largely about the question of 
the instigational qualities of media events. 
Does fantasy violence enhance or reduce the 
likelihood of later aggressive behavior by 
audience members? More generally, can the 
mass media instigate socially disapproved be- 
havior by arousing otherwise inactive or sup- 
pressed drives? Some critics answer such a 
question with a definite and unhesitant “yes” 
(e.g., Wertham, 1954). However, on the op- 
posite side of the fence, a substantial group 


[...] pent-up aggressive energy within the 
audience (cf. Klapper, 1960). 


¹ This investigation was supported by Research 
Grant [...]-1540(03) from the National Institute of 



The empirical evidence bearing on this 
controversy seems to be inconsistent and 
contradictory. On the one hand, some of 
the earliest studies in this area found that 
filmed events can excite audience members 
physiologically, particularly if they do not 
discount the scenes or otherwise dissociate the 
fantasy situation from their own circum- 
stances (Dysinger & Ruckmick, cited in Char- 
ters, 1935, pp. 25-27). Other research has 
indicated that fantasy violence may increase 
the probability of subsequent aggressive be- 
havior, perhaps because of a similar emotional 
arousal. Lovaas (1961) found that nursery 
school children exposed to an aggressive 
movie cartoon made more aggressive responses 
with a toy than did other children shown a 
less violent film, while Bandura, Ross, and 
Ross (1963) noted that preschool youngsters 
who witnessed the actions of an aggressive 
adult model on a movie screen tended to 
imitate this behavior later after they had been 
subjected to mild frustrations. 

Feshbach (1955, 1961) has been perhaps 
the foremost proponent of the “symbolic 
catharsis" doctrine, an opposing point of view. 
He has maintained that the symbolic expres- 
sion of aggressive responses, or participation 
in vicarious aggressive activity, can weaken 
the instigation to subsequent aggression under 
appropriate conditions. Such a reduction in 
aggressive drive supposedly occurs only if the 
aggressive drive had been aroused at the time 
of participation in the vicarious activity; “a 
child’s anger toward its mother will not be 
reduced by an aggressive act toward a doll 
figure unless its anger toward the mother is 
aroused when the aggressive act is performed” 
(Feshbach, 1961, p. 381). From this point 
of view, then, the previously cited studies 
obtained a heightened occurrence of aggres- 
sive responses after exposure to the media 
violence because the subjects had not been 
angry at the time. To buttress his argument, 
Feshbach (1961) reported that male college 
students who had been angry exhibited less 
hostility on a questionnaire and word associa- 
tion task after witnessing a filmed prize fight 
scene than did similarly insulted men shown 
a more neutral movie. The fantasy aggression 
did not influence the incidence of aggressive 
responses in the noninsulted groups. 

Berkowitz, in questioning the symbolic 
catharsis hypothesis (1960, 1962), has sug- 
gested (1962) that hostility catharsis can be 
considered a special case of the “Ovsiankina 
effect,” or a “completion tendency.” A person 
motivated and set to attack a frustrater sup- 
posedly experiences a tension increase if he is 
prevented from doing so. The aggressive re- 
sponse sequence theoretically is not completed 
until the goal response is made, i.e., until he 
perceives that the anger instigator, or some- 
one closely associated with him, is aggres- 
sively injured by himself or some acceptable 
substitute. Symbolic catharsis is possible 
according to this formulation, but should gen- 
erally be relatively ineffective as a means of 
reducing the instigation to aggression. The 
angered audience member presumably obtains 
“completion” only to the extent that the 
fantasy victim of aggression is associated with 
his own frustrater, and this association should 
be very slight for most adults. 

How then can Feshbach’s results be inter- 
preted? One obvious possibility is that the 
decrease in aggressive responses stemmed from 
the arousal of inhibitory tendencies in the 
angered subjects shown the filmed violence. 
As a test of this alternative explanation, 
suppose that restraints against aggression are 
lowered in angered people witnessing scenes 
of violence. The symbolic catharsis hypothe- 
sis surely would predict that the decreased 
inhibitions would facilitate the cathartic 
process. Because they now have weak guilt 
barriers against aggressive responses, these 
people should participate in the vicarious 
aggressive activity with few restraints; they 
should enter wholeheartedly into the aggres- 
sive scene shown to them and, consequently, 
should experience a relatively strong emo- 
tional “purge.” The instigational conception, 
on the other hand, predicts the opposite ef- 
fect. According to this thesis, people seeing 
fantasy violence under conditions lowering 
their inhibitions against aggressive responses 
should display an increased likelihood of 
subsequent aggression. 

The experiment reported in the present 
paper is based on the latter expectation. 
College students are shown an aggressive 
incident, with some of the subjects led to 
believe the fantasy victim of the aggression 
is a scoundrel who more or less deserved the 
beating he received. If we assume the students 
regard this depicted violence as justified ag- 
gression, the fantasy scene should elicit only 
weak restraints against making hostile re- 
sponses. Rather than leading to a more 
effective purge of any anger they might feel, 
the reduced inhibitions (we contend) will give 
rise to relatively strong expression of hostility 
towards a person who had insulted them just 
before the movie. 


METHOD 
Subjects 


Male and female college students enrolled in intro- 
ductory psychology classes at the University of 
Wisconsin volunteered to participate in a survey of 
reactions to movies and TV for which they would 
receive points counting toward their final course 
grade. The students were scheduled in groups of 
10 and more, with the entire experimental treatment 
being administered to the group as a whole. How- 
ever, since scheduling difficulties made it impossible 
to have the same number of men and women in 
each group, the resulting disproportionality in the 
number of male and female subjects in each condi- 
tion was overcome by randomly discarding cases.² 
There are 10 men and 10 women in each of the 
eight experimental conditions in the final sample. 


² In order to lessen the possibility of bias, ER, 
who served as the experimenter in this study and 
who was much less wedded to our initial hypothesis 
than was LB, carried out the elimination of the 
superfluous cases. 


Experimental Procedure 


The experimental conditions were created in a 
2 × 2 × 2 factorial design. After the subjects had 
settled into their seats the experimenter informed 
them that another “graduate student” had requested 
permission to take a few minutes of their time in 
order to collect normative data for a study in 
which he was engaged. The experimenter introduced 
the graduate student and then left the room. 
This student, whose actual function was to serve 
as the major attitude object in the study, then pro- 
ceeded to administer a very brief intelligence test. 
The instructions for the test varied with whether or 
not he was to anger the group. In all cases the 
examiner said that the test had been developed with 
Eastern college students and that the original scores 
might not be applicable to a Midwestern group. In 
the Angered condition he then went on to say 
that Eastern students undoubtedly were brighter and 
[...] than Midwestern students. He also explained 
the importance of normative data and gave the task 
instructions in a condescending manner in this condi- 
tion, and badgered the subjects as they worked on 
the test. The Nonangered subjects, on the other 
hand, were treated in a neutral fashion without being 
insulted or patronized. 


[...] series of studies, the writers also sought to deter- 
mine the consequences of an attitude assessment im- 
mediately after the anger arousal treatment. Would 
answers to the postmovie questionnaire be affected 
by responses to a questionnaire completed earlier, 
just after the anger instigation? After they had 
finished working on the intelligence test the examiner 
asked half of the groups within both the Angered 
and Nonangered conditions (Premovie Questionnaire 
subjects) to fill out a brief, one-page form. He ex- 


[...] form. (However, the experimenter was able to 
identify the subjects later by means of the biographi- 
cal information requested on the sheet: sex, birthdate, 
year in college, and place of legal residence.) The 
subjects in the No Premovie Questionnaire treatment 
were not asked to complete this form at this time. 
After the examiner had collected the intelligence 
test sheets, and the Premovie Questionnaires if they 
had been distributed, he left the room and the 
experimenter returned. She said the movie reaction 
survey would now begin. They would be shown a 
prize fight scene from the movie Champion 
and then would be asked to indicate their reactions 
to the depicted aggression. In order to help them 
evaluate this scene, the experimenter would give them 
a brief synopsis of the story. 
This summary served as the context defining the 
fantasy aggression as either “justified” or “less 
justified.” In the Justified Aggression condition the 
synopsis portrayed the protagonist (played by Kirk 
Douglas) as a cynical, heartless opportunist who 
took undue advantage of his friends. Since the movie 
clip showed the protagonist taking a fairly bad beat- 
ing, we assumed most subjects would believe the 
protagonist had received his “just desserts.” The 
story outline in the Less Justified Aggression condi- 
tion indicated that the protagonist’s opportunistic 
behavior was a natural outgrowth of the harsh 
treatment this man had received from others earlier 
in life, but that he now (just before the fight) felt 
guilty about his behavior, and was on the verge of 
“turning over a new leaf.” The stronger sympathy 
for the protagonist aroused in this latter condition, 
we reasoned, should make the beating he received 
in the film appear relatively undeserved. Although a 
small number of subjects had seen the movie several 
years before the time of the experiment, both 
synopses were close enough to the actual plot to be 
accepted readily. 
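
The layout of the experimental conditions described in this section can be enumerated in a short sketch. The factor labels follow the text; the dictionary keys and variable names are hypothetical conveniences, not terms used by the authors.

```python
from itertools import product

# The three two-level treatment factors named in the Method section.
FACTORS = {
    "anger": ("Angered", "Nonangered"),
    "justification": ("Justified Aggression", "Less Justified Aggression"),
    "premovie": ("Premovie Questionnaire", "No Premovie Questionnaire"),
}
# Final sample per condition, as stated under Subjects.
SUBJECTS_PER_CELL = {"men": 10, "women": 10}

conditions = list(product(*FACTORS.values()))
total_subjects = len(conditions) * sum(SUBJECTS_PER_CELL.values())

for condition in conditions:
    print(" / ".join(condition))
print(len(conditions), total_subjects)  # 8 160
```

Crossing the three factors gives the eight conditions mentioned under Subjects, and 20 subjects per condition gives the 160 subjects reported in the abstract.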


Dependent Variables 


Immediately after the conclusion of the 7-minute 
film clip the experimenter distributed the one-page 
questionnaire employed earlier in the Premovie Ques- 
tionnaire condition. She said that the subjects’ re- 
actions to the movie might be colored by their 
attitudes toward the other experimental task (i.e., the 
intelligence test), and she wanted to assess these 
attitudes in order to be able to take them into 
consideration. Some of the items on this form were 
specific to the ostensible purpose of the intelligence 
test, but two questions were designed to serve as the 
primary dependent variables. The items read as 
follows: 


4. My attitude toward this task [the intelligence 
test] might have been better if there had been 
another experimenter [ie., examiner]. 


The scale was anchored at one end by the phrase, 
“Definitely true,” and at the other end by the words, 
“Definitely false; the experimenter did not bother 
me.” 


6. If I were to serve in other psychological 
experiments I would . . . 


Here the scale ends were, “Definitely not want to be 
with the same experimenter,” and “Definitely want 
to be with the same experimenter.” 

The other items essentially asked the subjects to 
indicate how much they had enjoyed working on the 
task. In all cases the responses were to be made by 
placing a checkmark at the appropriate place on a 
14-point rating scale. 

When this form had been completed the experi- 
menter administered a two-page questionnaire (the 
Movie Questionnaire) on which they were to indicate 
their reactions to the scene they had just witnessed 
and to TV in general. One item was employed as 
a test of the effectiveness of the Justified Aggression- 
Less Justified Aggression manipulation. It asked the 
subjects: 


To what extent did you sympathize with each 
of the major characters? 


They were to rate their degree of sympathy for the 
protagonist and his opponent on separate 12-point 
scales. 

Two other questions embedded in the form at- 
tempted to assess the degree to which the subjects 
had enjoyed the prize fight scene. One question was 
fairly direct: 


To what extent did you enjoy watching the 
movie scene just shown to you? 


The other item was designed to serve as a disguised 
(or "projective") measure of their attitude toward 
the aggressive scene: 


To what extent do you believe the average 
student at this University (of your sex) would 
enjoy watching movie or TV scenes such as this? 


Again, the subjects were to respond by placing checks 
on a 12-point scale. Neither of these latter measures 
yielded significant condition differences, and there 
will be no further mention of them. 


RESULTS 


Differences in Sympathy toward the Fantasy 
Characters 


An important question has to do with the 
effectiveness of the sympathy manipulation. 
Did the story synopses provided by the ex- 
perimenter create the intended attitudes 
toward the main fantasy characters? Separate 
analyses of variance of each of the sympathy 
measures on the Movie Questionnaire demon- 
strated that the experimental manipulation 
was successful. There was a significant main 
effect for the Justified-Less Justified Aggres- 
sion variable for both items. Thus, the people 
in the Justified Aggression groups indicated 
significantly less sympathy for the protago- 
nist (F = 33.5, p < .01) and reliably greater 
sympathy for the challenger (F = 5.63, 
p < .05) than the subjects in the Less Justi- 
fied Aggression condition. The Duncan range 
tests of the condition means on the two 
items are summarized in Table 1.³ 


³ The main effect for Justified versus Less Justified 
Fantasy Aggression was the only significant effect 
revealed on the item assessing sympathy for the 
film protagonist. In the case of the second item 
dealing with sympathy for the challenger, there also 
was a significant interaction of Justification with 
Anger Arousal (F = 5.50, p < .05). As can be seen 


TABLE 1 

MEAN DEGREE OF SYMPATHY FOR THE FANTASY CHARACTERS 

                               Angered                        Nonangered 
                       Justified    Less justified    Justified    Less justified 
                       aggression   aggression        aggression   aggression 
Premovie questions:    Yes    No    Yes    No         Yes    No    Yes    No 
For the protagonist    4.8    4.8   7.5    8.1        5.5    4.9   8.4    8.2 
For the challenger     6.2    6.6   4.4    4.2        4.6    2.9   3.8    3.6 

Note. The higher the score the higher the expressed sympathy. 
Separate analyses were performed for each attitude 
object, i.e., the protagonist and the challenger. For each of 
these fantasy characters cells having any subscript in common 
are not significantly different (at the .05 level) by Duncan 
multiple range test. Each mean in this table is based on 20 cases. 
[The subscript letters are not legible in this copy.] 


character in the film clip and also the only 
well-known actor in the scene, it is reasonable 


propriety of the aggression they witnessed, 
which was mainly directed against the pro- 
tagonist, varied inversely with the intensity 
of their sympathy for this person. The sub- 
jects expressing relatively strong sympathy for 
the protagonist, in other words, presumably 
were likely to believe he did not entirely 
deserve the beating he took. Through prior 
learning, this notion of unwarranted aggres- 
sion will then evoke guilt feelings or, more 
generally, fairly strong restraints against ag- 
gression. Such movie induced inhibitions, on 
the other hand, will be relatively weak in the 
subjects observing a scoundrel receive well- 
deserved aggressive attacks. Indeed, the per- 
ception of justified aggression on the screen 
might even influence their interpretations of 
their own aggressive inclinations. They con- 
ceivably could believe that if the aggression 
against the fantasy frustrater was warranted, 
so was it permissible to aggress against their 
own frustrater, the insulting examiner. 


in Table 1, the Angered subjects in the Justified 
Aggression treatment expressed reliably greater 
sympathy for the protagonist's opponent than did the 
Nonangered subjects in this treatment. This is 
undoubtedly a "cognitive balance" effect; not liking the 
protagonist, and possibly feeling some enhanced 
resentment toward the protagonist because of a 
generalization of hostility to this fantasy character from 
the insulting examiner, they evidently felt a heightened 
liking for the protagonist's opponent, the challenger. 


Attitudes toward the Examiner and the 
Intelligence Test 


The analysis of the two items assessing the 
postmovie attitudes toward the examiner 
(reported in Tables 2 and 3) generally 
yielded the expected results. Since the Duncan 
test does not require a significant F ratio, 
Table 3 presents the condition means in the 
groups filling out the premovie form as well 
as in the groups that did not do so. The 
insulted subjects who had responded to the 
questionnaire before seeing the movie were 
apparently not strongly affected by the 
fantasy context. Their attitudes toward the 
test administrator varied, by and large, with 
his treatment of them rather than with the 
film story outline. 

For the time being let us set aside the 
question of what had happened in these people 
and turn to the subjects who had not re- 
sponded to the Premovie Questionnaire. Here 
we see the differences predicted by the 
present theoretical analysis: The insulted 
subjects seeing a fantasy villain receive his 


TABLE 2 

ANALYSES OF VARIANCE OF POSTMOVIE ATTITUDES 
TOWARD THE EXAMINER 

                                                   Item 1ᵃ              Item 2ᵇ
Source                                    df      MS        Fᶜ        MS        Fᶜ
Anger (A)                                  1    1243.22    83.24     455.62    56.97
Justified-Less Justified (B)               1      19.60                1.22
Premovie-No Premovie Questionnaire (C)     1      38.02               32.40     4.05
Sex (D)                                    1       0.90                0.90
C X D                                      1       3.02                7.22
B X D                                      1       0.90                8.10
A X D                                      1      24.02                4.90
B X C                                      1       2.02                4.90
A X C                                      1     211.60    14.17      28.90
A X B                                      1      70.22     4.70       4.22
B X C X D                                  1       3.02                0.22
A X C X D                                  1      10.00               21.02
A X B X D                                  1       0.62                0.10
A X B X C                                  1       6.40               10.00
A X B X C X D                              1      10.00                0.02
Residual                                 144      14.94                7.99

ᵃ Item 1: "My attitude toward this task might have been 
different if there had been another experimenter." 

ᵇ Item 2: "If I were to serve in other psychological experiments 
I would . . ." (definitely want or not want to be with 
the same experimenter). 
ᶜ The F ratios given in this table are significant at less than 
the .05 level. 


TABLE 3 

MEAN POSTMOVIE HOSTILITY TOWARD THE EXAMINER 

                           Angered                       Nonangered
Premovie           Justified     Less justified   Justified     Less justified
questions          aggression    aggression       aggression    aggression

Item 1ᵃ
  Yes                7.05          8.45            11.25         10.80
  No                 3.15          5.80            12.75         11.95
  All groupsᵇ        5.10          7.12            12.00         11.38
Item 2ᵃ
  Yes                6.30          5.95             8.65          8.65
  No                 3.70          5.05             8.75          8.45
  All groupsᵇ        5.00          5.50             8.70          8.55

Note. For both items the lower the score the less favorable 
the attitude toward the examiner. 

ᵃ For description of this item, see Table 2. 

ᵇ Where each cell mean for the groups divided on the Premovie 
questionnaire variable is based upon 20 cases, the cell 
means given on the All groups row are based on 40 cases each. 
Separate Duncan range tests were performed for the two 
analyses on each of the two items. In each case cells having a 
subscript in common are not significantly different from each 
other at the .05 level. 


"just deserts" expressed stronger hostility 
toward the examiner than did the similarly 
provoked subjects who had witnessed less 
justified fantasy aggression. This difference 
was significant, however, only for the first of 
the two attitude measures. 

The postmovie questions dealing with the 
subjects’ attitudes toward the intelligence test 
task produced some evidence of a resentment 
in the insulted subjects. But for these items 
no significant effects could be traced to the 
Justification of Fantasy Aggression variable, 
whether the subjects had completed the pre- 
movie form or not. Thus, on the first of 
these measures the Angered subjects (taken 
together) indicated they were reliably less 
interested in the task than the people in the 
combined Nonangered groups (F = 7.86, 
p < .01), while on the second item the 
Angered subjects said they had enjoyed the 
task significantly less than had the 
noninsulted subjects (F = 17.11, p = .001). On 
both measures the No Premovie Questionnaire 
people witnessing the justified fantasy 
aggression gave essentially the same scores as the 
No Premovie subjects who had seen less 
justified aggression. 


Effects of the Premovie Questionnaire 


The present analysis of fantasy aggression 
is supported only in the case of the subjects 


TABLE 4 

CORRELATIONS BETWEEN PRE- AND POSTMOVIE 
EVALUATIONS OF THE EXAMINER 

                Angered                        Nonangered
         Justified     Less justified   Justified     Less justified
Item     aggression    aggression       aggression    aggression
1          .81           .89              .34           .44
2          .65           .84              .48           .47

Note. For N = 20 r's of .44 and .56 are required for 
significance at the .05 and .01 levels, respectively. 


not rating the examiner before they saw 
the film. What happened then, in the sub- 
jects completing the Premovie Questionnaire? 
Did the insulted subjects in this group 
"discharge" their anger in responding to the 
preliminary questions? 

The total pattern of findings seems to pre- 
clude such a possibility. While the provoked 
subjects in the Premovie Questionnaire group 
did express somewhat more favorable atti- 
tudes toward the examiner after the film than 
did their similarly insulted peers who had not 
evaluated him earlier, internal evidence suggests 
the difference could have arisen from 
felt pressures to appear consistent in the 
subjects in the former "treatment." Thus, the 
mean premovie ratings of the examiner in the 
Angered condition were 8.0 on Item 1 and 
7.3 on Item 2, which were fairly close to the 
same subjects' final evaluation scores of 7.8 
and 6.1 on these items. Somewhat more convincing 
are the correlations between pre- and 
postmovie ratings of the examiner given in 
Table 4. The Angered subjects evidently were 
particularly likely to repeat their initial evalu- 
ations in rating the test administrator on the 
final questionnaire. Whatever the reason for 
such a high "reliability," the considerable 
degree of consistency in the ratings of the 
anger instigator indicates we should not look 
for movie effects in the Premovie Question- 
naire condition. Clearly, the consequences of 
witnessing fantasy aggression can best be de- 
tected in the groups not asked to evaluate 
the test administrator prior to the film. The 

remaining discussion will therefore deal with 
only these latter subjects. 
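
The significance thresholds quoted in the note to Table 4 can be checked with a short calculation. The sketch below converts a critical t value into the corresponding critical r for N = 20 (df = 18). The two-tailed critical t values of 2.101 and 2.878 are taken from a standard t table and are an assumption of this illustration, not something stated in the article; the function name `critical_r` is likewise hypothetical.

```python
import math


def critical_r(t_crit, n):
    """Smallest |r| reaching significance, given the critical t for df = n - 2."""
    df = n - 2
    return t_crit / math.sqrt(t_crit ** 2 + df)


# Assumed two-tailed critical t values for df = 18 (standard table values).
r_05 = critical_r(2.101, 20)  # .05 level
r_01 = critical_r(2.878, 20)  # .01 level

# Correlations reported in Table 4 for the two Angered groups (Items 1 and 2).
angered_r = [0.81, 0.89, 0.65, 0.84]
all_angered_beyond_01 = all(r > r_01 for r in angered_r)
```

Under these assumptions the thresholds round to .44 and .56, and every correlation reported for the Angered groups exceeds the .01 threshold.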

Effects of Intracondition Differences in Sympathy 
for the Protagonist 


If fantasy induced restraints against aggres- 
sion varied directly with the level of sympathy 
felt for the main recipient of the fantasy 
aggression, as we suggest, the subjects in 
the Angered-Justified Aggression condition 
expressing the least sympathy for the film pro- 
tagonist should have displayed the greatest 
hostility toward the examiner. In order to 
test this possibility, the men and women in 
each experimental condition were divided at 
the condition median for their sex in terms 
of their indicated sympathy for the protagonist. 
The 10 “high sympathy” men and 
women within each condition were then com- 
bined, as were the 10 “low sympathy” men 
and women, and separate analyses of variance 
were then performed for each item employing 
the resulting 2 X 2 X 2 factorial design. 
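
The within-sex median split described above can be sketched in a few lines. This is only an illustration under stated assumptions: the subject records and sympathy ratings are made up, the function name is hypothetical, and ties at the median are resolved by sort order, a detail the article does not specify.

```python
def split_condition_by_sympathy(subjects):
    """Split one condition's subjects into high and low sympathy halves.

    The split is made separately within each sex and the halves are then
    pooled, as the text describes. Ties are broken by sort order, which is
    an assumption of this sketch.
    """
    high, low = [], []
    for sex in ("M", "F"):
        same_sex = sorted(
            (s for s in subjects if s["sex"] == sex),
            key=lambda s: s["sympathy"],
            reverse=True,
        )
        half = len(same_sex) // 2
        high.extend(same_sex[:half])
        low.extend(same_sex[half:])
    return high, low


# Hypothetical condition: 10 men and 10 women with made-up sympathy ratings.
condition = [
    {"id": i, "sex": "M" if i < 10 else "F", "sympathy": (i * 7) % 11}
    for i in range(20)
]
high, low = split_condition_by_sympathy(condition)
```

With 10 men and 10 women in a condition, each half contains 5 men and 5 women, which gives the 10 "high sympathy" and 10 "low sympathy" subjects per condition mentioned in the text.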

Table 5 summarizes the Duncan range tests 
for the two items tapping attitudes toward 
the examiner. As can be seen in the table, 
our prediction was upheld for both items. 
The Angered subjects seeing a fantasy villain 
receive a deserved beating and who felt 
comparatively little sympathy for him generally 
displayed greater resentment toward 
the insulting examiner than did any of the 
other provoked subjects. 

This is no simple matter of the low sympathy 
subjects having characteristically strong 
aggressive tendencies. Their low sympathy 
level apparently does not mean these subjects 
were generally inclined to be unfriendly to the 


TABLE 5 

MEAN POSTMOVIE ATTITUDE SCORES IN THE NO 
PREMOVIE QUESTIONNAIRE GROUPS 

                   Angered                        Nonangered
            Justified     Less justified   Justified     Less justified
            aggression    aggression       aggression    aggression
Sympathy:   High   Low    High   Low       High   Low    High   Low

Item 1      4.6    1.7    5.7    5.9       12.3   13.2   11.5
Item 2      5.4    2.0    4.7    5.4        8.5    9.0    8.0    8.9

Note. For these items the lower the score the less favorable 
the attitude toward the examiner. Separate Duncan range tests 
were conducted for each measure. Each mean in this table is 
based on 10 cases. 


examiner; sympathy scores were not related 
to attitudes toward the examiner in the 
Angered-Less Justified Aggression condition 
or in any of the Nonangered groups. The sub- 
jects expressed a relatively strong hostility 
toward the examiner primarily when they had 
been insulted by him and then presumably 
had their inhibitions against aggressive re- 
sponses lowered by seeing another frustrater— 
the protagonist—get a just comeuppance. The 
low sympathy toward the victim of the 
fantasy aggression in this Justified Aggression 
condition probably indicates a fairly complete 
acceptance of the story justifying the movie 
violence. Since the story outline clearly did 
not support the fantasy violence in the Less 
Justified Aggression condition, low sympathy 
here conceivably does not have the same sig- 
nificance; even the unsympathetic subjects in 
this treatment might have had some doubts 
as to whether the protagonist’s beating was 
really warranted. Consistent with this, we 
might note that the mean level of sympathy 
for the protagonist was 2.4 in the Angered- 
Justified Aggression-Low Sympathy group, 
but was twice as high, 5.2, in the comparable 
provoked, Less Justified Aggression treatment. 


Discussion 


In the absence of suitable control groups 
we cannot say unequivocally whether the 
justified fantasy aggression lowered inhibitions 
against subsequent hostile responses 
or the context of less justified aggression 
strengthened such restraints.* In either case, 
however, the present data suggest that agencies 
seeking to prescribe a particular moral 
code for motion picture and TV producers 
may at times be faced with a paradox. On 
one hand, such codes usually insist films 
should teach the lesson, "crime does not pay." 
The transgressor should always be punished. 
The criminal must be caught, and the villain 
receive his "just deserts." Yet, if we can 
generalize from the present findings, under 
some conditions such prescribed endings may 
give rise to other socially unfortunate 
consequences. If the villain is defeated or punished 
aggressively (if he obtains the beating he 
merited, as is typical in most melodramas) 
we clearly have a case of justified aggression, 
and this type of fantasy violence may actually 
increase the likelihood that some recently 
angered member of a movie or TV audience 
will attack his own frustrater, or perhaps 
even some innocent people he happens to 
associate with the anger instigator. Seeing the 
fantasy villain "get what he deserved" may 
make the angered individual more inclined to 
hurt the villain in his life, the person who had 
angered him. In essence, our findings point to 
possible dangers in movie and TV adherence 
to the Mosaic injunction of an "eye for an 
eye." An aggressive villain should perhaps be 
punished, but apparently not aggressively. 

* A subsequent experiment completed as this article 
went to press suggests that the justified fantasy 
aggression lowered the subjects' restraints against 
expressing their own hostile inclinations. 

If the obtained differences in the Angered 
conditions arose primarily from increased re- 
straints against hostility in the Less Justified 
Aggression treatment we would have a pos- 
sible explanation for Feshbach's (1961) 
recent findings. His angered subjects witness- 
ing the movie violence may have defined the 
fantasy aggression as unwarranted, and the 
result could have been strengthened inhibi- 
tions against all forms of hostility. 

Berkowitz has also pointed out that the 
strong arousal of socially disapproved drives 
often serves to elicit fairly strong restraints 
against the display of the prohibited actions 
(Berkowitz, 1962; Clark, 1955; Thibaut & 
Coules, 1952). Such a phenomenon con- 
ceivably could have occurred in Feshbach’s 
Angered subjects shown the aggressive film. 
The prize fight movie could have stirred them 
up all the more, producing a relatively high 
arousal level. But, possibly doubting whether 
the fantasy aggression was warranted, and 
extremely conscious of the other people about 
them—the subjects were assembled in small 
groups—this strong arousal might have led 
to relatively intense guilt and/or aggression 
anxiety. As a consequence, they could have 
leaned over backwards to avoid displaying 
hostility, whether this was on the final 
questionnaire or on the word association task. 

The doubts just expressed, however, should 
not be taken as a denial of the possibility 
of any anger reduction through witnessing 
movies or TV. Although the empirical justification 
for this is not as good as we would 
want, Berkowitz (1962) has proposed that 
aggressive films may weaken the audience’s 
instigation to aggression in at least two dif- 
ferent ways. One, angered audience members 
may establish a strong association between 
their own frustraters and the fantasy victim 
of aggression. Theoretically at least, their 
instigation to aggression should lessen to the 
extent that they associate the attacked—and 
injured—person on the screen with their 
own tormentors. But in addition, the film— 
whether it is violent or not—may make the 
audience members forget about the thwartings 
they have suffered. Carried away by the 
movie, not brooding on the injuries done 
them, their anger might well subside fairly 
quickly. Their instigation to aggression has 
weakened because they have not kept 
themselves stirred up. 


N ACHIEVEMENT, TEST ANXIETY, AND SUBJECTIVE 
PROBABILITY OF SUCCESS IN RISK 
TAKING BEHAVIOR * 


NATHAN BRODY 


Princeton University 


This study was concerned with the effects of n Ach and Test Anxiety on 
risk taking behavior and subjective probability of success in a sequential 
decision task. The principal findings were as follows: Ss who scored high on 
n Ach and low on Test Anxiety tended to make their decisions in the inter- 
mediate quartiles of the distribution of reported confidence at the trial of 
decision more often than Ss who scored high in Test Anxiety and low in 
n Ach. High n Ach tends to bias the overall level of subjective probability 
of success upward. Ss who scored high in n Ach and low in Test Anxiety 
tended to increase confidence rapidly up to the level of 50% confidence and 
then decrease their rate of increase in confidence after the 50% level of 
confidence has been attained in comparison to Ss who scored low in n Ach 
and high in Test Anxiety. 


This study is one of a series of studies 
designed to validate a theoretical model 
(Atkinson, 1957) relating individual differ- 
ences in motivation and risk taking behavior 
(Atkinson, Bastian, Earl, & Litwin, 1960; 
Atkinson & Litwin, 1960; Feather, 1960; 
Mahone, 1960). The Atkinson model predicts 
that the relationship between resultant 
motivation and subjective probability of 
success for subjects for whom it is assumed 
the motive to avoid failure is greater than the 
motive to achieve success is a U function, 
negative at all levels of subjective probability 
of success, with maximum negative value at 
.5 subjective probability of success. The 
Atkinson model also predicts that the relationship 
between resultant motivation and 
subjective probability of success for subjects 
for whom it is assumed the motive to achieve 
success is greater than the motive to avoid 
failure is an inverted U function, positive at 
all levels of subjective probability of success, 
with maximum positive value at .5 subjective 
probability of success. Subjects who score 
high on the TAT measure of n Achievement 
(Atkinson, 1958; McClelland, Atkinson, 
Clark, & Lowell, 1953) and low on the 


* This paper is based in part on a doctoral dissertation 
submitted to the University of Michigan 
in 1960. The author wishes to express appreciation 
to his chairman, John W. Atkinson, and to the 
National Science Foundation for support during the 
conduct of this research. 


Mandler-Sarason (1952) Test Anxiety Ques- 
tionnaire are assumed to be higher in the 
motive to achieve success than in the motive 
to avoid failure. Subjects who score low in 
n Ach and high in Test Anxiety are assumed 
to be higher on the motive to avoid failure 
than the motive to achieve success. Previous 
research (Atkinson & Litwin, 1960) has sup- 
ported Atkinson’s model by use of a ring 
toss game. 
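
The U and inverted U functions described above can be illustrated numerically. The sketch below assumes the multiplicative form usually attributed to Atkinson (1957), in which the incentive value of success is taken as 1 - Ps and the incentive value of failure as -Ps. That form is not spelled out in the present article, and the function name and motive strengths used here are hypothetical.

```python
def resultant_motivation(motive_success, motive_avoid_failure, p_success):
    """Resultant motivation under the assumed multiplicative model.

    Approach tendency:  Ms  * Ps       * (1 - Ps)
    Avoidance tendency: Maf * (1 - Ps) * (-Ps)
    """
    approach = motive_success * p_success * (1 - p_success)
    avoidance = motive_avoid_failure * (1 - p_success) * (-p_success)
    return approach + avoidance


# Subjective probabilities of success from .1 to .9.
levels = [i / 10 for i in range(1, 10)]

# Hypothetical motive strengths: success-oriented vs. failure-avoiding subject.
success_oriented = [resultant_motivation(2.0, 1.0, p) for p in levels]
failure_avoiding = [resultant_motivation(1.0, 2.0, p) for p in levels]

peak_level = levels[success_oriented.index(max(success_oriented))]
trough_level = levels[failure_avoiding.index(min(failure_avoiding))]
```

Under these assumptions the first curve is positive throughout with its maximum at .5, and the second is negative throughout with its most negative value at .5, matching the inverted U and U shapes described in the text.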

The present experiment is designed to 
extend earlier research in two ways. First, 
subjective probability of success will be 
measured directly by asking the subject to 
state his confidence in a decision he must 
make. Second, by use of a sequential decision 
task in which the probability of success goes 
from low to high as the subject is presented 
with more relevant information, it is possible 
to derive a new hypothesis from the Atkinson 
model relevant to changes in subjective 
probability of success. This experiment seeks 
to obtain data relevant to the following 
questions: 


1. Will subjects tend to make decisions at 
levels of subjective probability of success that 
will maximize their resultant motivation? 

2. What are the effects of n Ach and Test 
Anxiety on overall levels of subjective probability 
of success? 

3. What is the relationship between resultant 
motivation and changes in the subjective 
probability of success as it goes from 
low to high? 


As subjects for whom it is assumed the motive 
to achieve success is greater than the motive 
to avoid failure go from levels of subjective 
probability of success below .5 to .5, they 
should tend to increase their resultant 
motivation. However, any increase in subjec- 
tive probability of success beyond .5 should 
result in a decrease in resultant motivation 
for these subjects. If it is assumed that sub- 
jects tend to state a subjective probability of 
success that will maximize their resultant 
motivation, we should expect that subjects 
for whom it is assumed the motive to achieve 
success is greater than the motive to avoid 
failure should increase subjective probability 
of success rapidly up to a level of subjective 
probability of success of .5, and should then 
tend to decrease their rate of increase in sub- 
jective probability of success after a level of 
subjective probability of success of .5 is 
reached. Subjects for whom it is assumed the 
motive to avoid failure is greater than the 
motive to achieve success should show the 
opposite pattern of changes in subjective 
probability of success. 


METHOD 
Subjects 


One hundred and fifteen male subjects were ob- 
tained from the introductory psychology course at 
the University of Michigan during the fall semester 
of the 1959-60 academic year. Subjects were run 
in groups that varied in size from 5 to 10. 


Procedure 


The experimenter and a female assistant were 
present during the test session. The subjects were 
told they were to be given a series of tests. The n 
Ach Test was then administered with standard 
neutral instructions (McClelland et al., 1953, p. 98). 
Pictures 2, 5, 9, and 24 were presented in that order 
(Atkinson, 1958). The Test Anxiety Questionnaire 
(TAQ) was then given. 

After the subjects had completed the TAQ they 
were presented with a lithographed booklet entitled, 
Michigan Decision Making Abilities Test. The sub- 
jects were then presented with a standard probability 
learning experiment after the method of Jarvik 
(1951).² 


² The results of the probability learning experiment 
will not be discussed in this paper. They are 
not relevant to the theory and data analysis being 
discussed here. 


Sequential Decision Experiment 


The following printed instruction appeared in the 
test booklet: 


This is another test of decision making ability. 
Please write your name and code number in the 
space provided. 

The test administrator has a deck which contains 
50 cards. The deck definitely contains circles and 
may have in addition a certain number of squares. 
These are the only kinds of cards which the deck 
can have. The deck is well shuffled and the cards 
are not placed in any kind of order. You will 
have to decide how many circles there are in the 
entire deck. There can be either 5, 10, 15, 20, 25, 
30, 35, 40, 45, or 50 circles in the deck. One of 
these ten possibilities is the correct answer. The 
cards will be shown to you one at a time. You 
will be allowed only a single decision. It is up to 
you to decide when the best time to make your 
decision has been reached. You can make your 
decision at any point in the presentation of cards. 
Your performance will be evaluated by a scoring 
system. You will receive more points for a correct 
decision if you make it earlier in the series. In 
other words the sooner you make your decision 
the more points you will receive if it is correct. 
However, you will receive zero points if the de- 
cision you make is incorrect. Obviously, the more 
cards you look at the better chance you will have 
to make the correct decision. On the other hand, 
the longer you delay your decision the fewer the 
number of points you will receive for the correct 
decision. 

There is a second part to this experiment. Before 
each card is presented to you, you should check 
the appropriate position on the 100-point con- 
fidence scale. This should represent your confidence 
in the correctness of your decision if you had to 
make it before the next card is shown to you. Be 
sure you check all fifty of these scales at the 
appropriate time. Remember you are allowed only 
a single decision. When you feel that the best 
time to make your decision has arrived, write 
your decision next to the confidence scale for the 
coming trial. For example, if you wish to make 
your decision after the fortieth card is shown to 
you, place your answer next to the 41st scale. 
Feel free to ask any question you may have about 
these instructions. 


The subjects recorded their responses on a set of 
50 11-point scales with the 0, 50, and 100% positions 
marked. Each confidence response was assigned 
1 of 21 values from 0 to 100 in units of 5. 


Treatment of Data 


Of the 115 subjects who were selected from the 
subject pool, 93 arrived for the experimental session. 
Two subjects were discarded from the analysis because 
they came late and completed only part of the n 
Ach Test. The TAQ was scored by a procedure 
outlined by Atkinson and Litwin (1960). The n Ach 
Test was scored by the writer. The n Ach Test was 
scored again by the writer after a 3-month period. 
The rank-order test-retest correlation was .93 
(N=91). The product-moment correlation between 
n Ach and TAQ was .05 (N=91), a result pre- 
viously reported by Atkinson and Litwin, and 
Feather (1960), indicating no relationship between 
these measures under neutral conditions. 

Subjects were assigned to motive groups on the 
basis of their scores on the two motive measures. 
The groups were composed, respectively, of subjects 
who scored above the median in n Ach and in the 
highest quartile of the TAQ (N = 11); above the 
median in n Ach and in the lowest quartile of the 
TAQ (N = 13); below the median in n Ach and 
in the highest quartile of the TAQ (N = 11); and 
below the median in n Ach and in the lowest 
quartile of the TAQ (N = 9). 
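
The assignment rule just described (a median split on n Ach combined with the extreme quartiles of the TAQ) can be sketched as follows. The scores are made up, the function name is hypothetical, and two details are assumptions of this sketch rather than statements from the article: a subject exactly at the n Ach median is treated as Low, and quartile cut points are computed with the default method of `statistics.quantiles`.

```python
from statistics import median, quantiles


def assign_motive_groups(scores):
    """Map subject id -> motive group label; middle TAQ scorers are dropped.

    `scores` maps subject id -> (n_ach_score, taq_score).
    """
    n_ach_median = median(n_ach for n_ach, _ in scores.values())
    q1, _, q3 = quantiles([taq for _, taq in scores.values()], n=4)
    groups = {}
    for subject, (n_ach, taq) in scores.items():
        if taq >= q3:
            anxiety = "High"
        elif taq <= q1:
            anxiety = "Low"
        else:
            continue  # middle half of the TAQ distribution is not used
        achievement = "High" if n_ach > n_ach_median else "Low"
        groups[subject] = f"{achievement} n Ach-{anxiety} Test Anxiety"
    return groups


# Hypothetical scores for eight subjects: id -> (n Ach, TAQ).
example_scores = {
    1: (1, 10), 2: (2, 80), 3: (3, 40), 4: (4, 50),
    5: (5, 30), 6: (6, 60), 7: (7, 20), 8: (8, 70),
}
groups = assign_motive_groups(example_scores)
```

In this made-up sample, four subjects fall in the extreme TAQ quartiles, one in each motive group, and the remaining four are left out of the motive-group analyses.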


RESULTS 


Confidence statements between 30 and 70 
were selected a priori to represent intermediate 
levels of subjective probability of success. The 
number of subjects in each motive group who 
made their decision when their confidence 
levels were between 30 and 70 or less than 
30 and above 70 was obtained. There was no 
tendency for subjects for whom it is assumed 
the motive to achieve success is greater than 
the motive to avoid failure to decide at 
confidence levels between 30 and 70 more often 
than subjects for whom it is assumed the 
motive to avoid failure is greater than the 
motive to achieve success. (Four of the 13 
subjects in the High n Ach-Low Test Anxiety 
group decided at levels of confidence between 
30 and 70, and 4 of the 11 subjects in the 
Low n Ach-High Test Anxiety group decided 
at levels of confidence between 30 and 70.) 
A further examination of confidence at the 
trial of decision was made by obtaining the 
distribution of confidence statements for all 
subjects at the trial of decision. Table 1 
presents the number of subjects in each 
motive group who made their decision in the 
first, second and third, and fourth quartiles 
of the distribution of reported confidence at 
the trial of decision. 

The Low n Ach-High Test Anxiety subjects 
tend to decide in the first and fourth quartiles 
of the distribution while the High n Ach-Low 
Test Anxiety subjects tend to decide in the 
intermediate quartiles of the distribution. 


TABLE 1 

NUMBER OF SUBJECTS IN FIRST, SECOND AND THIRD, 
AND FOURTH QUARTILES OF THE DISTRIBUTION 
OF REPORTED CONFIDENCE AT THE 
TRIAL OF DECISION 

n Ach    Test Anxiety
High     High            7      1      3
High     Low             2      0     11
Low      High            3      4      4
Low      Low             0      4      5


This difference is statistically significant by a 
one-tailed Fisher exact test (p < .05). It is 
of some interest to see whether the significant 
difference between these two groups is at- 
tributable to either or both of the motiva- 
tional variables, n Ach and Test Anxiety. A 
comparison between the High n Ach and Low 
n Ach subjects in tendency to make their 
decisions in the intermediate quartiles yields 
a nonsignificant χ² (χ² = .80, df = 1, p > .05). 
A comparison between the High Test Anxiety 
subjects and Low Test Anxiety subjects in 
tendency to decide in the intermediate 
quartiles of the distribution yields a significant 
χ² (χ² = 7.38, df = 1, p < .01). The 
Test Anxiety variable is therefore the major 
contributor to the significant difference be- 
tween the High n Ach-Low Test Anxiety and 
Low n Ach-High Test Anxiety groups. 
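
The tests reported in this paragraph can be retraced from the counts in Table 1. The sketch below is an illustration only. It assumes that the third count in each row of Table 1 is the number of subjects deciding in the intermediate (second and third) quartiles and that the first two counts are the two extreme quartiles; it also assumes an uncorrected Pearson χ², since the article does not say whether a continuity correction was applied. All function names are hypothetical.

```python
from math import comb

# Counts transcribed from Table 1, keyed by (n Ach, Test Anxiety).
# ASSUMPTION: the third count in each row is the intermediate-quartile count;
# the first two counts are the two extreme quartiles.
COUNTS = {
    ("High", "High"): (7, 1, 3),
    ("High", "Low"): (2, 0, 11),
    ("Low", "High"): (3, 4, 4),
    ("Low", "Low"): (0, 4, 5),
}


def intermediate_vs_extreme(groups):
    """Pool groups and return (intermediate count, extreme count)."""
    inter = sum(COUNTS[g][2] for g in groups)
    extreme = sum(COUNTS[g][0] + COUNTS[g][1] for g in groups)
    return inter, extreme


def fisher_one_tailed(a, b, c, d):
    """Probability of a or more in the upper-left cell, margins fixed."""
    row1, col1, n = a + b, a + c, a + b + c + d
    tail = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, min(row1, col1) + 1)
    )
    return tail / comb(n, row1)


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square for a 2 x 2 table, without continuity correction."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# High n Ach-Low Test Anxiety vs. Low n Ach-High Test Anxiety.
p_fisher = fisher_one_tailed(
    *intermediate_vs_extreme([("High", "Low")]),
    *intermediate_vs_extreme([("Low", "High")]),
)

# High vs. Low Test Anxiety, pooling over n Ach.
chi_anxiety = chi_square_2x2(
    *intermediate_vs_extreme([("High", "High"), ("Low", "High")]),
    *intermediate_vs_extreme([("High", "Low"), ("Low", "Low")]),
)

# High vs. Low n Ach, pooling over Test Anxiety.
chi_achievement = chi_square_2x2(
    *intermediate_vs_extreme([("High", "High"), ("High", "Low")]),
    *intermediate_vs_extreme([("Low", "High"), ("Low", "Low")]),
)
```

Under these assumptions the one-tailed Fisher probability is about .02 and the χ² for Test Anxiety reproduces the reported 7.38. The same formula gives about .78 for the n Ach comparison, close to the reported .80.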

The intermediate quartiles of the distribu- 
tion of reported confidence at the trial of 
decision fall within a confidence range be- 
tween 65 and 95. Consequently, although the 
High n Ach-Low Test Anxiety subjects tend 
to decide in the intermediate quartiles of the 
distribution, their decisions tend to occur at 
levels that are higher than would be expected 
on the basis of Atkinson’s model. One possible 
explanation for these unexpected results 
involves Festinger's (1957) theory of cogni- 
tive dissonance. The subject is instructed to 
write his decision next to the confidence scale 
for the following trial. Consequently, the con- 
fidence statements at the trial of decision 
actually are postdecisional measures of con- 
fidence and may be increased by the subject's 
attempt to reduce postdecisional cognitive dis- 
sonance. In order to test for this possibility, 
the confidence statements at the trial preceding 
the trial of decision were examined. 
The confidence statements at this trial are 
presumably uninfluenced by postdecisional 
attempts to reduce cognitive dissonance. The 
mean confidence at the trial preceding the 
trial of decision for the High n Ach-Low Test 
Anxiety subjects is 78.1. Therefore, these 
subjects are still substantially above the 
theoretically expected level of subjective 
probability of success of .5. Furthermore, the 
mean trial at which 50% confidence is reached 
is 18.1 for the High n Ach-Low Test Anxiety 
subjects, and the mean trial of decision for 
these subjects is 27.3. Consequently, the High 
n Ach-Low Test Anxiety subjects are deciding 
on the average about nine trials after they 
have reached the 50% confidence level. These 
analyses of confidence at the trial preceding 
the trial of decision indicate that the un- 
expectedly high values of confidence at the 
trial of decision for the High n Ach-Low 
Test Anxiety subjects cannot be attributable 
solely to the effects of postdecisional cognitive 
dissonance. 

A plot of the mean confidence over trials 
by trial blocks of 12, 13, 13, and 12 is pre- 
sented for each of the motive groups in 
Figure 1. Examination of Figure 1 shows that 
the two High n Ach groups tend to have 
higher confidence ratings than the two Low 
n Ach groups. Differences in Test Anxiety 
do not appear to be consistently related to 
the overall level of confidence ratings. 

[Figure 1: mean confidence (0 to 100) plotted over trials for the four n Ach-Test Anxiety motive groups (High-High, High-Low, Low-High, Low-Low).] 

FIG. 1. Mean confidence in four motive groups by 
trial blocks of 12, 13, 13, and 12. 

The inferences drawn from the qualitative 
analysis of data in Figure 1 are statistically 
supported by an analysis of variance per- 
formed on the mean confidence over all trials 
for each subject. Table 2 presents a summary 
of this analysis. 

The analysis presented in Table 2 leads to 
two conclusions: High n Ach tends to increase 
the overall level of subjective probability of 
success. Test Anxiety is unrelated to the 
overall level of subjective probability of 
success. An analysis of variance performed 
on the confidence statements at the trial of 
decision reveals the same pattern of results. 
That is, High n Ach subjects tend to have 
higher confidence at the trial of decision 
than Low n Ach subjects (F for achievement 
= 7.87, df = 1/36, p < .01). In the analysis 
of variance of confidence at the trial of 
decision only the achievement variable is 
significant. 
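
The two conclusions drawn from Table 2 follow from dividing each mean square by the error mean square. The sketch below retraces that arithmetic from the mean squares in Table 2. The critical value of 4.20 for F with 1 and 28 degrees of freedom at the .05 level is taken from a standard F table and is an assumption of this illustration, not a figure given in the article.

```python
# Mean squares from Table 2 (analysis of mean confidence ratings).
mean_squares = {
    "Anxiety (A)": 58.1,
    "Achievement (Ac)": 1047.7,
    "Ac X A": 43.9,
}
ms_error = 182.6  # error mean square, 28 degrees of freedom

# Assumed tabled critical value of F(1, 28) at the .05 level.
F_CRITICAL_05 = 4.20

f_ratios = {source: ms / ms_error for source, ms in mean_squares.items()}
significant = [source for source, f in f_ratios.items() if f > F_CRITICAL_05]
```

Only the achievement effect exceeds the assumed critical value (F of about 5.7); the anxiety effect and the interaction both give F ratios below 1.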

The final analysis is concerned with differential 
rates of increase in confidence up to 
the 50% confidence level and after the 50% 
level of confidence has been attained. Data 
for the statistical analysis of rates of change 
in subjective probability of success were ob- 
tained in the following manner: First, the 
rate of increase in confidence up to the stated 
level of confidence of 50% was obtained by 
dividing the trial number at which a con- 
fidence value equal to or greater than 50 
was first attained, by 50. Second, the rate of 
increase in confidence after the 50% level was 


TABLE 2 

ANALYSIS OF VARIANCE OF MEAN 
CONFIDENCE RATINGS 

Source               df       MS        F
Anxiety (A)           1      58.1      <1
Achievement (Ac)      1    1047.7      5.7*
Ac X A                1      43.9      <1
Error                28     182.6

Total                31


Note. Three subjects were discarded from the [illegible] n 
Ach-High Test Anxiety group and one subject was discarded 
from the Low n Ach-Low Test Anxiety group. These four subjects 
checked their confidence ratings in terms of the 
decision they had made, rather than the decision they would 
make. They all ended the task with 0 confidence. In order to 
achieve orthogonality this analysis is performed on a 
sample of eight subjects drawn from each motive group. 
p «.05. 
TABLE 3

MEANS FOR ANALYSIS OF DIFFERENTIAL GROWTH RATE
OF CONFIDENCE BEFORE AND AFTER A SUBJECTIVE
PROBABILITY OF SUCCESS OF .50 IS REACHED

                   Test Anxiety
n Ach           High       Low
High             .01      -.08      -.04
Low              .21       .07      +.15
                 .11      -.02


obtained by dividing the difference between 
the trial number at which the highest con- 
fidence was first attained and the trial 
number at which a confidence level of at least 
50 was first attained, by the highest level of 
confidence attained minus 50. Finally the 
second measure was subtracted from the first. 
When the resulting value is negative, the 
growth rate of confidence has been more rapid 
up to 50% confidence than after 50% of 
confidence has been reached. Table 3 presents 
the mean values of this measure for the four 
motive groups.
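
The rate measure just described can be sketched as follows. This is an illustrative reading of the verbal definition, not the original computing procedure: the function name is hypothetical, the ratings are assumed to be listed in trial order, and the handling of undefined cases (a subject who never reaches 50, or whose highest rating does not exceed 50) is an assumption, since the text does not discuss them.

```python
from typing import Optional, Sequence


def differential_growth_rate(confidence: Sequence[float]) -> Optional[float]:
    """Rate measure described in the text; confidence[i] is the rating on trial i + 1.

    Returns None when the measure is undefined (50 is never reached, or the
    highest rating does not exceed 50). That convention is an assumption.
    """
    # Trial number (1-based) at which a rating of at least 50 first appears.
    trial_50 = next((i + 1 for i, c in enumerate(confidence) if c >= 50), None)
    if trial_50 is None:
        return None
    peak = max(confidence)
    if peak <= 50:
        return None
    trial_peak = confidence.index(peak) + 1  # first trial at the highest rating
    before = trial_50 / 50  # trials needed per confidence point up to 50
    after = (trial_peak - trial_50) / (peak - 50)  # trials per point after 50
    return before - after  # second measure subtracted from the first
```

Because both components are expressed as trials per confidence point, a smaller component means faster growth. A subject rating 10, 30, 50, 55, 60, 65, 70 on successive trials obtains 3/50 - 4/20 = -.14, a negative value, indicating faster growth before the 50% level than after it.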

A one-tailed t test comparing the High n
Ach-Low Test Anxiety subjects with the Low
n Ach-High Test Anxiety subjects on the rate
measure reported in Table 3 reveals that
they are significantly different (t = 1.72,
df = 22, p < .05). This finding indicates that
subjects for whom it is assumed the motive
to achieve success is greater than the motive
to avoid failure tend to approach .50 subjective
probability of success rapidly and then
decrease their rate of increase in subjective
probability of success relative to subjects for
whom it is assumed the motive to achieve
success is less than the motive to avoid failure.
As in the case of the analysis of the data
presented in Table 1, it is of interest to discover
the contribution of the two motivational
variables, n Ach and Test Anxiety, to
the significant difference that has been obtained.
The one-tailed t test comparing the
High n Ach subjects with the Low n Ach
subjects for the rate measure presented in
Table 3 reveals a significant difference
(t = 1.76, df = 42, p < .05). The t test comparing
the Low Test Anxiety subjects with the
High Test Anxiety subjects is nonsignificant
(t = 1.31, df = 42, p > .05).
Discussion


The results of this experiment are equivocal 
in support of the suggested theoretical rela- 
tionship between subjective probability of 
success and resultant motivation in different 
motive groups. The data on changes in confidence
tend to support the theoretical relationship
suggested by Atkinson’s model. However,
the data on confidence at the trial of
decision are only equivocal in support of the 
theoretical relationship. These results are in 
agreement with those of Atkinson and Litwin 
(1960) in that High n Ach-Low Anxiety 
subjects tend to take intermediate risks when 
intermediate is defined in terms of the median 
of the frequency distribution of goal setting 
behaviors. However, intermediate goal setting, 
as measured by confidence statements, does 
not coincide with the theoretically expected 
value of .5. If these results are replicated 
in other risk taking situations they would 
suggest that subjects for whom it is assumed 
the motive to achieve success is greater than 
the motive to avoid failure prefer intermedi- 
ate risks, but do not necessarily prefer situa- 
tions where the subjective probability of 
success is .5. 

The finding that n Ach positively relates 
to subjective probability of success supports 
the previous research of Pottharst (1955) 
and Atkinson et al. (1960). There has been 
no previous research on the effect of Test 
Anxiety on subjective probability of success. 
Therefore, the results neither contradict nor 
support past research, but they fail to support 
Atkinson’s (1957) conjecture that high Test 
Anxiety would decrease the overall level of 
subjective probability of success. 
To further the theoretical expectation that orientation would be associated
with leadership and related group behavior, the Bass Orientation Inventory 
was correlated with rated performance in sensitivity training groups of 32 
male supervisors and 25 female secretaries, in groups of 48 candidates for 
supervisory positions undergoing 3 days of assessment, and in reformed 
groups homogeneous in orientation. Task oriented Ss, particularly among 
men, were judged most favorably and interaction oriented Ss least favorably 
both by peers and outside assessors. However, homogeneous groupings for 
sensitivity training were most satisfying to interaction oriented Ss, whose 
leaders were most highly interaction oriented. Task oriented group leaders 
were highly task oriented, but leaders of self-oriented groups were relatively
low in self-orientation.


In Leadership, Psychology and Organizational
Behavior (Bass, 1960), three types of
group members, self-, interaction, and task 
oriented, were conceived. It was argued that 
attempts to lead under specified circumstances 
would be different among these three types of 
individuals. The Orientation Inventory (Bass, 
1962), 27 triads of questions about personal 
preferences, values, and projections, was con- 
structed to screen populations for samples of 
these idealized types. For each triad, subjects 
indicate which alternative they prefer most, 
and which they prefer least. 

The Orientation Inventory (Ori), in its 
final form, had the following test-retest 
reliabilities: self-orientation, .73; interaction 
orientation, .76; and task orientation, .75. 
The built-in negative correlations² among the


¹ This research was supported by Contract N7 onr
[illegible]; Group Psychology Branch, Office of Naval
Research.

² Like forced-choice and Kuder inventories, these
scales yield ipsative scores. Since the grand mean
is a constant, if an individual is high on one
scale, he must be low on another. This accentuates
the differentiation in responses of a given individual
among the three orientations, making it easier to type
him as one extreme or another. Equally important,
it fits the conceptualization. We argue that
group members are concerned with the task, the
interaction, or themselves. If they pay more attention
to one, then they must devote less to another.
Thus, we suggest that if a person is generally


three scales and the obtained reliabilities 
made possible a classification system labeling 
individuals as of one type or another if they 
were in the top quartile of a particular dis- 
tribution. In a classification-reclassification 
analysis, it was seen that only 6.5% of 84 
subjects shifted from one idealized category 
to another. Most shifts were into and out of 
the residual category from one of the ideal- 
ized types (Bass, Frye, Dunteman, Vidulich, 
& Wambach, in press). 
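
The built-in negative correlations among the three scales follow directly from ipsative scoring, and the point can be illustrated with a small simulation. The sketch below is hypothetical throughout: it assumes each of the 27 triads awards 2 points to the alternative preferred most, 0 to the one preferred least, and 1 to the remaining alternative (a scoring rule the text does not state), and it assumes subjects who respond purely at random.

```python
import random

N_TRIADS = 27
SCALES = ("self", "interaction", "task")


def score_inventory(responses):
    """Sum triad scores; each response is a (most, least) pair of scale names.

    Assumed scoring (not stated in the source): 2 points to the scale chosen
    as most preferred, 0 to the one chosen as least, 1 to the remaining scale.
    """
    totals = dict.fromkeys(SCALES, 0)
    for most, least in responses:
        for scale in SCALES:
            totals[scale] += 2 if scale == most else 0 if scale == least else 1
    return totals


def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def simulate(n_subjects=500, seed=0):
    """Score simulated subjects who answer every triad at random."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n_subjects):
        responses = [tuple(rng.sample(SCALES, 2)) for _ in range(N_TRIADS)]
        rows.append(score_inventory(responses))
    return rows
```

Under this assumed rule every subject's three scores sum to 81, so a high score on one scale forces lower scores elsewhere. With random responding the expected correlation between any two scales is -.5, since three equal-variance scores with a constant sum must have pairwise correlations averaging -.5.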

In addition to sex differences, occupational 
and educational differences were also found 
in expected directions. For example, engineers 
earned significantly higher task scores than 
nonengineers (Dunteman & Bass, in press). 
At all age levels—early adolescent, late 
adolescent, and middle age—women were 


task oriented, then he is unlikely to be interaction 
oriented. Our measurements follow these restraints. 
The most uniform negative correlation seems to 
be between task and interaction orientation. This 
correlation varies between -.3 and -.5. On the
other hand, self-orientation sometimes shows less
of a negative correlation with the other scales.
Thus, “mixed types” of individuals emerge. A recent
suicide was extremely high in task and self-orienta- 
tion and extremely low in interaction orientation. 
Juvenile delinquents in general tend to be low in 
task orientation and high in self- and/or interaction 
orientation. (The interaction oriented get along 
better in the institution and are paroled more 
readily.) 

more interaction oriented than men (Bass 
et al., in press). 

Overt choice behavior was found associated 
in expected directions with Ori scale scores. 
Subjects with high task orientation scores 
completed tasks voluntarily more frequently 
following interruption than subjects with low 
task scores. Subjects with high task scores 
volunteered in greater frequency for psy- 
chological experiments. Subjects with high 
interaction scores were more likely to choose 
to work in a group or to volunteer for dis- 
cussions rather than working on problems 
alone. Subjects with high self-orientation
scores were more likely to choose to work 
alone and, of all students, were most likely 
to shift from nonvolunteering to volunteering 
for service as a subject of a psychological 
experiment when an extrinsic monetary re- 
ward was added for volunteering (Bass et al., 
in press). 

On a battery of personality inventories and 
attitude questionnaires, the highly self- 
oriented subject described himself, to a 
statistically significant degree, as disagree- 
able, dogmatic, aggressive-competitive, sensi- 
tive-effeminate, introvertive, suspicious, jeal- 
ous, tense-excitable, manifestly anxious, lack- 
ing in control, immature-unstable, needing 
aggression, needing heterosexuality, lacking in 
need for change, fearing failure, and feeling 
insecure. The interaction oriented subject 
described himself as significantly in need of 
affiliation, socially group dependent, lacking 
in need for achievement, lacking in need for 
autonomy, needing to be helped by others, 
tending to warmth and sociability, and lack- 
ing in need for aggression. 

The task oriented subject described himself 
on the other inventories as self-sufficient and 
resourceful, controlled in will power, needing 
endurance, aloof and not sociable, sober and 
excitable, introvertive, radical, not dogmatic, 
lacking in need for heterosexuality, needing 
abasement, aggressive and competitive, lack- 
ing in need for succorance, low in fear of 
failure, mature and calm. He also scored 
higher in intelligence. He was a scholastic 
overachiever, if above average in ability, 
compared to equally intelligent subjects lower
in task orientation (Bass et al., in press). 
Part I


Orientation and Peer Evaluations of Discussion
Performance


The present report deals with several 
independent studies of differential behavior 
in groups of members assessed by Ori as 
self-, interaction, or task oriented. Behavior, 
as rated by peers, of 32 supervisors and 25 
secretaries in sensitivity training groups provided
two sets of correlated observations.
Another study related orientation to observers’
appraisals of the leadership potential
of 48 candidates for supervisory positions 
under intensive observation for 3 days using 
“country house” techniques. Finally, contrived
groups, homogeneous with respect to
orientation, or containing specific proportions 
of members of each type, yielded further 
evidence on the overall differences in con- 
tribution to group life of the task oriented, 
the interaction oriented, and the self-oriented. 


Procedure 


Thirty-two members of a management training 
laboratory were administered Ori at the beginning 
of the laboratory. They were assembled in 
“balanced” discussion groups of eight each, matched 
in educational and occupational level (but without 
reference to Ori score). Each group met 10 times
during 2 weeks for a total of 20 hours of dis- 
cussion about miscellaneous matters of their own 
choosing. 

Each of the discussions was of the leaderless
type. The groups were sensitivity training groups
without any formal leader, without any appointed
chairman, and without any previously decided
agenda of a formal character. Each discussion
lasted about 2 hours and, unlike most
training programs, no laboratory staff members were
present during the discussion, although tape recordings
were run of the discussions. The discussants
were free to talk about anything and
everything they wished, constrained only by the
other seven members of the group in the room.
Prior to these discussions, staff members encouraged
participants to experiment in new ways of behaving,
to give each other feedback, and to take time
during the discussion to analyze its process.

A similar procedure was carried out for 25 professional
women secretaries who met in two groups
for sensitivity training for 1 weekend and 9 successive
evenings once a week. Trainers were present
at the regular meetings. The mean age of
secretaries and supervisors was [illegible],
with a range from the early 20s to the late 50s.
After the tenth meeting, each member of a group
rated every other subject in that group on
ORIENTATION AND GROUP BEHAVIOR


items of behavior. A nine-point scale was used to
indicate how much the person the subject was
rating exhibited each of the behaviors during the
20 hours of discussion (9 = Completely, 8 = Almost
completely, . . ., 1 = Not at all). In addition each
subject ranked every other subject in his group
according to the extent the subject ranked had
successfully led or influenced the group.

The seven ratings received by each subject on
each item from fellow members in a group yielded
a mean rating on each item which in turn was
correlated with the Ori scores of that participant
(product moment).
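
The two steps of this analysis, averaging the ratings a subject receives on an item and then correlating those means with Ori scores, can be sketched as below. The data layout (a dictionary of raters, each holding the ratings given to the other members) and all function names are assumptions made for illustration; they are not taken from the original analysis.

```python
from statistics import mean


def mean_ratings_received(ratings):
    """Average the ratings each subject received on one item.

    ratings[rater][ratee] holds a 1-9 rating; self-ratings are ignored.
    """
    received = {}
    for rater, given in ratings.items():
        for ratee, value in given.items():
            if ratee != rater:
                received.setdefault(ratee, []).append(value)
    return {ratee: mean(values) for ratee, values in received.items()}


def product_moment(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def rating_ori_correlation(ratings, ori_scores):
    """Correlate mean peer rating on one item with one Ori scale score."""
    means = mean_ratings_received(ratings)
    members = sorted(means)
    return product_moment(
        [ori_scores[m] for m in members], [means[m] for m in members]
    )
```

In a group of eight, each member's mean on an item rests on seven ratings, and the correlation is then computed over members, once per item and Ori scale.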


Results 
Both among supervisors and secretaries,
task orientation and interaction orientation
scales tended to correlate much more highly
with peer ratings than did the self-orientation
scale, and to a much greater degree.

Task orientation was correlated significantly
(joint probability at the 1 to 1%
level) for both supervisors and secretaries
with the following peer ratings: helps members
express their ideas (.60, .43),³ helps
group stay on target (.58, .44), helps get to
the meat of issues (.57, .38), gives good suggestions
on how to proceed (.56, .39), provides
good summaries when needed (.52, .44),
encourages group to a high level of productivity
(.52, .45), takes the lead in selecting
topics (.49, .54), works hard (.49, .40), offers
original ideas (.46, .37), effectively senses
when to talk and when to listen (.45, .38),
successfully influences (.44, .43), concerned
about successfully completing the group’s
jobs (.33, .47), and does not run away when
faced with a problem (.48, .39).

Task orientation scores of supervisors but
not of secretaries correlated significantly at
the 1% level with: provides helpful, objective
feedback to members (.55) and easy to
understand what he is trying to say (.54).
Task orientation scores of secretaries but not
of supervisors correlated significantly at or
near the 5% level with: removal from the
group would be a loss (.39), continues to
push point even after being blocked repeatedly
(.51), annoys others (.37), dominates
and imposes her will on the group


³ The first value in parentheses is the product-moment
correlation for 32 supervisors between task
orientation score and peer rating; the second value
is the corresponding correlation for 25 secretaries.
(.46), makes unjustified assumptions (.40),
and blocks the group (.38). 

Thus, while task oriented subjects are seen 
as exhibiting initiative and aiding the group
to achieve its ends, among women greater
domineering inflexibility is perceived as going 
along with helpfulness, initiative, and suc- 
cessful influence. 

Interaction orientation scores generally revealed
a reversed pattern of rated behavior.⁴
Among both supervisors and secretaries, interaction
orientation scores were negatively correlated
(at a joint probability at the 1 or
5% level) with: concerned about successfully
completing group jobs (-.36, -.30), dominates
and imposes will on the group (-.39,
-.49), offers original ideas (-.40, -.31),
encourages group to a high level of productivity
(-.42, -.32), provides good summaries
when needed (-.43, -.32), takes the
lead in selecting topics (-.46, -.48), and
helps group stay on target (-.51, -.38).

Supervisors only who were high in inter- 
action scores were seen as: running away 
when faced with a problem (.38) and yielding 
to group pressures (.26) while highly inter- 
action oriented secretaries were rated highly 
in making others feel at ease (.30). 

Other ratings which correlated negatively
in one sample at the 1 or 5% level, but not
the other included (for supervisors only):
easy to understand what he is trying to say
(-.43); gives good suggestions on how to
proceed (-.44); provides helpful, objective
feedback to members (-.45); works hard
(-.45); helps get to the meat of issues
(-.46); helps members express their ideas
(-.55). For secretaries only, significant
negative correlations at the 5% level were
obtained between interaction orientation
scores and: annoys others (-.45) and makes
unjustified assumptions (-.48).

Interaction oriented supervisors and secre- 
taries are seen as of little positive help to 
group work; among supervisors only, as 
avoiding problems and conflicts; among 
secretaries, as avoiding being unpleasant and
unreasonable.

Self-orientation scores for both supervisors


⁴ This result is inseparable statistically or conceptually
from the negative relation between task
and interaction orientation.
and secretaries were significantly (joint
probability at the 5% level) but negatively
related with making others feel at ease
(52 185120).

Part II 


Orientation and Assessed Leadership Po- 
tential 


A major utility regularly selects journey- 
men and maintenance workers with promise, 
according to their performance records and 
boss’ opinions, and sends them to a center 
for 3 days of intensive observation in various 
quasi-real group situational tests following 
OSS procedures and methods usually de- 
scribed as the “country house” technique 
(see Vernon, 1950). Observers use a five- 
point scale to pool their ratings into a single 
overall evaluation of the promotability to 
supervisory positions of the candidates 
screened by the situational tests. 

Forty-eight of these candidates were given 
the Ori. The orientation scores, unknown to
the assessors, were then compared for the 13 
candidates earning the highest “promot- 
ability” ratings with the 13 candidates 
earning the lowest appraisals by the assess- 
ment staff. Table 1 shows the mean Ori scores 
of the 13 high and low candidates. Down- 
graded candidates were significantly higher 
(at the 1% level) in interaction orientation 
and comparably lower in self- and task 
orientation. Evidently, the lack of initiative 
of the interaction oriented strongly affected 
their appraisals, consistent with preceding 
results obtained for secretaries and super- 
visors. On the other hand, task orientation 
failed to significantly raise evaluations any 
more than did self-orientation. 


TABLE 1

MEAN ORIENTATION SCORES OF 13 CANDIDATES APPRAISED
HIGHLY PROMOTABLE TO SUPERVISORY
POSITIONS WITH 13 APPRAISED AS LOW
IN PROMOTABILITY

                      Mean orientation
Promotability     Self    Interaction    Task
High              21.8       23.8        35.5
Low               18.2       29.4        33.4
p                           < .01
Part III


Recomposed Groups Homogeneous in Orientation


To gain further insight into the behavior
of discussants as a function of their orientation,
discussion groups were set up of participants,
all of a single type. Consistent
differences between self-oriented, interaction
oriented, and task oriented groups, in meaningful
ways, were observed for both the supervisors
and the secretaries who met once in
these recomposed groups.


Procedure 


After the fourth discussion in the regular
training groups of supervisors, the groups were
recomposed for the next discussion according to
members’ Ori scores. The members did not know
the basis of recomposition. The eight members of
the supervisors’ laboratory scoring highest on the
self-orientation scale were placed in a “Blue” group;
the eight scoring highest on the interaction scale
were placed in a “Brown” group; and the eight
highest on the task orientation scale were placed
in a “Purple” group. The remaining eight were not
as high as any of the other individuals on any
of the three scales and were placed in an “Orange”
or “residual” group of individuals none of whom
were high on any of the scales, nor low therefore
on any of the scales. Each of these temporary
recomposed groups held a 2-hour meeting after
which they were dissolved and members returned
to their original groups.
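
The recomposition rule can be sketched as a simple assignment procedure. The source does not say how a member who ranked among the top eight on more than one scale was handled, so the sketch assumes the scales are filled in a fixed order (self, interaction, task) and that a member already placed is skipped; the function name, data layout, and tie-breaking by member name are likewise hypothetical.

```python
def recompose(scores, group_size=8):
    """Assign members to Blue, Brown, and Purple by scale; the rest go to Orange.

    scores maps member -> {"self": ..., "interaction": ..., "task": ...}.
    Assumption: scales are filled in a fixed order and a member already
    placed is skipped; the source does not describe how overlaps were resolved.
    """
    groups = {"Blue": [], "Brown": [], "Purple": [], "Orange": []}
    scale_for_group = (
        ("Blue", "self"),
        ("Brown", "interaction"),
        ("Purple", "task"),
    )
    unassigned = set(scores)
    for group, scale in scale_for_group:
        # Highest scores first; ties are broken by member name for determinism.
        ranked = sorted(unassigned, key=lambda m: (-scores[m][scale], m))
        groups[group] = ranked[:group_size]
        unassigned -= set(groups[group])
    groups["Orange"] = sorted(unassigned)
    return groups
```

With 32 members and a group size of eight, the procedure yields three homogeneous groups and a residual group of eight, as in the supervisors' laboratory.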

A somewhat modified plan was followed for the
secretaries. After the tenth regular meeting, 22 of
the 25 secretaries present from the two regular
training groups were recomposed into three groups,
Blue, Brown, and Purple, of 7 or 8 subjects each
of the same orientation. No residual Orange group
was composed because of the smaller number of
subjects available.


Supervisor Analyses 


At the end of each of the four preceding 2-hour
discussions in his regular training group during
the first 3 days of the laboratory, each supervisor
had filled out a postmeeting questionnaire. He did
the same at the end of the 2-hour discussion of
the recomposed group.

The questionnaire contained 12 items. Each item
was accompanied by a nine-point scale ranging
from a generally favorable to a generally
unfavorable reaction. For example, the first question
was “How do I feel about this group now?” The
respondent could indicate a reaction ranging from
1, “worst possible group,” to 9, “best possible
group.”
Results


The recomposed groups behaved as antici- 
pated, although to a degree exceeding ex- 
pectations. 

Table 2 shows the mean extent on each 
nine-point scale that a recomposed group’s 
discussion was higher in rating by its mem- 
bers than the same members had assigned to 
the last meeting of the balanced group from 
which they had come. For example, members 
of the Blue group, on the average, rated their 
Blue group .5 lower in quality (Item 1) than 
the various immediately preceding, regular 
groups. On the other hand, the interaction 
oriented members gave an average of 1 point 
more to their newly formed Brown group 
than to the various home groups from which 
they had come. In evaluating these responses, 
it should be kept in mind that in a manage- 
ment training laboratory of this sort, a great 
deal of loyalty is developed for one’s regular 
home training group. Most experienced staff 
personnel will agree that it is difficult to 
destroy this loyalty by rearranging mem- 
berships temporarily. Thus, in general, mem- 
bers were more favorably inclined to their 
original “home” group with which they had 
been meeting for the first 3 days than to 
their newly recomposed temporary group. 
But there was a striking exception—those 
who met as the Brown group of highly interaction
oriented subjects.
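
The difference scores described above amount to a mean of paired differences, computed item by item. The sketch below uses a hypothetical function name and invented ratings purely to show the arithmetic; none of the numbers are taken from the study.

```python
from statistics import mean


def mean_rating_shift(recomposed, regular):
    """Mean of (recomposed - regular) ratings over members for one item.

    Both arguments list the same members in the same order. A positive
    result favors the recomposed group; a negative one favors the regular
    group.
    """
    if len(recomposed) != len(regular):
        raise ValueError("each member needs a rating for both meetings")
    return mean(new - old for new, old in zip(recomposed, regular))


# Invented ratings for four members on Item 1 (quality of group).
shift = mean_rating_shift([6, 7, 5, 6], [7, 7, 6, 6])
```

In this invented case the members rate the recomposed group half a point lower on average, the same size of shift the text reports for the Blue group on Item 1.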

Interaction oriented members (the Brown 
group) felt quite differently about their re- 
composed group than did any of the other 
members. They seemed much more favorably 
disposed towards their recomposed group; 
and in the informal critique that followed 
they were rather extreme in this feeling. As 
noted in Table 2, they reported their recom- 
posed group had clearer goals, worked harder, 
had a more practical and realistic discussion 
centered in what was going on in the group. 
Most significantly in view of the question of 
loyalties, the interaction oriented members 
felt more “joined up” with their new recom- 
posed group than the group they had been 
meeting with for the past 3 days. 

On the other hand, self-oriented members 
(the Blue group) tended to rate more 
favorably the regular training group from 
which each had come, significantly more so, 
on the first and last of the 12 items. 

Some of the differences in reaction between 
self-oriented and interaction oriented subjects 
may be due to the fact that interaction 
oriented individuals seem to be much less 
interested in examining the “whys” and 
“wherefores” of discussion process than self- 
oriented persons who are much more reflective 


TABLE 2

DIFFERENCES IN EVALUATION OF RECOMPOSED GROUPS OF SUPERVISORS AND THE IMMEDIATELY
PRECEDING REGULAR GROUPS FROM WHICH MEMBERS CAME

                                      Mean difference in orientation

                                      Self    Interaction    Task    Residual
Recomposed group:                    (Blue)    (Brown)    (Purple)   (Orange)

Postmeeting evaluation
 1. Quality of group                       ae E EU mae
 2. Clarity of group’s goals               e dum Ls -8
 3. Worked hard at task                    5 nee EXC 5
 4. Practical, realistic discussion        -4 Ae nel m
 5. Discussion about ourselves             mid 20 -18 22**
 6. Members out to win own points          c» mn -12* EN
 7. Group worked at developing itself      -4 j 5 -10
 8. I leveled with others                  c D 2 14
 9. I felt joined up                       -16 3 9* qun
10. Tolerated different views              3 * -9 E
11. Received help from others              EN 14* 2) -28*
12. Group concerned with content
    rather than process                    -15 3 : à

Note. A positive value indicates response higher for recomposed group;
negative value indicates higher rating of regular group.
p < .01, df = 6.
and introverted concerning the process about
them. In sensitivity training groups, efforts
are made to get the group members to focus 
in a sophisticated way on the process of 
their interaction. Interaction oriented sub- 
jects paradoxically seem desirous of avoiding 
looking deeply into the very phenomena 
from which they seem to get the greatest 
enjoyment. Their opinion is often verbalized 
as: “people around here are trying to kill 
the discussion in order to dissect the corpse.” 
The twelfth item illustrates the differences 
between interaction and self-oriented groups. 
In comparison to the regular groups from 
which they had come, self-oriented super- 
visors reported their Blue group to be 1.5 
lower in concern with content rather than 
process. In opposite fashion, interaction 
oriented subjects noted that their Brown 
group was 1.4 higher in concern with content 
rather than process. 

As seen in Table 2, like self-oriented 
subjects, task oriented subjects also felt less 
favorably about their recomposed Purple 
group of only task oriented members. They 
judged it was less adequate a group, had less 
clear goals, worked less hard, yielded a less 
practical and less realistic discussion. Further- 
more, they felt that their recomposed group 
did not progress in its own development and 
was more intolerant of different views than 
the “back-home” regular training group 
from which each of the members had come. 

The “residual” Orange group lived up to 
its mixed composition. Much more conflict 
appeared in its discussion according to its 
members’ negative value on Item 6, “mem- 
bers were out to win own points.” The resid- 
ual members also felt that their recomposed 
group was much more content oriented than 
the regular training group from which they 
had come (Item 12). 


Supervisor critique. Before announcing the 
actual basis upon which the groups had been 
recomposed, a critique was held in an assembly 
of all 32 subjects after the recomposition experience.
Each of the recomposed groups was asked
to summarize the content and process of its discus- 
sion. Illustrative of the effects of homogeneous 
grouping was the comment of the spokesman for 
the recomposed group of highly interaction oriented 
members (Brown group): 
. . . In reflecting on our postdiscussion ratings
of our group, we felt that they would likely be
at the highest level in comparison to what we
had been going through during the past two or
three days. . . . This gave us a very interesting
feeling and we thought you in the laboratory did
a remarkable job [in composing our Brown
group]. . . . There wasn't a single excuse, argument
or a difference of opinion and we all reached
the same conclusions.


Secretary analyses. Unlike the supervisors, the
secretaries had not been filling out postmeeting
questionnaires after each meeting. So after meeting
in the recomposed groups homogeneous in orientation,
each secretary completed a single five-item
questionnaire. For each question she compared the
recomposed with her regular training group on a
nine-point scale coded +4 to -4. As in Table 2, a
positive mean score indicated choice of the
recomposed group, while a negative mean score
indicated that the item was more true about the
regular group. However, the measurement procedure
was not directly comparable with the supervisors’
ratings. With 5 or 6 df, a value 5-5 generally was
likely to be significantly different from 0.


Results 


Contrary to expectations, all three groups
tended to be more favorable in response to
their recomposed group, as can be seen by
the high proportion of positive responding
to Items 1, 2, 3, and 4 of Table 3 concerning
group quality, drive, attractiveness, and
mutual interest. The recomposed groups were
smaller (7 or 8) in comparison to the regular
groups of 11 to 13 members, so direct comparisons
may be less important than the
relative contrasts between the three homogeneous,
recomposed groups. Once again, the
interaction oriented secretaries, like the
interaction oriented supervisors, were most
favorably inclined to their newly recomposed
group. They felt their recomposed group to
be best in comparison to their regular group;
they felt it had worked hardest, they preferred
the most to return to it rather than
their regular group, and felt most compatible.

As before, the group of self-oriented subjects
felt relatively more compatible in interests
than the comparable assemblage of task
oriented subjects, for relatively more conflict
was perceived in recomposed groups
homogeneous in task orientation. However, as
indicated by the negative values, all subjects
perceived more conflict in regular than
recomposed groups.

TABLE 3

DIFFERENCES IN EVALUATION OF RECOMPOSED GROUPS
OF SECRETARIES AND THE REGULAR GROUPS
FROM WHICH MEMBERS CAME

                              Mean difference in orientation

Recomposed group:             Self    Interaction    Task

Comparative evaluation
1. Quality of group               1
2. Worked hard at task            -.2 1.
3. Would rather return            A 1
4. Common interests felt          8
5. Much conflict

* A positive value indicates a higher rating of the recomposed
group; a negative value indicates a higher rating of the regular
group.

Secretaries critique. Again, in general session,
participants were asked to describe
their meeting.

The task and interaction groups spent a
large portion of their time discussing aspects
of the previous weeks’ discussions involving
the goals of development group training,
while the self-oriented group utilized a great
proportion of their available time introducing
one another and discussing themselves.
Also, the task group spent effort examining
the possibility of applying knowledge derived
from the laboratory to the working situation.
Some individuals seemed aware that
each group contained members homogeneous
in something, but none could verbalize just
what these similarities were.


Part IV 


Orientation of the Most Influential Member
in the Recomposed Group of Members All
High in a Particular Orientation

It is fairly well recognized that the traits
of the successful leader depend upon the
[illegible] traits of those he leads (Bass,
1960, pp. 174-177). Where physical prowess
is valued by the group, the leader tends to
be above the average in this trait of those he
leads; where criminal tendencies are commonplace
in the group, the leader is often
an archcriminal; where the group is highly
frustrated and resentful, the most vocally
aggressive individuals are likely to come to
the fore.

A working hypothesis was formulated that 
in a group composed solely of task oriented 
members, the most influential member would 
tend to be outstanding in task orientation 
relative to the others in his group; in a 
group of highly interaction oriented members, 
the most outstanding member in interaction 
orientation would tend to be most influential 
in the group; and in a group of self-oriented 
members, the most influential member would 
tend to be most extreme in self-orientation. 

The six recomposed groups of supervisors 
and secretaries provided data for examining 
the hypothesis. Parallel findings emerged 
from the two samples of men and women 
when each subject ranked every other in 
his recomposed group in influencing the 
group. The discussant with the highest as- 
signed mean rank by his peers was identified 
as successful leader of the group. 
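
The identification rule just described can be sketched in code. What follows is a minimal illustration, assuming that a rank of 1 denotes the most influential member, so that the "highest" mean rank is the numerically lowest one. The function name, the member labels, and the example rankings are hypothetical and are not data from the study.

```python
from statistics import mean


def most_influential(rankings: dict[str, dict[str, int]]) -> str:
    """Return the member with the best (numerically lowest) mean rank from peers.

    rankings[rater][ratee] is the rank the rater assigned to the ratee,
    where 1 means most influential. Self-rankings, if present, are ignored.
    Ties are broken by first appearance, which is an arbitrary assumption.
    """
    received: dict[str, list[int]] = {}
    for rater, given in rankings.items():
        for ratee, rank in given.items():
            if ratee != rater:
                received.setdefault(ratee, []).append(rank)
    return min(received, key=lambda member: mean(received[member]))


# Hypothetical three-person group; each member ranks the other two.
example_rankings = {
    "A": {"B": 1, "C": 2},
    "B": {"A": 2, "C": 1},
    "C": {"A": 2, "B": 1},
}
leader = most_influential(example_rankings)
```

In this made-up group, member B receives a rank of 1 from both peers and is therefore returned as the leader.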


RESULTS 


In the task oriented group of seven secre- 
taries, the leader was the highest in the 
group in task orientation, exactly as pre- 
dicted. In the task oriented group of eight 
supervisors, the leader was third highest. 
Similarly, the most influential secretary 
among interaction oriented secretaries and the 
most influential supervisor among inter- 
action oriented supervisors were both next- 
to-highest in interaction orientation in their 
respective recomposed groups. 

A complete reversal (which made sense, 
after discovery) occurred in parallel fashion 
for both secretaries and supervisors when 
the self-oriented discussants were analyzed. 
The leader of self-oriented secretaries was 
next-to-lowest in self-orientation while the 
leader of the self-oriented supervisors was 
tied for lowest score in self-orientation. 


Further Analyses 

As reported earlier, task orientation had 
been found to correlate .43 and .44 with in- 
fluence in regular discussion groups. (Self- 
and interaction orientation were negatively 
correlated with influence in their regular dis- 
cussion groups, but not significantly so.) The 
increased homogeneity of the recomposed 
groups compared to regular training groups 
was expected to, and did, reduce these cor- 
relations substantially. For example, the cor- 
relation in the recomposed group of eight 
supervisors between task orientation and 
influence in the recomposed group of task 
oriented supervisors reduced to .24, while the 
correlation between task orientation and in- 
fluence among groups composed of self- 
oriented or interaction oriented members 
remained low negative. No correlation was 
found between previous influence in his 
regular group and successful influence in 
groups composed of self-oriented or inter- 
action oriented members. However, a cor- 
relation of .53 was obtained between leader- 
ship in one's regular groups and influence 
among task oriented members. 

Partial effects. A corrected measure was 
calculated of how much influence in a recom- 
posed group was associated with high scores 
on the particular scale used to compose the 
group, subtracting the effects of the same 
Scale score on influence in one's regular 
group. The emerging partial correlations were 
consistent with inferences based on the single 
most successful leader of each recomposed 
group. The partial correlation between inter- 
action orientation and influence among mem- 
bers high in interaction orientation (cor- 
rected for their respective influences back 
in their own regular group) was .42. The 
corresponding partial correlation between task 
orientation and influence among highly task 
oriented members in a recomposed group 
was .13 while the corresponding partial cor- 
relation between self-orientation scores and 
influence in a recomposed group of self- 
oriented members was only —.01. 
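
The corrected measure described above behaves like a first-order partial correlation. As a minimal sketch, assume the standard formula in which x is the orientation score, y is influence in the recomposed group, and z is influence in the regular group. The source does not state the exact computation, so the function name and the input correlations below are illustrative assumptions, not values reported in this study.

```python
import math


def partial_correlation(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation between x and y with z held constant.

    Uses r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz^2) * (1 - r_yz^2)).
    """
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denominator == 0.0:
        raise ValueError("undefined when x or y is perfectly correlated with z")
    return (r_xy - r_xz * r_yz) / denominator


# Hypothetical zero-order correlations, chosen only to show the call.
example = partial_correlation(r_xy=0.50, r_xz=0.30, r_yz=0.40)
```

When z is unrelated to both x and y, the partial correlation equals the zero-order correlation, which is a quick check on the formula.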

Placing a highly interaction oriented sub- 
ject among like-minded subjects increases 
his success as a leader. Since most task 
oriented subjects tend to lead regular training 
groups, when placed with other task oriented 
subjects, high task orientation, per se, reduces 
in importance in determining influence pat- 
terns. In groups of self-oriented members, no relation 
remains between leadership and orientation 
after previous behavior in regular groups is 
discounted. 
DISCUSSION AND CONCLUSION 
Modifications in Conceptualization 


The interaction oriented member is now 
seen as considerably more superficial in his 
overall approach to group affairs. His con- 
cern with maintaining happy, harmonious re- 
lations makes it difficult for him to con- 
tribute to the group’s progress (unless every- 
one in the group is interaction oriented). The 
interaction oriented member’s interest in 
group activities is at a nonfunctional level 
as far as the group’s progress is concerned. 
Results here fit with unpublished learning 
data collected by Leo Postman suggesting 
that in comparison to other subjects, inter- 
action oriented subjects “speak before they 
think” in that they have a relatively high 
error rate coupled with the fastest attain- 
ment of an easy criterion of 9 out of 12 
correct responses. In comparison, the task 
oriented member is ready to examine all 
facets of group activities including the way 
members need to relate to each other in order 
to accomplish the purposes of their group. 
Despite his concern with the task, the task 
oriented member works hard within the group 
to make it as productive as possible. It is 
the task oriented member who is likely to 
be rated as most helpful to the group and 
its constituent members—although among 
women, derogatory feelings may be aroused 
because of the initiative displayed. 

But why did the pooling of only task 
oriented members fail to achieve an effective 
hard-working group? Task oriented super- 
visors and secretaries seemed to be relatively 
less favorable to homogeneous groupings. 
Several unrelated factors may be involved. 
First, the task oriented subject participated 
the most in his regular group and therefore 
probably had developed more of a com- 
mitment to it. He may have felt the 
arbitrary recomposition of membership, sud- 
denly imposed by the training staff, to be 
an intrusion interrupting the work of his 
regular group to which he was contributing 
more than other members. Second, assuming 
that successful leadership is rewarding, the 
task oriented member had relatively little 
opportunity for such reward once placed 
among other task oriented subjects. Third, 
it was suggested in the critique that task 
oriented subjects were more inclined to bring 
into their new recomposed group the par- 
ticular norms and standards developed in 
their regular group (which they, no doubt, 
had been most instrumental in developing). 
The result was more difficulty in reaching 
agreement on how to proceed in the recom- 
posed group. 

While the task oriented member emerges 
as the hero of the training group, as seen in 
our recomposed groups, for reasons indicated, 
it by no means leads to the extrapolation 
that the best group contains only task 
oriented members. In addition to the specific 
situation just discussed, it must be reiterated 
that while we suggest that some subjects are 
generally and consistently more task oriented 
than others over many situations, there are 
various conditions when the usually task 
oriented subject becomes least concerned 
with the task. Thus, task oriented subjects 
are most likely to become apathetic when 
faced with a dull, irrelevant group chore 
not providing them a sense of accomplish- 
ment, while interaction oriented subjects 
might remain content just to have the op- 
portunity to keep interacting. For example, 
when dyads of subjects of varying combina- 
tions of self-, interaction, and task orienta- 
tion were asked to judge photos during a 
regular class hour instead of doing classwork, 
it was interaction oriented subjects, not the 
task oriented, who were most favored as 
partners (Campbell, 1961). Yet, when the 
task for the dyads involved midterm examina- 
tions, interaction oriented subjects proved 
themselves least agreeable and caused their 
partners to feel more conflict in two-man 
discussions about the right answers to the 
examinations. 

As a combination of several factors in- 
cluding the need to dominate as well as to 
introspect, the self-oriented member is most 
likely to be rejected by others as well as 
to be unresponsive to the needs of his group. 

His concerns with himself are detrimental 
to his evaluation as a group member by his 
associates and to his likelihood of modifying 
his behavior in response to the group’s needs 
in comparison to his own extrinsic demands 
unrelated to the task at hand. An under- 
standing of his performance may be found 
in the fact that when given control of a two- 
way communication system, the self-oriented 
discussant “turns off” disagreeing partners, 
and “turns on” agreeing ones, behaving in 
the Hull-Spence tradition, increasing what is 
immediately rewarding and avoiding what is 
immediately unpleasant. The behavior of 
interaction and task oriented subjects follows 
more dynamic theory. Their talking is posi- 
tively reinforced by agreeable partners. When 
they can control the systems, they talk more 
and listen less to agreement; they talk less 
and listen more to disagreeing partners 
(Kanfer & Bass, in press). The lack of 
receptivity of self-oriented subjects is illus- 
trated by a study in progress by Jerry 
Mendelsohn where he finds in simulated 
counseling that the more self-oriented the 
counselor or counselee, the less either agrees 
on what they talked about during counseling, 
but the more valuable they thought the 
counseling session. Lack of modifiability 
among the self-oriented seems reflected in 
their tendency to shift their opinions less. 
Rejection by peers of the self-oriented 
supervisors and secretaries seems consistent 
with the tendencies of self-oriented subjects 
to indicate more defense feelings under ego 
threat than matched interaction or task 
oriented subjects (Bass, in press) and with 
their greater tendency to show up as mal- 
adjusted clients at counseling centers or in 
institutions for juvenile delinquents. 

Results of these analyses also appear con- 
sistent with a most direct exploration by 
Frye (1961) of the proposition developed 
by Bass (1960, pp. 153-157) of how orienta- 
tion affects attempts to lead under various 
conditions of success and effectiveness as a 
leader. Frye studied 48 quartets homogene- 
ous in one orientation or another as they 
discussed the solutions to a series of nine 
problems. After each trial, subjects were fed 
back false information concerning the effec- 
tiveness of the group solution as well as their 
success or failure in influencing others. 

First, Frye found that regardless of treat- 
ment, interaction oriented subjects spent sig- 
nificantly more time in discussion than sub- 
jects of other orientations, while task oriented 
subjects were most attracted to these problem 
solving groups after nine trials of experience. 
Self-oriented subjects talked the least and 
were least attracted to their groups at the 
conclusion of the experiment. However, in 
opposition to what had been suggested by 
Bass (1960), when a subject was told he 
was influential but the resulting group solu- 
tion was ineffective, he was significantly 
more likely to increase his attempts to lead 
if he was task oriented, and significantly more 
likely to reduce his attempts to lead if he was 
self-oriented. Continued failure at the task 
spurred the task oriented who saw himself 
as influential, while it deterred the self- 
oriented. The interaction oriented subject 
was most stimulated into increasing his at- 
tempts to lead when told that the group was 
achieving effective decisions, but that he was 
not successfully influencing the group. 

Minimally, it seems warranted to conclude 
that identifying group members in advance 
as self-, interaction, or task oriented is a 
profitable discrimination that correlates con- 
sistently with patterns of leadership and 
other aspects of group behavior. 
In 2 experiments the Ss were “workers” operating under the supposed 
guidance of their peer (a stranger) who was playing the part of their “super- 
visor.” The men in the High Dependency condition were informed that E’s 
evaluation of the supervisor would depend largely upon their productivity, 
while the Low Dependency Ss were told that this evaluation would not be 
affected by their performance. There was a significantly greater performance 
in the High Dependency condition. In general, the Low Dependency Ss worked 
harder when told that the supervisor would soon learn of their performance 
than when they had been informed that he would not find out about their 
productivity until much later. The immediacy with which the supervisor 
would learn of S's productivity did not affect performance under high 
dependency, however. The Ss in this condition worked relatively hard 
presumably because of a feeling of responsibility to the dependent peer. 


Most modern psychologists seem to regard 
man as being entirely wrapped up in himself. 
If we are to judge from the theoretical formu- 
lations currently guiding most of the research 
in personality and social psychology, human 
beings—or at least those living in Western 
society—have practically no concerns but 
themselves. They apparently are constantly 
oriented toward reducing their own tensions, 
and are persistently on the defensive. Such 
autistic processes, of course, cannot be denied. 
But we also live in a world containing other 
people, and we sometimes do things because 
of our obligations to others. Civilized man 
occasionally wants to help other people be- 
cause it is his duty and responsibility to do 
so. As Gordon Allport (1960, p. 304) has 
pointed out, Western psychology has largely 
neglected such considerations. 

In contrast to psychologists, sociologists 
and other social observers have not blinded 
themselves to the questions of duty and obli- 
gation. Perhaps because they professionally 
are more aware of the problems of everyday 
life, these social scientists have long noted 
that people often carry out particular actions, 
sometimes even at considerable psychological 
cost to themselves, merely because they are 
expected to do such things. Society is not 
kept from disintegrating into narcissistic 
anarchy only by fear and constant police 
surveillance. 

This is not to say that sociologists view 
civilized man as essentially altruistic. Many 
social observers seem to believe the individual 
lives up to his obligations in order to recipro- 
cate for the “good turns” he has received in 
the past and those he expects to receive in 
the future (cf. Gouldner, 1960). For example, 
according to Gouldner, Malinowski’s analysis 
of social conformity in primitive societies was 
based on the reciprocity principle. Conformity 
with social norms, even when the rules of 
conduct are “hard and irksome,” is presum- 
ably something people give to each other in 
payment of the obligations people owe to 
each other. 

Such “give-and-get” expectations appar- 
ently are strongly learned and widely shared. 
Gouldner has hypothesized that the norm of 
reciprocity—which, to him, requires that 
people should help and not injure those who 
have helped them—is “no less universal and 
important an element of culture than the 
incest taboo" (p. 171). The reciprocity prin- 
ciple is probably important in the formation 
of cooperative behavior patterns; a person 
may carry out “selfless” actions as an adult 
because his parents had taught him as a child 
that he should do good unto others if he wants 
others to do good unto him. Nevertheless, 
whatever the genesis of the behavior, socially 
responsible actions may be performed—par- 
ticularly if the costs are not too great— 
without conscious, explicitly felt anticipations 
of reciprocated rewards. 

If the present reasoning is correct, then, 
many people in our society seek to aid others 
only because they believe that these others 
are dependent upon them for their rewards. 
The perception of the dependency relationship 
presumably arouses feelings of responsibility 
to these others, and the outcome is a height- 
ened instigation to help them achieve their 
goals. 

Other writers have advanced similar propo- 
sitions. Thomas (1957) has suggested, for 
example, that the magnitude of the “respon- 
sibility forces” within a group is in direct 
proportion to the degree of “facilitative role 
interdependence” among the members. In his 
experiment women group members who were 
highly dependent upon each other for the 
performance of a task exhibited greater task 
motivation than subjects who could work 
independently of the others in their group. 
His findings may be attributable to the reci- 
procity principle rather than our conjectured 
feeling of responsibility to dependent others. 
The interdependent subjects could have be- 
come highly motivated to help the others in 
the group in order not to endanger their own 
chances of gaining the reward. They gave in 
order to receive. Much the same interpreta- 
tion can be offered to explain the results in 
other experiments. Deutsch (1949) demon- 
strated that cooperative groups were more 
highly task motivated than competitive 
groups, probably because the people in the 
former groups were (in Deutsch’s words) 
“promotively interdependent”; any member’s 
behavior could advance all members toward 
the goal and, hence, each felt the others in 
the group were dependent upon him. But 
again, reciprocity may have been in opera- 
tion. Each member could have worked hard 
for his partners because he wanted them to 
work hard for him. Similarly, in an experi- 
ment by Berkowitz and Levy (1956) groups 
evaluated as a unit became more highly 
motivated than groups whose members were 
evaluated separately; the high task motiva- 
tion presumably resulted from the perceived 
member interdependence created by applying 
rewards to the group as a whole. The inter- 
dependent subjects probably were concerned 
with aiding both their fellow group members 
and themselves. 

Because of the reciprocity involved in 
interdependent groups, Berkowitz 
conducted another experiment designed to 
assess the motivational effects of both inter- 
dependency and dependency relationships. 
Using pairs of men engaged in a joint en- 
deavor, he found there was the greatest moti- 
vation toward the assigned task when both 
the subject and his partner could gain valu- 
able prizes for effective performance. In this 
case the interdependency had been heightened 
by increasing the value of the reward. The 
subjects in such highly interdependent pairs 
worked harder than the subjects assembled 
in pairs in which only one member could 
achieve the valued goal. But, consistent with 
the present thesis, the results also indicated 
that the individual subject was as 
motivated when only his partner was eligible 
for the prize as when only he could attain the 
reward. In both of these latter conditions, 
further, the subjects worked harder than when 
no prize was offered. Thus, the subjects' 
motivation was strengthened by telling them 
their partners’ (but not their own) chance 
of attaining a valued goal was dependent upon 
their performance in the situation, even 
though there was nothing tangible in it for 
them. 

These subjects may have anticipated some 
benefits, however. The work situation was so 
structured in the Berkowitz experiment that 
the subject’s partner became aware of the 
subject’s performance almost immediately. 
Consequently, the subject might have been 
relatively highly motivated to win his part- 
ner's approval. The present experiments were 
designed primarily to investigate the effects 
of differences in the immediacy with which the 
subject’s partner (who was either more or less 
dependent upon the subject) would learn of 
the subject's performance. If the 
middle-class college student has internalized 
the obligation to aid others who are dependent 
upon him for their rewards, the subject should 
exhibit a relatively high level of motivation 
to help a dependent peer even when this 
person would not learn of the subject's 
performance until much later, if at all. 

The two experiments reported in the present 
paper provide tests of this hypothesis. But, 
we should point out, a demonstration that the 
subjects in our sample do tend to work hard 
to aid peers who are highly dependent upon 
them would not necessarily question the ap- 
plicability of the reciprocity principle to other 
situations. The present subjects do not have 
to expend much effort in helping their peer. 
Their willingness to work for a dependent 
other without return benefits undoubtedly 
would decrease with an increase in the psycho- 
logical cost of this work to themselves. Such 
an increase in costs could well heighten their 
desires for some sort of payment. More than 
this, however, even the subjects in our situa- 
tion may want some rewards for their effort. 
The subject working hard so that another 
may gain may well feel a degree of relative 
deprivation which, in turn, could produce 
some resentment. The second experiment deals 
with such a possibility. 


EXPERIMENT I 


The subjects in both experiments played 
the part of a “worker” operating under the 
guidance of a “supervisor.” A 2 × 2 × 2 
factorial design was employed in the first 
study. In some cases the subjects were in- 
formed the supervisor’s evaluation was highly 
dependent upon their performance, while in 
the low dependency conditions these evalua- 
tions of the supervisor would not be affected 
by the subject’s performance. The subjects 
also were sometimes led to believe a cash 
prize would be offered to the supervisor re- 
ceiving the most favorable rating, while no 
mention was made of such prizes to the re- 
maining subjects. Finally, some of the sub- 
jects were told the supervisor would receive 
periodic reports of their performance. The 
men in the low awareness conditions, on the 
other hand, were informed the supervisor 
would not be told of their performance. 


Method 


Subjects and procedure. The subjects, 80 under- 
graduate men recruited from introductory courses in 
psychology at the University of Wisconsin, were 
scheduled for participation in the experiment in 
pairs, with none of the pair members having more 
than a passing acquaintance with each other. Each 
pair was told the purpose of the experiment was to 
develop a test of supervisory ability based upon 
work samples. One of the pair members would be 
the supervisor whose task it would be to tell the 
other pair member (the worker) how to construct 
paper boxes. The two would be in separate rooms, 
they were informed, and the supervisor would com- 
municate with the worker by means of notes. The 
experimenter pointed out that the written notes 
would provide a record of the supervisor’s instruc- 
tions that could be scored later. 

The experimenter then described the sequence of 
events to be followed. After the two men went into 
their separate rooms, one of them (supposedly) 
would be chosen as the supervisor, and he would be 
given a few minutes to familiarize himself with the 
task. He then would write instructions to the worker 
who would have a 15-minute practice period and 
then a 30-minute work period. The practice period 
performance would not be counted. After these 
instructions, the subjects went to separate rooms 
where, a few minutes later, the experimenter told 
each subject he had been chosen to be the worker 
while the other person was to be the supervisor. 

Experimental manipulations. Since a pilot study 
had indicated practice period performance was not 
significantly affected by this procedure, the experi- 
menter (during Experiment I only) provided the 
instructions designed to create the experimental vari- 
ables before the onset of the practice trial. All sub- 
jects were told the supervisor would be rated on 
how well he had written his instructions, but the 
men in the High Dependency conditions also were 
informed that the supervisor’s evaluation would 
depend upon the number of paper boxes they made 
during the 30-minute work period. (Thus, by im- 
plication, the supervisor’s rating depended greatly 
upon their performance.) In the Low Dependency 
conditions, however, the instructions stated that the 
supervisor's evaluation would not be affected by the 
worker’s output (supposedly because there was a 
very great variability among workers). Half of the 
subjects in each of these conditions, those in the 
High Awareness treatment, were informed that the 
subject’s productivity would be reported twice to the 
supervisor during the main work period (at 15 and 
at 30 minutes). In contrast, the subjects in the Low 
Awareness conditions were told the supervisor would 
receive no information as to their performance. 
Finally, the subjects in the High Reward conditions 
were led to believe their supervisor would win a $5 
reward if he received the highest rating. In the 
Low Reward conditions the supervisor with the 
highest rating supposedly would be informed of his 
evaluation at the completion of the experiment. 
Following these statements the experimenter left 
the room and after 10 minutes brought each subject 
a note ostensibly written by the supervisor but 
actually constructed beforehand to reinforce the 
experimental instructions. In the High Dependency- 
High Awareness treatment, for example, this note 
said: 

Hope you can understand my notes. I'll have 
them ready in a few more minutes. Experimenter 
probably told you I will be seeing your boxes. She 
has given me the choice of how I will be rating 
your work, so I'll let you know how I'm doing it. 
I'll be rating your work on (1) number of boxes 
you make, and (2) the quality of the boxes. I 
think that (1) is the more important. Good luck, 
I'm counting on you. 


The note in the Low Dependency-High Awareness 
condition mentioned the same two performance 
criteria and also listed the first as the more im- 
portant. The last sentence was omitted, however. In 
the Low Awareness conditions the subjects given 
the High Dependency "treatment" also were wished 
“good luck" and were told the supervisor was 
"counting on" the subjects, but were informed by 
the note that the supervisor would not see the 
subject's work. 

There were 10 men in each of the eight experi- 
mental conditions. 
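
The layout of the design can be checked with a short enumeration. The sketch below assumes only what the text states: three two-level factors and 10 men per cell. The variable names and the level labels are illustrative choices, not part of the original procedure.

```python
from itertools import product

# The three manipulated factors, each at two levels.
FACTORS = {
    "dependency": ("High", "Low"),
    "awareness": ("High", "Low"),
    "reward": ("High", "Low"),
}
MEN_PER_CELL = 10

# Every combination of factor levels is one experimental condition.
conditions = [
    dict(zip(FACTORS, levels)) for levels in product(*FACTORS.values())
]
total_subjects = len(conditions) * MEN_PER_CELL
```

The enumeration yields eight conditions and 80 subjects in all, which agrees with the sample size given under Subjects and procedure.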

Dependent variables. About 5 minutes after the 
first note was received, the experimenter brought 
each subject the written instructions from the fic- 
tional supervisor together with the work materials: 
Scotch tape and sheets of paper. The instructions 
were the same for all conditions. The subject was 
told the practice period had started, and that the 
experimenter would return in 15 minutes. The 
experimenter came back 15 minutes later to tell the 
subject that the main trial had begun. The experi- 
menter recorded the number of boxes constructed 
by the subject at the end of the practice period and 
at the middle and end of the main work period. 
Because these scores were found to be skewed, the 
square root transformation was applied to the data. 
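
The transformation can be illustrated briefly. The sketch below assumes the plain form, the square root of each count; the text does not say whether a constant was added before taking the root. The box counts are hypothetical and serve only to show how the upper tail is compressed.

```python
import math


def sqrt_transform(box_counts):
    """Return the square root of each nonnegative box count."""
    return [math.sqrt(count) for count in box_counts]


# Hypothetical counts with a long upper tail.
raw_counts = [0, 1, 4, 9, 25]
transformed = sqrt_transform(raw_counts)

# The range shrinks from 25 units to 5, pulling in the skewed tail.
raw_range = max(raw_counts) - min(raw_counts)
transformed_range = max(transformed) - min(transformed)
```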

At the end of the work period the subjects were 
given a brief questionnaire dealing with their re- 
actions to the experiment. Some of the items sup- 
ported the ostensible purpose of the study, but 
several were designed to test the success of the 
experimental manipulations. For each question the 
subject was to answer by placing a checkmark at 
an appropriate place on a 5.5-inch-long linear rating 
scale. The scores in each case were the distances, in 
.25-inch units, from one end of the scale. 
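
The scoring rule amounts to a simple conversion from inches to scale units. The sketch below assumes that a checkmark is scored to the nearest .25-inch unit; the rounding rule and the choice of which end counts as zero are not given in the text, and the function name is hypothetical.

```python
SCALE_LENGTH_INCHES = 5.5
UNIT_INCHES = 0.25


def scale_score(distance_inches: float) -> int:
    """Convert a checkmark's distance from the zero end into .25-inch units."""
    if not 0.0 <= distance_inches <= SCALE_LENGTH_INCHES:
        raise ValueError("checkmark lies off the scale")
    return round(distance_inches / UNIT_INCHES)


# A mark at the far end of the scale receives the maximum score.
max_score = scale_score(SCALE_LENGTH_INCHES)
midpoint_score = scale_score(2.75)
```

Under these assumptions the scores run from 0 to 22, with the midpoint of the line scored 11.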


Results 


Tests of manipulation effectiveness. Analy- 
ses of two questionnaire items attest to the 
success of the dependency manipulation. The 
first of these questions asked, “How depend- 
ent was your partner upon your doing well 
at your job?" with the scale ranging from 
"very much" to “very little." The only sig- 
nificant effect revealed in the analysis of vari- 
ance of the responses to this item was that 
for High versus Low Dependency (F = 22.67, 
p < .001). As expected, the subjects in the 
High Dependency conditions said their part- 
ner was more highly dependent upon their 
performance. The second item read, “How 
much did you feel that you were helping your 
partner get a favorable evaluation?” Again, 
the scale extremes were “very much” and 
“very little," and again, the analysis of vari- 
ance yielded only a significant effect for 
the dependency manipulation (F = 12.30, 
p < .01). Subjects in the High Dependency 
conditions indicated they had given their 
partner more help in getting a favorable 
evaluation. 

Two other questionnaire items showed the 
High versus Low Reward manipulation had 
some effect. On one of these the subjects, 
informed their partner was eligible for a $5 
prize, rated his task instructions significantly 
more favorably than did the subjects in the 
Low Reward conditions (F = 11.28, p < .01). 
(There also was a significant main effect 
for dependency, F = 9.47, p < .01, with the 
men in the High Dependency conditions ap- 
parently also trying to help their partner by 
giving him a more favorable rating.) Simi- 
larly, on the second item the men in the High 
Reward conditions rated their partner’s super- 
visory ability significantly better (F = 6.29, 
p < .05) than the subjects in the Low Reward 
groups. The subjects in the High Reward 
conditions clearly believed these two ratings 
of their partner might affect his chances of 
gaining the cash prize and they evidently 
sought to help him out. The reader will note 
that these findings are consistent with the 
present analysis of dependency effects; sub- 
jects thought their peer’s chance of reaching 
the monetary goal was dependent upon their 
behavior—the ratings they gave—and they 
acted in what they presumably regarded as 
a “socially responsible” manner toward him. 

None of the questionnaire items revealed 
a significant effect for the High versus Low 
Awareness manipulation. It could be, as we 
will argue later, that some of the subjects in 
the Low Awareness groups believed their 
partner might learn of their performance in 
the very near future. 

Performance. Table 1 summarizes the 
analysis of variance of the transformed pro- 
duction data. The scores are based on the 
number of boxes constructed in the various 
experimental conditions during the 15-minute 
practice period and in each of the two 15- 
minute segments of the main work period. 

TABLE 1 

ANALYSIS OF VARIANCE OF NUMBER OF BOXES COMPLETED 
DURING PRACTICE PERIOD AND FIRST AND 
SECOND HALF OF MAIN WORK PERIOD 

(Square Root Transformation) 

Source | df | MS | F 
1. High versus Low Dependency (D) | 1 | 146 | 11.91** 
2. High versus Low Awareness (A) | 1 | 8 | 
3. High versus Low Reward (R) | 1 | 5 | 
4. D × A | 1 | 8 | 
5. D × R | 1 | 61 | 4.98* 
6. A × R | 1 | 6 | 
7. D × R × A | 1 | 11 | 
8. Subjects/groups (Ss) | 72 | 123 | 10.06** 
9. Trials (T) | 2 | 384 | 315.27** 
10. T × D | 2 | 2.95 | 24.22** 
11. T × A | 2 | 2 | 
12. T × R | 2 | .04 | 
13. T × D × A | 2 | 4 | 3.28* 
14. T × D × R | 2 | .05 | 
15. T × A × R | 2 | .05 | 
16. T × D × A × R | 2 | 2 | 
17. T × Ss | 144 | 112 | 

* p < .05. 
** p < .001. 


One of the significant effects listed in this 
table, the Dependency × Reward interaction
on Row 5, warrants special comment. This 
interaction stems from differences in practice 
period performance. Even though the men 
were told this period “did not count,” the 
subjects in the High Dependency-High Reward 
condition constructed more practice boxes 
than the people in any other condition. 
Whether this greater performance was due 
to “chance” or the premature operation of 
the experimental instructions is not certain.*
The significant interaction of Trials ×
Dependency × Awareness indicated on Row
13 is relevant to the theoretical argument
advanced in this paper. The means involved
in the interaction are reported in Table 2.
As can be seen in this latter table, only the
men believing their supervisor was highly
dependent upon their productivity showed a
significant rise in productivity above the
practice level. In the High Dependency condi-
tions, furthermore, the awareness variable had
practically no effect; the subjects told their
supervisor would not learn of their perform-
ance right away generally worked just as
hard as the men informed that the supervisor
would get reports of their productivity. Both
High and Low Awareness groups in the
High Dependency condition had a reliably
greater level of productivity during the main
work trials than the men in the Low
Dependency-Low Awareness condition.

* An analysis of covariance of the main work
period scores was conducted holding the practice
productivity scores constant, and showed only
a significant main effect for dependency.

However, although the Low Dependency 
subjects told their supervisor would learn of 
their performance worked somewhat harder 
than the similar Low Dependency subjects in 
the Low Awareness condition—as we would 
expect—this difference is not significant and 
clouds the interpretation of the present find- 
ings. There is no evidence the low awareness 
manipulation worked. The subjects conceiv- 
ably could have become highly motivated 
under the High Dependency conditions be- 
cause they thought the peer dependent upon 
their work would find out about their per-
formance fairly soon. Their instructions had
said the supervisors would be given an evalua- 
tion (either a prize or a rating) “at the end 
of the experiment.” But they did not know 
when the experiment would be concluded. 
Some of the subjects could have believed the 
study would end in a few days, and so, the 
peer might learn of their productivity rela- 
tively quickly. Since they might meet the 
peer again within the near future, they 
also could have become motivated to win 
his approval by demonstrating effective per- 
formance. 

TABLE 2

NUMBER OF BOXES COMPLETED DURING PRACTICE
PERIOD AND FIRST AND SECOND HALVES OF MAIN
WORK PERIOD

(Square Root Transformation)

                High Dependency            Low Dependency
              High         Low           High         Low
              Awareness    Awareness     Awareness    Awareness
Practice      1.92         1.91          1.87         1.84
              a            a             a            a
              A            A             A            A
Main 1        3.28         3.22          2.78         2.43
              b            b             ab           a
              B            B             B            B
Main 2        3.54         3.54          2.99         2.62
              b            b             ab           a
              C            C             B            B

Note.—These are the condition means involved in the sig-
nificant Trials × Dependency × Awareness interaction re-
ported in Table 1.
The differences were tested by the Duncan multiple
range test. Comparisons across conditions (within rows) used
Row 8 in Table 1 as the error term. The presence of significant
(.05) condition differences is indicated by different capital
letters. Comparisons from one trial to the next (within
columns) used Row 17 as the error term, and the presence of
significant differences is indicated by different small letters.
EXPERIMENT II


The second experiment was designed to 
overcome these difficulties. The subjects in 
the Low Awareness conditions were told the 
supervisor would not receive any “feedback” 
until the end of the semester—more than a 
month away—and even then, they were in- 
formed, they probably would never meet him 
again. Under such conditions, would informa- 
tion that a peer was highly dependent upon 
the subject’s performance serve to enhance his 
motivation to work on this task? 

Another question dealt with in this study 
had to do with the subject’s reaction to the 
demands imposed upon him by the perceived 
dependency relationship. Any obligation he 
felt to work for his peer was costly to him; 
he would have to expend extra effort on a 
task that was not intrinsically satisfying to 
him. Consequently, although the subject 
might not explicitly verbalize his feelings—his 
felt obligation could prevent him from ex- 
pressing any strong complaints—the High 
Dependency condition could produce rela- 
tively strong resentment. Consistent with such 
a possibility, Thomas (1957) reported the 
subjects in his highly interdependent groups 
displayed greater indications of tension than 
did the subjects assembled in groups having 
a lower degree of interdependence. 


Method 


Subjects and procedure. Thirty-two undergraduate 
men, also recruited from undergraduate psychology 
courses, were distributed evenly among the four 
experimental conditions of the 2 × 2 factorial design
in the present study. As before, the subjects were
scheduled in pairs, and they were given the same
general introduction about the ostensible purpose of 
the experiment. But this time, after each man was 
informed he had been chosen to be the worker, he 
was given the 15-minute practice trial before the 
manipulation instructions were provided. The High 
and Low Dependency conditions were created in 
exactly the same manner as in Experiment I. But 
unlike the first study, all subjects were told the 
supervisor would receive a $5 cash prize if he had 
the highest rating. As indicated earlier, the awareness 
manipulation also was varied somewhat. The High 
Awareness instructions again emphasized that the 
supervisor would get reports of the subject's per- 
formance in the middle and at the end of the 30- 
minute main work period. The subjects in the Low 
Awareness condition, however, were explicitly told 
the supervisor would not receive his evaluation (and 
the prize if he won it) until the study was com-
pleted at the end of the semester. The subjects also
had completed a questionnaire at the start of the
session in which they indicated their future plans
(administered for the purposes of the awareness
manipulation), and the experimenter also told the
subjects in the Low Awareness treatment that the
supervisor would be leaving the university at the end
of the semester.

Dependent variables. As in Experiment I, the 
major dependent variable is the number of paper 
boxes constructed in response to the supervisor's
instructions. However, the production data were not
transformed for the present analyses since the dis-
tribution of scores was approximately normal.
Further, in the interests of convenience, the score
for any one subject is the difference between his
practice period and main work period performance.

Following the conclusion of the work period the 
subjects filled out a brief questionnaire designed 
primarily to assess their reactions to the task and 
the supervisor. When this was completed the decep- 
tions were explained and the subjects (all pledging 
secrecy) were dismissed. 


Results 


Table 3 summarizes the analysis of vari-
ance of the performance data, while the mean
increase in productivity from the baseline
practice period in each experimental condi-
tion is given in Table 4. In line with the
results of the first experiment, there was a
highly significant main effect for the depend-
ency manipulation. The subjects told their
peer was dependent upon their work showed
a significantly greater rise in productivity
than the subjects informed their peer was
not highly dependent upon their performance.
The Dependency × Awareness interaction did
not quite reach the customary level of sig-
nificance, but since the Duncan range test
does not demand significant F ratios (com-
pensation is made for the possibility of chance
differences), tests were made of the differences
among the four experimental conditions. As
can be seen in Table 4, the awareness variable


TABLE 3

ANALYSIS OF VARIANCE OF INCREASE IN PRODUCTIVITY
FROM PRACTICE PERIOD: EXPERIMENT II

Source                               df      MS       F
1. High versus Low Dependency (D)     1    561.49    161
2. High versus Low Awareness (A)      1     98.00     69
3. D × A                              1    128.00
4. Subjects/groups                   28     34.67

*p <0.
TABLE 4

MEAN INCREASE IN PRODUCTIVITY FROM PRACTICE
PERIOD: EXPERIMENT II

                  Dependency
Awareness      High       Low
High           19.0a      14.6a
Low            19.5a       7.1b

Note.—Cells containing the same subscript are not signifi-
cantly different from each other (at the .05 level) by Duncan
multiple range test.


did have a significant effect in the Low
Dependency conditions. The men led to be-
lieve their supervisor would soon learn of
their productivity had a reliably greater
productivity increase than the subjects told
the supervisor would not learn of their per-
formance until much later. The subjects in
the Low Awareness-Low Dependency group
apparently did not believe they could gain an
important degree of approval from their peer
by working hard.
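
As a rough check on how the cell means relate to the analysis of variance, the following Python sketch recomputes the sums of squares for a balanced 2 × 2 design from the four cell means given in Table 4. It assumes 8 subjects per cell (32 men distributed evenly among four conditions) and takes the subjects/groups mean square of 34.67 from Table 3. The function and variable names are illustrative only and are not part of the original analysis, and because the cell means are rounded to one decimal the results can only be approximate.

```python
# Sketch: sums of squares for a balanced 2 x 2 design from cell means.
# Assumptions (not from the original analysis): all names used here;
# n = 8 per cell follows from 32 subjects spread evenly over 4 conditions.

def two_by_two_ss(cells, n):
    """cells[(dependency, awareness)] -> mean; returns SS for D, A, D x A."""
    levels = ("high", "low")
    grand = sum(cells.values()) / 4.0
    dep_means = {d: (cells[(d, "high")] + cells[(d, "low")]) / 2.0 for d in levels}
    aw_means = {a: (cells[("high", a)] + cells[("low", a)]) / 2.0 for a in levels}
    # Each marginal mean rests on 2 * n observations.
    ss_d = 2 * n * sum((m - grand) ** 2 for m in dep_means.values())
    ss_a = 2 * n * sum((m - grand) ** 2 for m in aw_means.values())
    ss_cells = n * sum((m - grand) ** 2 for m in cells.values())
    return ss_d, ss_a, ss_cells - ss_d - ss_a

# Mean increase in boxes, keyed by (dependency, awareness), from Table 4.
cell_means = {
    ("high", "high"): 19.0, ("high", "low"): 19.5,
    ("low", "high"): 14.6, ("low", "low"): 7.1,
}
MS_WITHIN = 34.67  # subjects/groups mean square, 28 df (Table 3)

ss_d, ss_a, ss_da = two_by_two_ss(cell_means, n=8)
# Each effect has 1 df, so its mean square equals its sum of squares.
f_ratios = {name: ss / MS_WITHIN
            for name, ss in (("D", ss_d), ("A", ss_a), ("D x A", ss_da))}
for name, f in f_ratios.items():
    print(f"{name}: F(1, 28) = {f:.2f}")
```

Under these assumptions the sums of squares come out at roughly 564, 98, and 128 for dependency, awareness, and their interaction, which is in line with the mean squares listed in Table 3.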

But despite this presumed low concern with
gaining approval through performance, the
men in the Low Awareness condition worked
just as hard as the subjects in the High
Awareness treatment when both groups were
told their peer's chances of gaining a valued
goal were highly dependent upon their per-
formance. Both groups of men apparently felt
some degree of responsibility to help their
peer reach his goal, even though (particularly
in the Low Awareness condition) there was
little tangible benefit to themselves in such
behavior.

Questionnaire responses. Several of the
questionnaire items administered at the con-
clusion of the work period sought to
determine why the subjects in the High
Dependency conditions did work hard for
their supervisor. Analyses of these items
partly confirmed the questionnaire results
obtained in Experiment I; the subjects per-
ceiving the strong dependency relationship
evidently felt some obligation to help their
peer. Thus, in response to the open-ended
question, “What would you say your chief
motive was in making the boxes during the
work period?” a significantly higher propor-
tion of the subjects in the High Dependency
conditions indicated they sought to help their
supervisor. Seven of the 16 High Dependency
subjects gave this type of answer, while 15
of the Low Dependency subjects described
some other motivation, such as for self-
satisfaction in a feeling of accomplishment
or to follow the instructions of the experiment
(χ² = 4.33, p = .05, employing Yates’ cor-
rection). Possibly because of the greater
feeling that they were helping their partner,
the subjects in the High Dependency condi-
tion regarded their own job as more important
than did the subjects believing their super-
visor was not too dependent upon them.
The former subjects expressed a stronger
denial to the question, “Do you think that
workers in this type of work situation feel iso-
lated or unimportant?” (F = 4.09, p = .06,
approximately).

However, the perceived importance of their 
position in the High Dependency treatment 
did not compensate altogether for the lack 
of tangible rewards. The subjects were asked 
to complete the item, “I would have felt like 
working harder if such changes as [fill in] 
had been made regarding the worker's situa- 
tion in this experiment." Twelve of the High 
Dependency subjects (4 in the High Aware- 
ness group) listed at least one complaint, as 
did 15 of the Low Dependency subjects. 
There was a reliable difference in the nature 
of these complaints, however. Seven of the 
12 “griping” High Dependency subjects said 
they missed having some personal incentive, 
such as the possibility of a prize. In contrast, 
only 1 of the 15 complaining Low Depend- 
ency men made such a statement, most of 
them saying they wanted contact with the 
supervisor or more pleasant working condi-
tions (χ² = 6.04, p = .02, employing Yates'
correction). Some of the men seeing them-
selves as working for their peer evidently felt 
a degree of relative deprivation. The super- 
visor—for whom they were expending effort— 
stood to gain something tangible. Why not 
something for themselves as well? 
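
For readers unfamiliar with the statistic, the chi-square with Yates' correction used for the complaint comparison can be sketched in Python as follows. The cell counts are read off the prose above (7 of 12 High Dependency complainers and 1 of 15 Low Dependency complainers mentioned a personal incentive), and the function name and table layout are assumptions made for illustration. The sketch need not reproduce the printed value exactly, since the original computation is not shown.

```python
# Sketch: chi-square for a 2 x 2 table with Yates' continuity correction.
# Assumption: the cell counts are read off the prose (7 of 12 High
# Dependency and 1 of 15 Low Dependency complainers cited incentives).

def yates_chi_square(a, b, c, d):
    """Chi-square (1 df) for the table [[a, b], [c, d]] with Yates' correction."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one observation")
    # Shrink |ad - bc| by n/2, but never below zero.
    corrected = max(abs(a * d - b * c) - n / 2.0, 0.0)
    return n * corrected ** 2 / (row1 * row2 * col1 * col2)

# Rows: High vs. Low Dependency complainers.
# Columns: wanted a personal incentive vs. some other complaint.
chi2 = yates_chi_square(7, 5, 1, 14)
print(f"chi-square (1 df, Yates) = {chi2:.2f}")
```

With these counts the statistic comes out a little above 6, in the neighborhood of the reported value, and it exceeds the 3.84 required for significance at the .05 level with 1 degree of freedom.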

This presumed lack of “distributive justice” 
(cf. Homans, 1961) could conceivably lead to 
some resentment toward the individual de- 
pendent upon them, even though the anger 
might not be expressed directly. Suggestive 
indications of such resentment can be found 
TABLE 5

MEAN SCORE ON ITEM, “TO WHAT EXTENT WOULD
YOUR WORK HAVE BEEN BETTER IF YOUR PARTNER
HAD BEEN REPLACED?”
EXPERIMENT II

                  Dependency
Awareness      High        Low
High           18.75a      17.50ab
Low            14.50b      16.50b

Note.—The higher the score the greater the satisfaction with
the partner.
Cells containing a subscript in common are not significantly
different (at the .05 level) by Duncan multiple range test.


in the responses to another questionnaire 
item. The question asked, “To what extent 
would your work have been better if your 
partner had been replaced?" The mean score 
in each condition on this item is reported in 
Table 5. As can be seen, the subjects in the 
High Dependency-Low Awareness group ex-
pressed a reliably greater dissatisfaction with
their supervisor than the men in the High
Dependency-High Awareness condition. As-
suming that the latter people suspected that
their peer might see their questionnaires (this
possibility was not definitely excluded by the
experimenter's instructions), we can hypothe-
size that there was some general dissatisfac-
tion with the supervisor throughout the entire
High Dependency condition which the sub-
jects in the High Awareness group did not
care to display. If such is the case, the re- 
sentment could have arisen because the men 
saw themselves as working hard for someone 
else—who would attain a valued goal as a 
consequence of their behavior—while they 
themselves would receive no compensation 
for this effort. Indeed, in the Low Aware- 
ness group there would not even be the 
compensation of friendliness and approval. 


THE EFFECTS OF EXPERIMENTAL SOCIALIZATION
PARADIGMS UPON TWO MORAL RESPONSES 
TO TRANSGRESSION 


JUSTIN ARONFREED 1


University of Pennsylvania 


In an initial experiment on the antecedents of self-criticism and reparation,
57 5th-grade girls were equally assigned to 2 treatments: High Cognitive
Structure-High Control and Low Cognitive Structure-Low Control. Each S
was repeatedly punished for an aggressive act on 10 training trials. On the
test trial, a more destructive act was contrived to elicit internalized moral
responses. In a 2nd experiment, using the same techniques but designed to
distinguish independent antecedents, 68 5th-grade boys were equally assigned
to each of 4 treatments: High Cognitive Structure-High Control, High
Cognitive Structure-Low Control, Low Cognitive Structure-High Control, and
Low Cognitive Structure-Low Control. Induction of self-criticism was sig-
nificantly related to E’s cognitive structuring during training. Reparative
responses were a function of whether S or E controlled punishment. The 2
moral responses were concluded to be the consequences of distinct patterns
of social reinforcement and not attributable to a unitary entity such as
“conscience” or “superego.”


The development of moral behavior has 
proved to be remarkably refractory to psy- 
chological analysis and empirical study, de- 
spite the increasing attention that has been 
given to it. The difficulties in a psychological 
treatment of the origins of moral responses 
are undoubtedly related to the virtual absence 
of experimentation in which the conditions 
presumed to effect their establishment and
maintenance have been systematically varied.
There have been some informative assessments
(Allinsmith, 1960; Heinecke, 1953; Sears,
Maccoby, & Levin, 1957, Ch. 10; Whiting &
Child, 1953, Ch. 11) of moral responses in
the context of variations in parental child
rearing practices. But despite the theoretical
ingenuity exercised in interpreting the results
of these surveys, the very limited extent
and magnitude of the relationships uncov-
ered make it apparent that we have as yet
only a restricted understanding of how such
responses are acquired.

Contemporary conceptions of moral de-
velopment tend to attribute a wide array of
relevant phenomena to a unitary psychological
structure established through a single process
of acquisition. Thus, in the psychoanalytic
formulation (Fenichel, 1945, Ch. 6; Freud,
1936, Ch. 8), various aspects of internalized
morality are subsumed under the concept of
the superego and are viewed as collectively
acquired through the mechanism of identifica-
tion. Even more detailed and behavioral
accounts of moral development are inclined
to treat different forms of response as more
or less equivalent criteria of the “extent of
development of conscience” (Sears et al.,
1957, Ch. 10), of the “severity of standards”
(Allinsmith, 1960), or of the “strength of
superego formation” (Heinecke, 1953).

1 The author gratefully acknowledges the able
ad Syd and wise suggestions of Karl Brolein
a Sandowsky.

This assumption of an underlying unity in 
the forms and sources of moral behavior may 
obscure some important differences between 
specific responses and their distinct anteced- 
ents. For example, most of the psychological 
research on moral behavior has focused on its 
prohibitive and punitive components. Yet 
people are obviously moral in a broader sense 
than that of merely avoiding or reacting to 
transgressions. They also come to find many 
of their actions intrinsically rewarding, quite 
aside from any consequent external approval. 
Even the relatively monolithic psychoanalytic 
conception distinguishes between the puni-
tive superego and the rewarding ego ideal.
And if we think of moral responses as being 
acquired through social sanctions and becom-
ing, in varying degrees, independent of these
sanctions, then it seems clear that the rein-
forcements which originally define transgres-
sions and their consequences may be very
different from those which define actions to 
be experienced as rewarding or praiseworthy. 

Significant distinctions can also be made 
even among responses which reflect the puni- 
tive and prohibitive functions of morality. 
Many years ago, Hartshorne and May 
(1928) concluded from their studies of deceit 
and self-control that moral behavior was 
highly dependent on the external situation 
and often did not accord with verbalized moral 
judgment. More recently, Mowrer (1960b, 
Ch. 10), in applying learning concepts to 
socialization, has reproduced a letter from 
R. L. Solomon that describes work in progress, 
on internalization in dogs, in terms which 
suggest that resistance to temptation and 
reactions to transgression may derive from 
two patterns of reinforcement which are, to 
some extent, independent of one another. Hill 
(1960) has proposed that different types of 
internalized response to transgression may 
follow from different learning processes which 
vary in the temporal relations and rein- 
forcement contingencies between the child's 
responses and the onset or termination of 
punishment. And Whiting (1959) has used 
cross-cultural observations to infer that ef- 
fective moral control in the absence of direct 
external supervision need not rest on the 
experience of guilt and that a degree of 
internalization is possible merely through the 
fear of real or imagined external sources of 
punishment. 

The findings of a survey previously reported 
by the writer (Aronfreed, 1961) indicated 
that children's internalized responses to trans- 
gression assume a great variety of forms and 
often reveal little evidence of cognitive re-
sources of moral judgment usually associated
with the phenomenon of guilt. The response 
of self-criticism, for example, was very cir- 
cumscribed in its frequency of occurrence. 
Confession, apology, reparation, and com- 
mitments to modify future behavior were 
common, but the greater proportion of such 
responses appeared without the explicit ap- 
plication of a standard of judgment to the 
transgression. Furthermore, numerous re-
sponses were characterized by the perception
of externally defined consequences of trans-
gression. Although all of the responses found
in the survey were internalized, in the sense 
that they occurred without external observa- 
tion of the transgression and in the absence 
of any explicit or threatened punishment, 
there was no basis for assuming that the 
responses were alternate manifestations of a 
single, more fundamental reaction. There was, 
on the contrary, striking confirmation that the 
different responses were attributable to dif-
ferent patterns of social reinforcement in that
they were predictably related to the socio- 
economic status and sex role of the child, and, 
to a lesser degree, to maternal disciplinary 
practices. 

The two experiments reported in the 
present paper are attempts to examine the 
specific conditions of reinforcement affecting 
children’s use of two moral responses, self-
criticism and reparation, in controlled so-
cialization paradigms. The second experiment,
in replicating the first, introduced certain
conceptual and methodological refinements. 


EXPERIMENT I


One of the implications of the survey de-
scribed above is that transgressions may ap-
parently be socially defined for a child in a
minimally cognitive context. The internalized
behavioral consequences of a transgression
would certainly require discrimination, since
there must be distinct response cues to which
they are attached, but it would seem that
such consequences encompass a much broader
class of behavior than that for which self-
evaluation or judgment is a prerequisite.
There is considerable recent evidence (Kohl-
berg, 1963; MacRae, 1954; Peel, 1959)
that a normative framework of standards
for evaluating actions, in contrast to what
are merely expectations about their conse-
quences, is not simply a natural feature of
cognitive development and social experience,
as has been suggested by Piaget (1948). It
appears instead to be a variable function of
particular social roles and cultural settings
which the child has experienced.

Self-criticism, viewed in its bare essentials
as a consequence of transgression, is the use
of a verbal-symbolic referent to one's own
behavior, thoughts, or feelings. As such, it
represents a response in which the individual 
takes action, through the internal mediation 
of his own cognitive resources, with respect 
to himself as the object of action. The active 
quality of the self-critical response was 
recognized by earlier social-psychological 
theorists such as J. M. Baldwin (1906) and 
G. H. Mead (1934), who, while they were 
primarily concerned with morality as a phe- 
nomenon of valuation and consciousness, 
nevertheless viewed a moral response as being 
one in which the individual was capable of 
taking the role of another with respect to 
himself. The same property of action can be 
seen to apply to reparation, which uses the 
individual's own behavioral resources to cor- 
rect or ameliorate the effects of a transgres-
sion. Although reparative responses may
simply reduce unpleasant feeling without 
necessarily implying self-evaluation, they are 
always constructive or restitutive, and there- 
fore require some kind of manipulation of 
one's own behavior as well as of the external 
environment. 

The active, self-corrective character of
moral responses such as self-criticism and
reparation suggests that their establishment
might depend on the extent to which a child
has been previously socially reinforced for
evaluating and acting upon its own behavior.
A child who has been encouraged to make
corrective or punitive responses to its socially
unacceptable behavior would be expected to
more frequently resolve subsequent transgres-
sions through its own critical and reparative
actions, even in the absence of external
sanctions, than a child who has experienced
the consequences of transgression in events
outside of its own control. One might also
expect that a child’s self-mediated resources
for responding to transgression would be
enhanced by the incorporation of explicitly
verbalized standards into its socialization. A
cognitive context for punishment would pro-
vide evaluative labeling responses for the
child to act upon its own behavior with self-
criticism. It would also provide, therefore,
additional cues for appropriate reparative
responses of a kind quite different from those
provided by the stimulus of the transgression
itself. A number of studies (Allinsmith,
1960; Aronfreed, 1961; MacKinnon, 1938;
Sears et al., 1957, Ch. 10), in which reasoning
and the reinforcement of self-punitive reac-
tions were subsumed under the category of
“psychological,” “love oriented,” or “induc-
tion” techniques of discipline, have reported a
positive relationship between the use of such
techniques and the child's self-corrective
activity and independence of external events
in responding to transgression.

The basic procedure of the experiments 
reported in this paper was to use mild dis- 
approval and deprivation to punish the child 
repeatedly for an act of aggression carried 
out on 10 successive training trials. In the 
first experiment, two distinct treatments were 
employed. One treatment maximized cogni- 
tive structure through the verbalization of 
an explicit standard in reference to the aggres- 
sive acts, and also gave the child control over 
the evaluation and punishment of the trans- 
gression. In the second treatment, cognitive 
structuring was minimal, and evaluation and 
punishment of the child's behavior were 
entirely in the experimenter's control. A 
common test trial at the end of both treat- 
ments, where a more destructive transgres- 
sion occurred in the contrived breaking of 
a doll, was designed to elicit self-critical and 
reparative responses when the experimenter's 
punitive socializing role was terminated. The 
specific moral responses to be observed dur- 
ing the test trial were not made by the child 
in the course of the experimental treatment, 
in order that their appearance might be 
taken to reflect generalized response tend- 
encies and not merely isolated pieces of 
behavior repeatedly attached to a stimulus 
situation. The nature of the aggressive acts 
was purposely made unusual and somewhat 
removed from ordinary experience, so as to 
minimize the effects of the child's predisposi-
tions and maximize the effects of the
procedure.


Method 


Subjects. The subjects for the first experiment
were 57 fifth-grade girls drawn from two public
schools in a large urban school system.2 Both of
the schools served fairly homogeneous socioeconomic
areas of a working-class character. The children
were randomly assigned to the two experimental
conditions in roughly equal numbers.

2 The author is indebted to a number of adminis-
trators and teachers in the Philadelphia public school
system, whose cooperation made the experiments
possible.

Procedure. Each subject was individually taken 
by the experimenter, who was a male, from her 
classroom to the experimental room, where she was 
asked to sit in front of one end of a rectangular 
piece of composition board resting upon a small 
table. The board was roughly 2 × 3.5 feet in size.
Twenty-four toy soldiers of the small plastic 
variety were thickly clustered in a triangular 
formation at the other end of the board. Behind 
the base of the formation, at the very edge of the 
board, stood a wooden doll about 6 inches in 
height, with clearly feminine features and a black 
uniform not unlike a skiing outfit. A large cloth- 
padded cardboard box rested on a chair just 
beyond the far edge of the board and behind the 
doll. The box was below the height of the board’s 
surface, so that the interior was not visible to the 
subject when she was seated. The experimenter sat 
to one side of the board toward the far end. On 
a table adjacent to the experimenter’s right hand 
were some recording sheets, a box of small Tootsie 
Rolls, and an instrument to be introduced as the 
“pusher” (actually a miniature hoe, roughly 2 feet 
long). 

After the experimenter and the subject were 
seated, the experimenter used one of two procedures 
which were identical in all aspects not relevant to 
the experimental treatment, including the specifica-
tion of the behavior to be punished and the nature
of the punishment. The procedures differed only
in their degree of cognitive structuring and in the
direction of control over evaluation and punish-
ment. In the High Cognitive Structure-High Control
condition, the subject's activity was repeatedly put 
into the context of standards verbalized along the 
dimensions of careful-careless and gentle-rough, and 
the subject was also asked to evaluate and punish 
her own actions. In the Low Cognitive Structure- 
Low Control condition, only the minimal cognitive 
framework necessary to clarify the task was used, 
and evaluation and punishment were under the
experimenter's control. The task in which the child
was required to engage was arranged so that some 
degree of aggression was inevitable, since the children 
might otherwise have made punishment inappropri- 
ate by simply inhibiting any action construed as 
a transgression. 

High Cognitive Structure-High Control. The 
verbal instructions given by the experimenter were 
as follows (italicized words and statements were 
used only in this condition):


I have something here for you to do. The 
Army uses it to pick people for a special kind 
of work. To do this work, you have to be very 
careful and gentle. Being careful and gentle is 
the most important thing. 

Here’s the way we do it. Back here there is a 
nurse in a special uniform [experimenter points to 
doll]. She is important in case anyone gets sick,
and all of the soldiers love her because she is so 
good to everyone. Whenever there's any danger, 
she has to leave the field by going back into this 
box [experimenter indicates box by lifting it from 
chair]. Here’s how she goes into the box. You 
push her off the board with this pusher. You 
can’t lift the pusher; you must keep it down 
on the board when you push [experimenter 
demonstrates, then hands pusher to the subject].

Now the soldiers guard the nurse very carefully.
They stand all around her like this [experimenter
points to soldiers]. Of course, to push her off,
you will have to push through the soldiers, and
so you will knock some of them over, even though
you're trying to be careful and gentle [this is, in
fact, obviously true, as the spacing between the
soldiers is much too narrow to permit the pusher's
blade to pass without toppling some of them].
The idea is to see how many of the soldiers you 
will knock down. If you knock down just a few, 
that's good. If you knock down a lot of them, 
that's not so good. When you use the pusher, 
try to be as careful and gentle as you can. 

Here is a pile of thirty Tootsie Rolls for you
[experimenter places pile and small empty box
in front of the subject]. When we're all through,
you may keep however many you have left. Each
time, after you push off the nurse, you look at
the number of soldiers you knocked down, and
you decide how careless and rough you've been.
Then you decide how many Tootsie Rolls you
should take from your pile. You take the number
of Tootsie Rolls you think is right and put them
in the box here. Those are the Tootsie Rolls you
lose. You might take one Tootsie Roll, or two, or
three, but not more than three. The amount you
take depends on how careless and rough you
think you've been.


The subject then went through a series of 10
training trials. The experimenter began the first
trial by saying:


Remember to keep the pusher down on the board.
Push off the nurse and knock down as few
soldiers as you can. And be as careful and
gentle as you can. All right, go ahead.


The experimenter began all subsequent trials by 
simply saying: 


Okay, go ahead. 


On each trial, after the nurse had been pushed
over, the experimenter said:


All right. You knocked down some soldiers, so you
decide how careless and rough you've been. Take
as many Tootsie Rolls as you think is right and
put them into the box.


While the subject was making the Tootsie Roll
payment, the experimenter reset the nurse and
soldiers. Just as the subject put the Tootsie Rolls
into the box, the experimenter casually said:


Good! 


This verbal reinforcement was intended to make it 
somewhat easier for the child to continue the self- 
depriving behavior. 

Low Cognitive Structure-Low Control. The 
procedure in this condition was initially the same 
as that indicated in the first three paragraphs of 
instructions under the first condition described 
above, except that the italicized portions for maxi- 
mizing cognitive structure were eliminated here. 
The experimenter then continued as follows (itali- 
cized words and statements were used only in this 
condition) : 


Here is a pile of thirty Tootsie Rolls for you 
[experimenter places pile and small empty box 
in front of the subject]. When we're all through, 
you may keep however many you have left. 
Each time, after you push off the nurse, I look at
the number of soldiers you knocked down. Then
I decide how many Tootsie Rolls I should take
from your pile. I take the number of Tootsie Rolls
I think is right and put them in the box here.
Those are the Tootsie Rolls you lose. I might
take one Tootsie Roll, or two, or three, but not 
more than three. 


The subject then went through the series of 10 
training trials. The experimenter began each trial 
in the same way as indicated above under the first 
condition, except that the italicized statement (for 
the first trial) was eliminated. On each trial, after 
the nurse had been pushed over, the experimenter
said: 


All right. You knocked down some soldiers, so 
I'll have to take—let's see—[one, two, three]
Tootsie Rolls. 


The experimenter then removed the Tootsie Rolls 
and reset the nurse and soldiers. Since there was 
always considerable disarray, the experimenter had 
freedom to vary the number of Tootsie Rolls he 
took on different trials. This flexibility was used 
to equate the total loss of Tootsie Rolls for children 
under this condition with the total loss for children
under the first condition, where they controlled
their own losses.

Test trial. The eleventh trial of both of the
experimental treatments was planned as an apparently
unexpected disruption due to the breaking
of the nurse doll. The bottom half of one leg of
the doll was detachable, having been prepared so
that it was attached only by a tiny screw and
could be unobtrusively removed by the experimenter
with only a few gentle turns. Since the experimenter
had to be present during the test trial to observe
the child's responses, some change in the situation
had to occur to indicate to the child that the previous
punitive consequences of transgression were
no longer forthcoming. Accordingly, the experimenter
terminated his role as a socializing agent on this
trial and used nondirective, apparently casual verbal 
stimuli to make the situation more appropriate for 
the child to show her own overt reactions to trans- 
gression. 

When the subject pushed off the nurse on this 
trial, the experimenter looked into the box with a 
surprised expression and said: 


Oh, my—it’s broken. 


While making this statement, the experimenter 
reached into the box with both hands, as though 
slowly picking up the doll, and quickly removed 
the leg. This procedure took only 2 or 3 seconds, 
and the child could not see the interior of the 
box. The experimenter held up the doll in one hand 
and the detached leg in the other, and, looking at 
them (but not directly at the child), added: 


And we don’t have another nurse here to use 
for this—[1] I wonder why it broke.


This last query, spoken reflectively as though the 
experimenter were thinking to himself, was an 
indirect verbal stimulus to elicit a self-critical 
response in those subjects in whom it might be 
prepotent. 

If the subject gave any response to Stimulus 1 
that was relevant to the cause of the doll’s breaking,
whether or not it was self-critical, the experimenter
went on to the third stimulus (given below).
If the response was not clearly relevant to the 
doll's breaking, or if the subject gave no response, 
the experimenter presented Stimulus 2: 


Why do you think it broke? 


This second question was meant to provide a 
stronger eliciting stimulus for self-criticism. Then, 
regardless of the responses to the first two stimuli, 
Stimulus 3 was: 


Well, now that it’s broken, I wonder what we 
should do. 


This third comment was spoken reflectively and
was the first indirect stimulus intended to elicit 
reparative responses. If the subject offered any 
response that was relevant to the implied question, 
whether or not it was reparative, the experimenter 
terminated the test trial procedure. If the subject 
gave no response, or one that was not relevant to 
the question, the experimenter then presented the 
final, stronger eliciting Stimulus 4: 


What do you think we should do now? 


Then, regardless of the subject’s response to this last 
question, the test trial procedure was always terminated.
The subject’s responses to all of the experimenter's
verbal stimuli were written down verbatim
as they occurred. 
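
The order in which the four verbal stimuli were presented can be summarized in a short sketch. The names `run_test_trial`, `respond`, and `is_relevant` are hypothetical and appear nowhere in the original report; the judgment of whether a reply was relevant was made by the experimenter, and it is represented here only by a caller-supplied function.

```python
from typing import Callable, List, Optional, Tuple

# Wording of the four verbal stimuli as given in the procedure.
STIMULI = {
    1: "I wonder why it broke.",
    2: "Why do you think it broke?",
    3: "Well, now that it's broken, I wonder what we should do.",
    4: "What do you think we should do now?",
}

Reply = Optional[str]  # None stands for "no response"


def run_test_trial(
    respond: Callable[[int], Reply],
    is_relevant: Callable[[int, str], bool],
) -> List[Tuple[int, Reply]]:
    """Return (stimulus number, reply) pairs in the order presented."""
    transcript: List[Tuple[int, Reply]] = []

    def present(number: int) -> bool:
        reply = respond(number)
        transcript.append((number, reply))  # replies are kept verbatim
        return reply is not None and is_relevant(number, reply)

    # Stimulus 2 is given only if Stimulus 1 drew no relevant reply.
    if not present(1):
        present(2)
    # Stimulus 3 always follows; Stimulus 4 only if 3 drew no relevant reply.
    if not present(3):
        present(4)
    return transcript
```

Under this sketch, a child who is silent after Stimulus 1 but gives relevant replies to Stimuli 2 and 3 receives Stimuli 1, 2, and 3 only; a child who never responds receives all four.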

Closing procedure. A closing procedure was used 
to put the subject at ease about her performance
and to invoke her cooperation in not discussing the
experiment with other children. Informal checks, as 
well as the children’s behavior during the experi- 
ment, indicated that excellent security was 
maintained. 


Results and Discussion 


It was apparent that all of the children 
took the experimental situation seriously and 
that, without respect to treatment, they uni- 
formly paid close attention to the loss of 
Tootsie Rolls and exercised care in pushing 
over the nurse. Almost all of the children were 
visibly concerned about the breaking of the 
doll, though their responses were quite vari- 
able. Some self-critical responses were given 
to the experimenter’s first, rhetorical verbal 
stimulus (1), but the majority appeared only 
when the second, more direct stimulus (2) 
was presented. Likewise, most of the repara- 
tive responses occurred to the direct 
question (4) rather than to the indirect 
stimulus (3). Frequency counts were taken 
of the number of children who showed any 
instance (one or more) of each of the two 
types of response, to either the direct or 
indirect stimuli, and of the number of chil- 
dren who showed no evidence of the response. 

Self-critical and reparative responses were 
independently identified by highly specified 
criteria which required virtually no judgment 
or interpretation. A response was classified 
as self-critical if the child, in accounting for 
the doll’s breaking, referred to her behavior 
in pushing it—for example, any response 
indicating that she had not pushed the nurse 
“the right way,” had pushed it too hard, had 
pushed it so that it did not hit the box 
properly, etc. The fact that only four of the 
children actually used the words “careless” or
“rough” suggests that self-critical responses
did not represent merely what the child 
regarded as appropriate verbalizations of the 
experimenter’s words and that they followed 
from a more general self-evaluative orienta- 
tion induced by the procedure. Responses 
were classified as reparative when they in- 
dicated the child’s perception that the effects 
of transgression could be corrected or 
ameliorated through her own resources for 
constructive action. These responses invariably
took the form of suggestions for repairing
the doll or continuing the procedure, even
without the doll, in some alternative way.

Table 1 shows the frequency of self-critical
and reparative responses under each of the
experimental conditions. Both types of responses
were significantly more likely to
occur when cognitive structure and the
child's control over punishment had been
maximized than they were when there had
been minimal cognitive structure and control.
It is interesting to note that children
who gave no self-critical responses, particularly
those in the Low Cognitive Structure-Low
Control condition, often attributed
the breaking of the nurse to factors external
to their own actions. Thus, they might indicate
that the nurse was poorly constructed,
that it was broken by the physical impact
with the cloth-padded box, or that it had
been given too much use. Analogously, children
who did not make reparative responses
sometimes simply indicated their inability
to make any constructive suggestion. Some
made comments to the effect that the experimenter
should decide what ought to be done.
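
The significance statements for Table 1 rest on chi square values for 2 X 2 tables with a correction for continuity. A minimal sketch of that computation follows, assuming the usual Yates form of the correction; the function name `yates_chi_square` is hypothetical, and the function assumes that no row or column total is zero.

```python
def yates_chi_square(a: int, b: int, c: int, d: int) -> float:
    """Chi square with continuity correction for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    # Continuity correction: shrink |ad - bc| by n/2, but not below zero.
    diff = max(abs(a * d - b * c) - n / 2, 0.0)
    return n * diff ** 2 / (row1 * row2 * col1 * col2)


# Table 1 frequencies: rows are Present / Absent; columns are the
# High-High (N = 29) and Low-Low (N = 28) conditions.
self_criticism = yates_chi_square(18, 7, 11, 21)
reparation = yates_chi_square(16, 8, 13, 20)

print(round(self_criticism, 2))  # 6.52, the value given in the table note
print(round(reparation, 2))
```

For the reparation frequencies the same formula gives approximately 3.12, slightly below the 3.15 printed in the table note; the sketch does not attempt to account for that difference.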

TABLE 1

FREQUENCY OF SELF-CRITICAL AND REPARATIVE RESPONSES UNDER
CONDITIONS OF HIGH COGNITIVE STRUCTURE-HIGH CONTROL AND
LOW COGNITIVE STRUCTURE-LOW CONTROL

                      High Cognitive    Low Cognitive
Type of response      Structure-High    Structure-Low
                      Control           Control
                      (N = 29)          (N = 28)

Self-criticism
  Present                  18                 7
  Absent                   11                21
Reparation
  Present                  16                 8
  Absent                   13                20

Note.—Frequencies represent number of subjects who show
any instance (one or more) of a given response and number of
subjects who show no evidence of the response. Chi square values
for 2 X 2 contingency tables (employing correction for continuity)
are as follows: Self-criticism, χ² = 6.52, p < .01, one-tailed
test; Reparation, χ² = 3.15, p < .05, one-tailed test.

While the experimental effects confirmed
expectations, they were limited to indicating
that, in response to a transgression in the
absence of external punishment, the children’s
use of cognitive and behavioral resources
to act upon either their own actions or the
external environment was a function of the
extent to which they had previously been
provided with such resources and encouraged
to use them. The design of the experiment 
reflected a conceptualization of the antecedents
of self-criticism and reparation that obscured
certain more detailed and explicit
interpretations of the findings. The socializa- 
tion paradigms were constructed in a manner 
that made it impossible to separate the 
effects of cognitive structuring from those of 
control over punishment. Since the two moral 
responses bore a parallel relationship to the 
experimental treatments, it could not be 
ascertained whether contingencies existed be- 
tween them or whether they had independent 
antecedents which had been subsumed to- 
gether in the treatments. For example, the 
two responses might be viewed as inter- 
changeable variants of a single generalized 
response tendency or, alternatively, as sepa- 
rate response tendencies induced by common 
features of treatment. It was even possible
that one of the responses was induced only
secondarily through the mediation of the
other.

In order to examine the antecedents of 
the two moral responses more closely, a 
second experiment was designed to permit 
cognitive structure and control over punish- 
ment to vary independently of one another 
and to go beyond merely drawing a parallel 
between the properties of the responses and 
the resources provided by the socialization
paradigms. Conceptual changes were introduced
which emphasized the functional significance
of the responses and outlined the
specific mechanisms through which the two
antecedent variables might have different behavioral
consequences. Self-criticism and reparation,
while they might both be instances
of the child’s active use of its own resources,
were clearly distinct responses, and it was
difficult to imagine that they did not have
distinct determinants. One obvious possibility
that suggested itself was that self-criticism,
a cognitive and evaluative response, might be
more affected by the articulateness of standards
provided for the child than by the degree
of control over punishment. Conversely, a
reparative response, oriented toward actively
correcting the effects of transgression, would
seem more closely related to whether a child
was given responsibility for its own deprivation
than to the amount of cognitive structuring
present.

The second experiment also utilized a dif- 
ferent kind of sample in order to expand the 
generality of the expected findings. Previous 
evidence (Aronfreed, 1960, 1961; Sears et al., 
1957, Ch. 10) of the relationship of moral 
responses to socioeconomic status and sex 
role suggested the desirability of using 
middle-class boys as subjects, in contrast to 
the working-class girls used in the first experi- 
ment. Replication of the findings, under such 
a sampling variation, would indicate that 
the experimental effects could not be at- 
tributed to an interaction of the procedures 
with predispositions already attached to 
particular social roles. 


EXPERIMENT II 


The conceptual framework of the second 
experiment proceeded from the view that a 
behavior may be defined as a transgression 
to the extent that it has been exposed to 
negative (punitive) sanctions. The social 
punishment results in the behavior itself be- 
coming a cue for an affective response that 
may be given the general designation of 
anxiety, though it might well have various 
qualities dependent on the nature of the
punishment. When the anxiety is no longer 
contingent on the actual presence of punish- 
ment, it may be regarded as the first and 
invariant component of any internalized 
moral response to transgression. The anxiety 
is reducible by a number of different responses
which acquire instrumental value because
they reproduce certain significant cues
which are often associated, in the original 
socializing situation, either with the avoidance 
or termination of punishment or with the at- 
tenuation of the anticipatory anxiety that 
precedes punishment. The child may then 
make the relevant cue producing responses 
to reduce the anxiety aroused by subsequent 
transgressions, even in the absence of external 
punishment. The use of the term guilt, in 
this framework, would be appropriate only 
when the moral responses have a self-evaluative,
cognitive component.3


3 What is being described here, of course, is nothing
more than a learning process having two aspects
which correspond to varieties of classical and instrumental
conditioning. Mowrer (1960a) has described
the reinforcement value of cues associated with the
termination of punishment (and anxiety) as being
crucial to a complete account of secondary reinforcement.
A number of animal studies relevant to this
phenomenon have recently been summarized by Beck
(1961).

The cues associated with anxiety reduction
may, in certain instances, be stimulus
aspects of the child's own responses which
were originally effective in arresting or terminating
punishment. Children frequently learn
that their reparative responses (as well as
other responses, such as confession) are followed
by this kind of direct external reinforcement.
In the self-deprivation condition
of the first experiment, the child made a
reparative response, on each training trial,
that terminated a punitive situation and the
anxiety aroused by it. Presumably, such
training might well have induced a corrective
or ameliorative disposition on the test trial,
when there was no punishment, even if
explicit standards of evaluation had not been
provided. In naturalistic socialization, of
course, reparation may also be followed by
positive reinforcement, and the experimenter’s
saying the word “Good!” when the child removed
her own Tootsie Rolls might be taken
as analogous to such a reinforcement.

Direct external reinforcement of a moral
response does not provide, however, a very
satisfactory account of the origins of self-criticism.
The self-critical response is not
easily open to observation by others (it is
not ordinarily verbalized). Its initial appearance
in very young children, among
whom it is more often overt, is frequently
quite vigorous and sudden. Furthermore, it
actually reproduces an aspect of the socializing
agent’s punitive behavior, and the manner
in which its external reinforcement could be
controlled by the agent is not readily apparent.
For example, in the experimental
socialization paradigm with high cognitive
structure, only the experimenter had used
evaluative labels during training to refer to
the children’s behavior. The appearance of
a self-critical response on the test trial may
be viewed, then, as an adoption of the
experimenter’s role, a phenomenon with many
of the properties commonly referred to as
imitation or identification.

Freud (1933, Ch. 3; 1936, Ch. 8) at 
various times attributed the child’s adoption 
of a model’s behavior both to the desire to 
reproduce the characteristics of a loved 
object and to the desire to defend against 
the anxiety aroused by the threatening or 
aggressive aspects of the model. These two 
motive sources have been elaborated and 
modified by others (Bronfenbrenner, 1960; 
Freud, 1946, Ch. 9; Mowrer, 1950, Ch. 21; 
Sanford, 1955), but without a description 
of the specific reinforcement mechanisms 
through which the modeling takes place.
Direct external reinforcement has been suggested
to explain certain forms of imitation
(Church, 1957; Miller & Dollard, 1941), and
Hill (1960) has recently attempted to use
it to derive the origins of self-criticism. Other
theorists (Mowrer, 1960b, Ch. 3; Sears et al.,
1957, Ch. 10; Whiting & Child, 1953, Ch.
11) have taken the view that the child reproduces
stimulus properties of a model's
behavior which are already secondary reinforcers
because of their association with the
model’s affection and nurturance. A form of
observational learning has also been proposed
(Darby & Riopelle, 1959; Maccoby & Wilson,
1957), in which reinforcements or their affective
consequences are somehow vicariously
generalized from model to subject. Finally,
a number of recent theoretical treatments
specify that, because of the resources or
goal states controlled by the model but desired
by the subject, it becomes self-reinforcing
for the subject to maximize the perceived
similarity between self and model
(Kagan, 1958) or to covertly practice
(toward the self and others) role actions of
the model which occur in close contiguity to
the subject's own responses (Maccoby,
1959) or which maintain drive reducing
effects through their translation into control
over the resources in fantasy (Whiting,
1960).

The reinforcement mechanisms outlined
above are awkward in their application to
the experimental paradigm in question
(and to the ordinary socialization situation),
where the child makes a response that was
previously made only by the socializing agent
and that is clearly associated with punishment.
It is difficult to see how the self-critical
response could have been directly reinforced 
during the training trials, why the child 
would reproduce aspects of a nonnurturant 
experimenter’s role which obviously do not 
result in pleasurable experience, or why the 
child would receive vicarious satisfaction in 
perceiving the experimenter’s criticism of 
its actions. Likewise, the motivation for re- 
producing the experimenter’s control of puni- 
tive resources is not apparent. The child may, 
of course, have been making implicit self- 
critical responses during training, but we 
are still left with the problem of why the 
responses are made at all and of how they 
are reinforced. 

A solution to the problem is made possible 
by considering that moral responses may ac- 
quire instrumental value for reducing the anxi- 
ety aroused by a transgression through more 
than one pattern of reinforcement. Not only 
may they reproduce cue aspects of those 
behaviors of the child which were originally 
associated with avoidance or termination 
of punishment, as in the case of reparation, 
but they may also reproduce cue aspects of 
the previous punitive behavior of a socializing
agent. After a child has had some
experience with a transgression, punishment 
itself may come to serve as a cue signifying 
the attenuation of the anxiety that accom- 
panies its anticipation. When a child's punish- 
ment incorporates the verbalization of evalu- 
ative labels in reference to its actions, the 
labels, like any other component of punish- 
ment, may become cues for the termination of 
the anxiety that comes to be directly attached 
to the transgression. The child can then
subsequently itself make the evaluative
response, even in the absence of external
punishment, and thus reproduce the anxiety
reducing cues in its own behavior.

4 An appreciation of this cue value of punishment
is useful in understanding why some children's predominant
response to transgression might lie in the
perception of punishment in the actions of other
people or in impersonal fortuitous events.

We may assume that, as a result of the
training procedures, the child begins to experience
anxiety each time the soldiers are
knocked down. Under high cognitive structuring,
the experimenter's critical evaluation
(“careless” and “rough”) becomes part of
the anticipated punishment associated with
the termination of the anxiety. The place of
the verbal criticism in the punishment, and
its timing with respect to onset and termination
of anxiety, would therefore be expected
to motivate the child to reproduce the
critical responses during the test trial, regardless
of the degree of direct control that the
child has exercised over the loss of Tootsie
Rolls.


Method 


Subjects. The subjects for the second experiment 
were 68 fifth-grade boys from another public school 
in the same urban school system from which the 
girls for the first experiment had been drawn. The 
school was in a residential area having an entirely 
middle-class population. Seventeen children were 
randomly assigned to each of four experimental 
conditions. 
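
The report states only that seventeen children were randomly assigned to each of the four conditions. One way such a balanced assignment could be carried out is sketched below; the shuffle-and-split method, the function name `assign_to_conditions`, and the seed are assumptions made for illustration and are not described in the original.

```python
import random
from itertools import product
from typing import Dict, Hashable, Iterable, List, Optional, Tuple

STRUCTURE = ("High Cognitive Structure", "Low Cognitive Structure")
CONTROL = ("High Control", "Low Control")

Condition = Tuple[str, str]


def assign_to_conditions(
    subjects: Iterable[Hashable], seed: Optional[int] = None
) -> Dict[Condition, List[Hashable]]:
    """Shuffle the subjects and split them into equal groups, one per condition."""
    conditions = list(product(STRUCTURE, CONTROL))  # the 2 x 2 design
    pool = list(subjects)
    if len(pool) % len(conditions) != 0:
        raise ValueError("subjects cannot be divided equally among conditions")
    rng = random.Random(seed)  # seeded only so the example is repeatable
    rng.shuffle(pool)
    size = len(pool) // len(conditions)
    return {
        condition: pool[i * size:(i + 1) * size]
        for i, condition in enumerate(conditions)
    }


# 68 hypothetical subject numbers, 17 per condition.
groups = assign_to_conditions(range(1, 69), seed=1963)
for condition, members in groups.items():
    print(condition, len(members))
```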

Procedure. The second experiment used the same 
experimenter used in the first one. The procedures 
were also, in most essential respects, the same as 
those of the first experiment. They differed from 
the original procedures primarily in using four 
rather than two distinct conditions, and in intro- 
ducing minor changes of instruction and treatment 
into two of the four conditions, in order to permit 
cognitive structuring and control over punishment 
to vary independently of one another. There were
now two conditions in which explicit standards 
of evaluation were presented. The initial portion 
of the instructions for both of these conditions 
was identical to that described under the first 
three paragraphs of general instructions for the first 
experiment, including the italicized sections for 
maximizing cognitive structure. There were, like- 
wise, two conditions of low cognitive structure 
using the same initial instructions, but with the 
italicized sections removed. The procedure within 
each of the two sets of conditions then bifurcated 
so as to introduce variation in the degree of control 
over punishment. 

High Cognitive Structure-High Control. The
instructions continued here as shown in the fourth 
paragraph of instructions under the original High 
Cognitive Structure-High Control condition. 

High Cognitive Structure-Low Control. Here 
the experimenter continued the procedure as follows 
(italicized words and statements were used only 
under the condition with low control): 


Here is a pile of Tootsie Rolls for you [experimenter
places pile and empty box next to the
subject’s right hand]. When we're all through,
you may keep however many you have left.
Each time, after you push off the nurse, I look
at the number of soldiers you knocked down, and
I decide how careless and rough you've been.
Then I decide how many Tootsie Rolls I should
take from your pile. I take the number of
Tootsie Rolls I think is right and put them in
the box here. Those are the Tootsie Rolls you
lose. I might take one Tootsie Roll, or two, or
three, but not more than three. The amount I
take depends on how careless and rough I think
you've been.


The experimenter's statements in initiating trials 
were again the same as those described for the 
original High Cognitive Structure-High Control 
condition (initiation of trials did not carry any 
manipulation of the Control variable). The experi- 
menter’s procedure in terminating each trial was 
to say: 


All right. You knocked down some soldiers, so 
I decide how careless and rough you've been. I'll
have to take—let's see—[one, two, three]
Tootsie Rolls. 


Low Cognitive Structure-High Control. After the 
common initial portions of the instructions, the 
procedure here continued as follows (italicized words
and statements were used only under the condition
with high control):


Here is a pile of thirty Tootsie Rolls for you 
[experimenter places pile and empty box next
to the subject's right hand]. When we're all
through, you may keep however many you have 
left. Each time, after you push off the nurse, 
you look at the number of soldiers you knocked 
down. Then you decide how many Tootsie Rolls 
you should take from your pile. You take the 
number of Tootsie Rolls you think is right and 
put them in the box here. Those are the Tootsie 
Rolls you lose. You might take one Tootsie Roll, 
or two, or three, but not more than three. 


The experimenter initiated trials in the way 
described under the original Low Cognitive Struc- 
ture-Low Control condition (initiation of trials did 
not carry any manipulation of the control variable). 
In terminating each trial, the experimenter said: 


All right. You knocked down some soldiers. Take 
as many Tootsie Rolls as you think is right and 
put them into the box. 


Low Cognitive Structure-Low Control. The 
remainder of the instructions and procedure in this 
condition was the same as that described for the 
original Low Cognitive Structure-Low Control 
condition in the first experiment. 

For all four of the experimental treatments sum- 
marized above, the test trial and closing procedures 
were identical to those used in the first experiment. 


Results and Discussion 


The criteria for establishing the presence
of self-critical and reparative responses were
the same as those described in the report of
the first experiment. Table 2 shows the frequencies
of occurrence and nonoccurrence
of the two types of response under each of
the four experimental conditions. Chi square
values for the comparisons evaluating the
effect of each independent variable upon each
of the two responses are presented in Table 3.
The two tables indicate a series of significant
differences which are almost entirely those
expected. Self-critical responses are more
likely to appear when the socialization
paradigm provides explicit standards of evaluation
than when cognitive structure is minimal.
It is also clear that the effect of cognitive
structuring on self-criticism is in no way
contingent on the degree of control over
punishment given the child, since the effect
is equally apparent under conditions of both
high and low control. Control over punishment
alone, when separated from any variation
in cognitive structure, obviously has no
effect whatsoever on self-criticism. It seems
reasonable to conclude, therefore, that a
socialization situation in which the child
actively uses its own resources in response
to transgression, as happens under conditions
of self-deprivation, does not in itself evoke a
generalized response tendency of action with
respect to one’s own behavior, of which self-criticism
is one form. It would appear that
the reinforcement of a self-critical response
requires that punishment be associated with
the verbalization of specific cognitive labels.

TABLE 2

FREQUENCY OF SELF-CRITICAL AND REPARATIVE RESPONSES UNDER
FOUR CONDITIONS OF COGNITIVE STRUCTURE AND CONTROL OF
REINFORCEMENT AT PUNISHMENT TERMINATION

                      High Cognitive        Low Cognitive
                      Structure             Structure
Type of response
                      High       Low        High       Low
                      Control    Control    Control    Control

Self-criticism
  Present               11         10          5
  Absent                 6          7         12
Reparation
  Present               14          4         10
  Absent                 3         13          7

Note.—N = 17 in each of the experimental groups. Frequencies
represent number of subjects who show any instance
(one or more) of a given response and number of subjects who
show no evidence of the response.


TABLE 3


CHI SQUARE VALUES FOR FREQUENCY COMPARISONS OF
SELF-CRITICAL AND REPARATIVE RESPONSES UNDER FOUR
CONDITIONS OF COGNITIVE STRUCTURE AND CONTROL OF
REINFORCEMENT AT PUNISHMENT TERMINATION

                                         Chi square value
Comparison
                                     Self-criticism   Reparation

High Cognitive Structure versus
Low Cognitive Structure
  High Control
  Low Control
  Both groups

High Control versus Low Control
  High Cognitive Structure                .00           9.56**
  Low Cognitive Structure                 .00           1.06
  Both groups                             .06           9.94**

Note.—Chi square values for 2 X 2 contingency tables
(employing correction for continuity) based on frequencies in
Table 2.

* p < .05, one-tailed test.

** p < .01, one-tailed test.


The effects of the experimental treatments
on reparative responses, while generally in
the anticipated direction, are not quite as
definitive as the effects on self-criticism.
Control over punishment, rather than cognitive
structuring, is the major source of
variation. But its effect is clearly apparent
only under the condition of high cognitive
structure. There is some tendency for control
over punishment to affect reparative
responses even when cognitive structure is
minimal, but the tendency does not attain
statistical significance. Since cognitive structure
per se does not significantly affect
reparative responses, its effect on these responses
might be interpreted as a secondary
or modifying one. Reparation seems to be
reinforced primarily by giving the child
active control over the corrective or punitive
consequences of transgression. It is quite
possible, however, that explicit cognitive
labeling adds another dimension to this
control, and facilitates reparation through
the additional cue values which it provides.
Such an interpretation might find some tentative
support in the fact that, given the condition
of high control over punishment, there
is a tendency for reparative responses to be
more frequent when cognitive structure is
maximized. It is obvious, in any case, that
self-criticism and reparation cannot be
thought of as alternative or equivalent re- 
sponses to transgression deriving from a 
single pattern of socialization. Nor can either 
one of the responses be viewed as a funda- 
mental reaction to transgression through 
which the other is only secondarily mediated. 
The two responses must be regarded as out- 
comes of distinct patterns of social reinforce- 
ment, even though these patterns may happen 
to be intimately interwoven in the ordinary 
course of child rearing relationships. 
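
The chi square values reported in Table 3 can be checked against the frequencies in Table 2. The following is a minimal sketch, assuming the standard continuity-corrected (Yates) formula for a 2 × 2 table; the function name `yates_chi_square` is illustrative and not taken from the original report, and the "absent" count of 7 for self-criticism under High Cognitive Structure, Low Control is inferred from N = 17.

```python
def yates_chi_square(a, b, c, d):
    """Continuity-corrected chi square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    # Subtract n/2 from |ad - bc| (never going below zero) before squaring.
    corrected = max(abs(a * d - b * c) - n / 2, 0)
    numerator = n * corrected ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Reparation under High Cognitive Structure:
# High Control (14 present, 3 absent) versus Low Control (4 present, 13 absent).
reparation_high_structure = yates_chi_square(14, 3, 4, 13)
print(round(reparation_high_structure, 2))  # 9.56

# Self-criticism under High Cognitive Structure:
# High Control (11 present, 6 absent) versus Low Control (10 present, 7 absent;
# the 7 is assumed from N = 17).
self_criticism_high_structure = yates_chi_square(11, 6, 10, 7)
print(round(self_criticism_high_structure, 2))  # 0.0
```

Both results agree with the legible entries of Table 3 (9.56 and .00) for the High Control versus Low Control comparison under High Cognitive Structure.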

Certain restraints need to be recognized in 
interpreting the results of these experiments. 
While there was no punishment on the test 
trial, so that the moral responses observed 
showed internalization in the sense of being 
independent of the original negative reinforce- 
ments on which they were based, some 
ambiguity remains as to the degree. of inter- 
nalization, since the experimenter had to be 
present in order to observe the children’s 
responses. Further, the effect of the punish- 
ment itself, being compounded of disapproval 
and deprivation, is difficult to evaluate pre- 
cisely. The use of terms like “careless” and 
“rough” might have created a greater per- 
ceived intensity of punishment, a possibility 
conceivably reflected in the fact that minimal 
cognitive structuring seemed to depress even 
the impact of high control over punishment 
on reparative responses. Finally, the treat- 
ments are not entirely divorced from the 
child’s previous socialization, and generaliza- 
tion or transfer may have entered into their 
inducement of moral responses. An unequiv- 
ocal demonstration of the conditions neces- 
sary to establish a new moral response tend- 
ency would require the use of symbolic ref- 
erents with no previous evaluative connota- 
tion and a more exact control over the timing 
of punishment with respect to the behavior 
of both the child and the socializing agent. 
DIFFERENTIAL RESPONSE TO FAILURE IN HOSPITAL
AND NONHOSPITAL GROUPS 


SAMUEL J. NEIDITCH 1 
Florida State University 


90 functionally psychotic Ss and 90 nonhospitalized Ss were given varying 
amounts of failure and nonfailure experiences on a puzzle task. These 
experiences were temporally associated with a neutral paper-and-pencil task. 
Tests were made for any differential effect upon preference for the neutral task 
after this association. A 3-dimensional analysis of variance design yielded 
significant results (beyond the .01 level) only in the groups of psychotics 
subjected to failure. These Ss lowered their preference for the formerly 
neutral task. The results may indicate a loss in the psychotic’s ability to 
respond differentially and appropriately to individual experiences. 


A portion of the research aimed at predicting
and modifying behavior deals with
the effect of failure upon different individuals.
Success and failure are concepts which,
fortunately, are amenable to operational
definition. Studies in this area (Filer, 1952;
Gebhard, 1948, 1949; Jessor, 1954; Lantz,
1945; Sears, 1937) have generally indicated
that failure experiences tend to influence
attractiveness and performance in a negative
manner with reference to the failed tasks
themselves as well as tasks similar to the
original. Expectations of subjects were also
lowered after encountering these failure
experiences.

Zeller (1950), using learning of nonsense
syllables and a test-retest method, induced
subjects to fail on a different intermediate
task (block tapping). Subjects who failed
on the intermediate task were lower in their
learning ability than those who did not
fail. Zeller considered his subjects to be a
somewhat emotionally unstable group. They
were college student volunteers who were
hoping to receive personal psychological
services.


1 Now at Patton, California, State Hospital.
... in the statistical problems.
I am also indebted to the administrative officials
... Indiana, and Patton State Hospital
for the time allowance and the assistance in securing
subjects for this research.

Etzel (1953) compared children’s prefer- 
ences for tasks as influenced by failure experi- 
ences on different intermediate tasks. She 
compared sixth-grade children on their Prefer- 
ence-For-Doing one of nine tasks in a bat- 
tery of a variety of tasks. She first used 
preliminary rankings by a different group to 
discover a “neutral” task. This task was then 
used in the experiment by temporally as- 
sociating it with failure and nonfailure experi- 
ences on some other type activity. Her re- 
sults indicated that preference for the neutral 
task was higher when it followed a failure 
experience on a different task than when it 
did not follow failure. However, when the 
subject was subjected to two such experiences, 
i.e., failure-neutral task, failure-neutral task, 
it resulted in lowering the subject’s preference 
for the neutral task. Two tentative hypoth- 
eses were offered to explain the latter 
result: the occurrence of an increase in as- 
sociation between the failure experience and 
the neutral task; increased generalization of 
a response resulted from an increase in 
severity of frustration. 

The research reported here was, in part, 
aimed at observing a situation similar to the 
experiment cited above, i.e., the influence of 
failure and nonfailure experiences in one task 
upon a temporally associated, but different 
type of second task. It was also the purpose 
of this experiment to examine these reactions 
in two different groups of people—those in 
a psychiatric hospital, and those never
treated for any emotional difficulty. In view
of the prevalence of feelings of failure en- 
countered in examining patients being treated 
for emotional disturbances, it was felt that 
this feeling may serve as a base for a 
potential measurable distinguishing character- 
istic between the two groups of interest. 

Subjects were, in addition, given varying 
amounts of these failure and nonfailure 
situations in order to determine the effect 
of this variable upon the hospitalized and 
nonhospitalized groups. 


METHOD 
Subjects 


Ninety hospitalized subjects met the following 
criteria: they were patients in a psychiatric institu- 
tion, with an emotional disorder as a primary 
diagnosis; organicity was not evident on the basis 
of available data; they were between 18 and 35 
years old; they could read, write, and follow in- 
structions; total hospitalization time did not exceed 
5 years.

Ninety nonhospital subjects were selected from 
various sources—church groups, students, hospital 
personnel (primarily nurses and psychiatric tech- 
nicians). The selection criteria corresponded to the 
hospital group except that the nonhospital sub- 
jects had never received treatment for any emotional 
disturbance or central nervous system disorder.


Apparatus 


Two puzzle boxes, adapted from a box devised 
by Keister (1943), were used in the experiment. 
These were ordinary cigar boxes, altered so that 
the bottom was raised to within .25 inch of the 
top. Nine small, flat, wooden figures were located 
in each box. Figures were shaped like various
objects—flower, airplane, pocket watch, etc. Both 
boxes and figures were identical except that the 
figures in one box were each somewhat larger than 
those in the other box. Both sets of figures fit into 
their box without overlapping, allowing the lid to 
close completely. 

These puzzle boxes were called the Primary 
Experience variable. On the basis of pre-experimental 
trials, it was considered highly improbable that 
any subject given the box with the larger figures 
could place them according to instructions in the 
time allowed. This proved to be correct in that no 
Failure subject solved the puzzle during the experi- 
ment. Working on this box was therefore defined 
as the Failure (F) level of this variable. Subjects 
who worked on the box with the smaller figures 
had no difficulty placing them correctly and thus 
composed the Nonfailure (NF) level. The con- 
trols (C), both hospital and nonhospital, received 
neither puzzle box. 

A battery of nine paper-and-pencil tasks was 
also used in the experiment. These were ...
tasks, such as requiring the subject to estimate the
length of lines (multiple choice), underlining words
that rhymed, crossing out all letter “o’s” in a
passage, etc. There were two full size forms of
this battery, each task on one side of standard
paper. The corresponding tasks in the two forms
were alike except for the specific content. Four short
forms of each type of task were also available,
each about one-third to one-half sheet in length.

Procedure 


Subjects were tested individually by the experimenter.
They were informed that the experiment
was aimed at discovering how people liked doing
different tasks. They were also told not to ...
to finish any task. The first battery of nine paper-and-pencil
tasks (B1) was then administered. Subjects
were stopped on each task just beyond the
half-way point. The mean time per task was ...
seconds. Upon completion of all nine tasks the
subject was given an unused B1 set, in the same
order as administered, and instructed to rank them
according to how he liked doing them. A 2-minute
rest followed.

The puzzle box was then administered. Subjects
were very briefly shown the open box with the
pieces in place. The box was then emptied on
the table and the subject instructed to place the
pieces back in any way that would allow the lid to
close completely.

Failure situation. The Hospital and Nonhospital
F groups used the box with the larger figures and
were told that most people could do it in the
time allowed—2 minutes. As they worked, the
failure experience was reinforced by the experimenter's
comments—“You seem to be having
trouble,” “You can't seem to get it,” etc. After
2 minutes they were stopped.

Nonfailure situation. The above comments, including
the time limit, were omitted for the NF
groups. These subjects received the box containing
the smaller figures, and all managed to place them
in the box in less than 2.5 minutes.

Control situation. Hospital and Nonhospital C
subjects received neither puzzle box. They rested
during this period.

Immediately following the puzzle period all subjects
received a short form of the fifth ranked
task in his own B1 ranking. The Primary
Experience variable (puzzle situation) was thus
temporally associated with the neutral task. Subjects
were allowed to complete their short form
task, then rested for 3 minutes.

The other independent variable, dealing with the
Number of Associations of Puzzle-Neutral task
each subject was to receive, determined the procedure
which followed. Subjects receiving more than one
association repeated the puzzle, followed immediately
by another short form of the same paper-and-pencil
task. The Two Association group thus received
the procedure twice; the Four Association group
repeated the procedure four times.




At the conclusion of the proper number of 
repetitions of the association procedure subjects 
rested for 3 minutes and were then given the 
second battery of paper-and-pencil tasks (B2). All 
conditions were similar to the administration of 
B1. Subjects were stopped on each task at the
same point completed on the corresponding B1
task. They were then asked to rank a blank B1
battery according to how they liked doing them. 
Special note was made of the final position of 
the subject’s own original central ranked task. 

At the conclusion of the experimental procedure 
all F subjects were told the true difficulty of the 
puzzle and impressed with the need to withhold 
this information from others temporarily. 


Design 


A table of random numbers was used to assign
subjects to 18 cells in a 2 × 3 × 3 analysis of
variance design. The design was thus composed of
three independent variables, with two, three, and
three levels to each variable: Subjects (psychiatrically
hospitalized, nonhospitalized), Primary
Experience (F, NF, C), Number of Associations
(one, two, four). There were 10 subjects in each cell.


Order of administration was random. 
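
The assignment described above can be sketched in code. This is only an illustration under stated assumptions: the original study used a printed table of random numbers, whereas the sketch substitutes a pseudo-random shuffle, and the function name `assign_to_cells` and the subject identifiers are hypothetical. Each subject group (hospital or nonhospital) of 90 is spread over the 9 combinations of Primary Experience and Number of Associations, 10 per cell.

```python
import itertools
import random


def assign_to_cells(subject_ids, experiences=("F", "NF", "C"),
                    associations=(1, 2, 4), per_cell=10, seed=None):
    """Randomly assign one subject group to Experience x Associations cells."""
    cells = list(itertools.product(experiences, associations))
    if len(subject_ids) != len(cells) * per_cell:
        raise ValueError("group size must equal number of cells times per_cell")
    rng = random.Random(seed)
    shuffled = list(subject_ids)
    rng.shuffle(shuffled)  # stands in for the table of random numbers
    # Slice the shuffled list into consecutive blocks, one block per cell.
    return {
        cell: shuffled[i * per_cell:(i + 1) * per_cell]
        for i, cell in enumerate(cells)
    }


# Hypothetical identifiers for the 90 hospitalized subjects.
hospital_ids = [f"H{i:02d}" for i in range(90)]
hospital_cells = assign_to_cells(hospital_ids, seed=1)
print(len(hospital_cells))              # 9 cells per subject group
print(len(hospital_cells[("F", 4)]))    # 10 subjects in each cell
```

Running the same function on the 90 nonhospital subjects gives the remaining 9 cells, for 18 cells in all.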


RESULTS 


No significant differences were found in the 
number of subjects choosing a specific task as 
neutral in the original ranking. This mini- 
mized the possibility of a single task produc- 
ing a systematic bias. Tests were performed 
on other variables which were felt to have a 
potential influence on the results. Mean ages 
of the hospital and nonhospital groups were 
28.0 and 26.4 years, respectively. Education 
levels were 11.4 and 12.7 years. Mean length 
of hospitalization ranged from 12.3 to 16.7 
months. The differences noted in the above
variables were not significant. 

Table 1 presents the data of the main 
dependent variable. Table 2 contains the 


TABLE 1

SECOND RANKING OF THE ORIGINAL
FIFTH RANKED TASK

              F1    F2    F4    NF1   NF2   NF4   C1    C2    C4
Hospital
  M           5.9   6.1   7.0   4.4   4.6   4.9   5.0   5.2   4.9
  SD          1.8   2.0   1.9   1.6   1.9   1.8    .3   0.9   2.3
Nonhospital
  M           3.9   4.0   5.9   5.4   5.6   5.3   4.8   5.3   4.8
  SD          1.8   1.8   1.6   1.5   1.5   1.7   1.4   1.5   1.5


Note.—F = Failure, NF = Nonfailure, C = Control. 
TABLE 2

SUMMARY OF ANALYSIS OF VARIANCE BASED UPON
FINAL PREFERENCE RESPONSES TO A NEUTRAL TASK

Source                               df      MS       F

Subjects (A)                        ...     ...     ...
Puzzle (B)                          ...     ...     ...
Number of puzzle associations (C)   ...     ...     ...
A × B                               ...     ...     ...
A × C                               ...     .87     .29
B × C                               ...    7.09    2.41
A × B × C                             4     .49     ...
Within cells                        162    2.94

Total                               179

Note.—Bartlett test of homogeneity of variance yielded a
nonsignificant chi square of 9.894. Chi square at the .01 level ...
*p < .01 (F at .01 = 4.75, df = 2/162).


analysis of variance of this variable. The 
significant F ratio was found in the inter- 
action of the Subject variable with the 
Primary Experience (puzzle) variable. It 
implied a differential effect of the puzzle 
conditions upon the different levels of sub- 
jects. Main effects were not significant. There- 
fore, tests were made of the simple effects of 
each factor at each level of the other factor. 
The simple effects of the nonhospital groups 
were not significant, indicating that, in the 
present experiment, the nonhospital subjects 
were not differentially affected by the various 
experimental conditions. 
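
The structure of the analysis of variance can be made concrete with a short sketch. It assumes the standard degrees-of-freedom rules for a fully crossed between-subjects factorial design and the usual definition of F as an effect mean square divided by the within-cells mean square; the function name `factorial_anova_df` is illustrative only. The mean squares used (7.09 for B × C, 2.94 within cells) are the values legible in Table 2.

```python
from itertools import combinations
from math import prod


def factorial_anova_df(levels, n_per_cell):
    """Degrees of freedom for each source in a crossed between-subjects design."""
    names = list(levels)
    df = {}
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            # A main effect or interaction has the product of (levels - 1) as df.
            df[" x ".join(combo)] = prod(levels[name] - 1 for name in combo)
    n_cells = prod(levels.values())
    df["Within cells"] = n_cells * (n_per_cell - 1)
    df["Total"] = n_cells * n_per_cell - 1
    return df


# A = Subjects (2 levels), B = Primary Experience (3), C = Associations (3).
design = {"A": 2, "B": 3, "C": 3}
df = factorial_anova_df(design, n_per_cell=10)
print(df["A x B"], df["Within cells"], df["Total"])  # 2 162 179

# F ratio for the B x C interaction from the legible mean squares.
f_bc = 7.09 / 2.94
print(round(f_bc, 2))  # 2.41
```

The within-cells and total degrees of freedom (162 and 179) and the B × C ratio (2.41) match the corresponding entries in Table 2, and the A × B interaction has the 2/162 degrees of freedom cited in the table note.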

Significant results, beyond the .01 level 
(F = 10.75), were obtained in the analysis 
of the hospital groups with reference to the 
puzzle conditions. The t test conducted on
all pairs of cells within these interaction
groups revealed differences beyond the .01 level
between the Hospital F groups and every 
other group tested (Hospital NF, Hospital C, 
Nonhospital F, Nonhospital NF, Nonhos- 
pital C). 

Diagnosis of the patients in each hospital 
group was very similar. All but three, in three 
different groups, were labeled as psychotic. 

The results may be summarized: 

1. A negative effect was noted in the 
reaction of hospitalized psychotic subjects 
to original neutral tasks which had been 
temporally associated with failure experiences. 

2. Nonhospital subjects were not measur- 
ably affected by the experimental variables. 
3. No significant differences were found on
the basis of varying amounts of failure and 
nonfailure experiences. 


DISCUSSION


Numerous studies have been performed 
indicating that failure experiences tend to 
produce various negative reactions related 
to the failed tasks and other, similar tasks. 
Zeller’s (1950) results also point to a nega- 
tive effect on performance of tasks temporally 
associated with entirely different failure ex- 
periences. These latter results were obtained 
with what Zeller considered relatively un- 
stable subjects. 

Etzel’s (1953) study initially pointed to a 
positive effect of failure upon preference for 
different type tasks which followed the 
failure. However, although her subjects were 
small children, she too found a reversal in 
preference with repeated associations of the 
primary task and the failure experience. This 
negative effect was noted to be highly sig- 
nificant in the present experiment dealing 
with individuals diagnosed as having a func- 
tional psychosis. However, unlike Etzel's 
results, the significant negative influence upon 
the neutral task was produced after a single 
temporal association with the failure situa- 
tion. The difference between these studies 
may reflect a characteristic of the psychotic— 
a relative general loss in their ability to respond
adaptively and appropriately to individual
experiences. They seem to be characterized
by what may be described as a
feeling failure set. 

Osgood (1953), in discussing the type of 
behavior observed in the present experiment, 
points out that it is not necessary that be- 
havioral principles invariably be related to 
adjustive consequences. This certainly ap- 
plies to the generalization of a response. A 
response which is carried over from one 
stimulus to another, and not actually a func- 
tion of the latter stimulus, may be considered 
maladaptive. From this point of view, the 
characteristic behavior observed in the Hos- 
pitalized F subjects of the present experiment 
may be “maladaptive.” This type of re- 
sponse may thus also be a potential measure 
of psychological adjustment. 

Borrowing a concept from Rotter’s (1954) 
social learning theory, this study may also
be viewed as an attempt to examine differential
expectancies. Expectancy, within
this context, is essentially defined as the
“internal probability” that a particular event
will occur, and it is determined by the frequency
of its occurrence in past events. If
failure is an important, repetitive occurrence
in the past experience of individuals with
functional psychosis, they may be expected
to have an elevated internal probability of
failure occurring in future events.

Rotter’s concept of the generalization of
changes in expectancy is also closely related
to the results of the studies considered here.
The concept proposes a gradient of expectancies,
i.e., a child who has been refused
a candy bar might have a lower expectancy of
getting an ice cream cone, but not be affected
in his expectancy of hurting his foot should
he kick the door. Jessor’s (1954) study supported
this concept. However, the responses
of the psychotics in the present experiment
did not appear to be confined to this gradient.
Their behavior appeared to transcend the
dimension of similarity. It seemed to reflect
their inability to make the discriminations
and unbiased evaluations necessary for efficient
adjustment to a current situation.

These concepts have been cited to emphasize
the relatively more objective methods of
dealing with the type of information obtained
in this experiment. These formulations are
very applicable in dealing with success and
failure. However, research has not generally
dealt with these variables as potential measures
of psychological adjustment. Nevertheless,
the important influence, and from one
point of view, the irrational aspects, of the
observed responses to failure, were exemplified
in an early study by Cartwright (1942) which
indicated that anticipation of failure reduced
attractiveness of a task even more than the
failure experience itself.
PERSUASIBILITY IN YOUNG CHILDREN AS A FUNC-
TION OF AGGRESSIVE MOTIVATION AND 
AGGRESSION CONFLICT 


ALAN ROLAND 
Adelphi College 


This study investigating the dynamic bases of the reported negative relation- 
ship of overt aggression to persuasibility postulated that low persuasibility is 
associated with an overtly aggressive orientation, high persuasibility with a 
defensive need to inhibit strong aggressive motivation, and medium persuasi- 
bility with a relative absence of aggressive motivation and inhibitions against 
aggression. 3 groups of 15 were selected from an initial sample of 169 1st- 
grade boys for high, medium, and low persuasibility, and were compared on 
aggressive motivation and an aggression conflict ratio. The results are that 
High and Low persuasible (P) Ss have more aggressive motivation than 
Medium P Ss; High P Ss have a smaller aggression conflict ratio than Low 
P Ss. The results thus support the initial postulates.


The predisposition to agree or disagree 
with other people, which has been labeled 
“persuasibility” by Janis and Hovland 
(1959), is an important element in the 
acquisition of attitudes and opinions. Since 
this predisposition is considered to be a per- 
sonality trait, it is supposedly independent 
of specific factors such as the content of the 
communication, the particular communicator, 
the medium of communication, and rewards 
and punishments in the communication situation.
In an attempt to elucidate the motivational
factors underlying persuasibility,
Hovland and Janis (1959a) have recently 
collected the results of research relating per- 
suasibility to other personality variables. One 
promising aspect of this work is the finding 
for both adults (Janis & Rife, 1959) and 
children (Lesser & Abelson, 1959) that the 
expression of overt aggression and persuasi- 
bility are negatively related. These authors 
focus on one part of the relationship, however:
the association of high overt aggression and
low persuasibility. The present study is an
attempt at a more detailed investigation, exploring
the determinants of middle and high
persuasibility as well. 

From a motivational standpoint, the associ- 
ation of high overt aggression with low per- 
suasibility seems easy to understand. Low 
persuasibility, or the predisposition to dis- 
agree, may be considered as one aspect of an 
overtly aggressive orientation to the environment.
However, the association of high persuasibility
with low overt aggression is not
as understandable, since a relative absence of
overt aggression does not necessitate a predisposition
to agree. It seems reasonable that
an individual who displays little aggression
may still be assertive enough to express his
own opinion. Similarly, the corresponding
association of medium persuasibility with
moderate overt aggression has no obvious
explanation.

A more detailed analysis of overt aggression
may provide a basis for understanding the
negative relationship of overt aggression with
persuasibility. Mussen and Naylor (1954),
Kagan (1956), and Lesser (1958b) have
demonstrated that the amount of overt aggression
is determined not only by the level
of aggressive motivation, but also by the
strength of inhibitions against aggression.
This implies that relatively low overt aggression
may be a function of a low level of
aggressive motivation, or that inhibitions
against aggression may be sufficiently strong
to prevent the expression of aggressive
motivation despite the motive’s intensity.
Bach (1945) found that in children as overt
aggression tended toward the very low end
of the continuum, it became more a function
of the strength of inhibition than of a low
level of aggressive motivation. However, the
same study showed that a moderate expression
of overt aggression was related to the factor
of low aggressive motivation.

These results suggest the following interpretations
of the negative relationship of overt
aggression with persuasibility:


1. Low overt aggression is associated with
high persuasibility not through the absence
of aggressive motivation, but rather through
the presence of strong inhibitions against
aggression. A stable predisposition to agree
may be considered related to a defensive need
to inhibit any overt expression of aggression.
Strong inhibitions against aggression probably
operate only when substantial aggressive
motivation is present.

2. The middle range of persuasibility is
theoretically related to a low level of aggressive
motivation which obviates the necessity
for inhibition. When the level of aggressive
motivation is low, there is no consistent need
either to agree or disagree, and therefore the
incidence of agreement is probably determined ...

3. The lower range of persuasibility would
then be related to the presence of strong
aggressive motivation and relatively weak
inhibition. It is this situation which results
in a high level of overt aggression.


Two hypotheses were derived from this
analysis: (a) High and Low persuasible (P)
subjects have a higher level of aggressive
motivation than Medium P subjects; (b)
High persuasibility is associated with a
smaller ratio of aggressive motivation to
inhibitions against aggression than low
persuasibility.

METHOD

Subjects 


The sample consisted of 169 first-grade boys
from the public school system in Uniondale,
New York. Boys were used as subjects to control
the sex variable. Since Lesser and Abelson (1959)
in their persuasibility studies and Bach (1945) and
Sears (1951) in their doll play aggression studies
found significant sex differences, only boys were used
as subjects in the present investigation. In both
... it was found that more significant
results were elicited from boys than girls. First-grade
children were chosen because this age group was
employed by Abelson and Lesser (1959) in developing
a measure of persuasibility and by Pintler (1945),
..., and Hollenberg and Sperry (1951)
in developing experimental techniques for measuring
aggression in doll play. As evaluated by Abelson and
Lesser, the Uniondale population is largely
upper-lower-middle class in socioeconomic status.


* ference in age between the subjects 
Measures of Persuasibility


Persuasibility was measured by a Persuasibility 
Booklet measure (Examiner Persuasibility) and by 
an incomplete stories technique (Mother Persuasi- 
bility), both developed by Abelson and Lesser 
(1959), and administered according to their pro- 
cedure. However, the original Persuasibility Booklet 
measure, consisting of drawings of 19 pairs of objects, 
was expanded for the present study to include two 
alternate-form booklets of 19 items in each (Form A 
and Form B). This was done to obtain a more stable 
measure of persuasibility. In a preliminary study, 
the alternate form reliability was tested by two 
teachers administering the two booklets 2 weeks 
apart to 56 subjects in two classes, yielding a co- 
efficient of .81. In the present study, the two forms 
of the Persuasibility Booklet were administered 1-3 
days apart, in this case by the experimenter, to 169 
boys, yielding an overall reliability coefficient of .73. 
The coefficients for the 12 classes ranged from .48 
to .96. However, for those classes from which the 
three groups of subjects were selected, the coefficients 
ranged from .62 to .96, all significant at the .01 level. 
These figures compared favorably with split-half 
reliability coefficients obtained by Abelson and Lesser. 
Abelson and Lesser presented evidence for the valid- 
ity of the two measures, as well as the reliability 
of the Mother Persuasibility measure. 


Measures of Aggressive Motivation and In- 
hibitions against Aggression 


A modified TAT procedure was developed by 
Lesser (1958b) to evaluate the strengths of approach 
and avoidance tendencies in aggression. Ten pictures 
designed to elicit aggressive responses and, as far 
as possible, to avoid stimulus elements which would 
evoke dependent, affiliative, or other needs were used. 
However, the pictures varied quite widely in the 
extent to which they pertained to aggressive motivation.
Level of aggressive motivation was defined by
a fantasy aggression score and inhibitions against 
aggression by a fantasy aggression anxiety score. The 
aggression conflict ratio was defined as the ratio of 
fantasy aggression to fantasy aggression anxiety, the 
latter consisting of the total score of the four cate- 
gories of fantasy aggression anxiety developed by 
Lesser. 

The scoring system used in the present study was 
similar to that developed by Lesser (1958b) with 
the major modification that all aggressive responses 
were scored provided that responses beyond the first 
one were not classified as aggression anxiety. Thus, 
a fantasy aggression score of more than one could 
be assigned to a story. This modification took into 
account the differences in stories between Lesser’s 
study and the present one, wherein the initial aggres- 
sive theme was often repeated and additional ag- 
gression was expressed by characters whose actions 
were unconnected with the principals in the story.
These differences are due presumably to the difference
in age between the subjects in the present
study and those studied by Lesser (6-year-old boys
in contrast to 10-13 year old boys). Each story was 
recorded verbatim by the experimenter and scored 
afterwards by two judges. Interjudge agreement on 
scoring for fantasy aggression was 89% and for 
fantasy aggression anxiety, 79%.

An experimental doll play procedure developed by 
Pintler (1945) and Phillips (1945) was also used to 
measure aggressive motivation and inhibitions against 
aggression. This procedure was designed to elicit the 
maximum expression of young children's aggressive 
motivation over the course of several play sessions 
by experimental reduction of inhibitions against ag- 
gression rather than by instigation of the aggressive 
drive. Each subject was seen for three sessions of 
15 minutes each. This seemed to be the maximum 
number of sessions and length of time that all sub- 
jects could maintain story elaboration. Aggressive 
motivation was defined in the present study as the 
aggression expressed in the third, or last play session, 
and the aggression conflict ratio as the aggression 
expressed in the first play session. Aggression was 
measured by its latency, frequency, and intensity. 
The definition of the aggression conflict ratio was 
derived from studies by Bach (1945), who found 
that aggression in the first session is related to normal 
overt aggression, and by Lesser (1958b), who demon- 
strated that overt aggression is a function of the 
ratio of aggressive motivation to inhibitions against 
aggression. On the basis of the above definitions,
inhibitions against aggression were found by algebraic 
derivation to be expressed by the ratio of aggression 
in the third session to aggression in the first session. 
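
The algebra behind this derivation can be sketched as follows. If overt aggression, indexed by first-session aggression, equals the ratio of aggressive motivation to inhibition, and aggressive motivation is indexed by third-session aggression, then inhibition equals third-session aggression divided by first-session aggression. The function name `inhibition_index` and the example scores are hypothetical and serve only to illustrate the ratio; they are not data from the study.

```python
def inhibition_index(third_session_aggression, first_session_aggression):
    """Inhibition expressed as third-session over first-session aggression.

    Derivation (under the definitions stated in the text):
        first_session = motivation / inhibition
        motivation    = third_session
        => inhibition = third_session / first_session
    """
    if first_session_aggression <= 0:
        raise ValueError("first-session aggression must be positive")
    return third_session_aggression / first_session_aggression


# Hypothetical scores: little aggression at first, much more by session three,
# which under these definitions indicates relatively strong inhibition.
print(inhibition_index(12, 3))  # 4.0
```

A child whose aggression is the same in the first and third sessions receives an index of 1.0, which under these definitions reflects the weakest inhibition relative to motivation among children whose aggression does not decline.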

The scoring system used in this study was a result 
of testing in a pilot study scoring methods developed 
by Pintler (1945), Hollenberg and Sperry (1951), 
and Sears (1951). Frequency and intensity of aggres- 
sion were derived from the two major categories of 
aggression which occurred in the doll play procedure: 
thematic and nonthematic. The former consisted of 
all aggression expressed in the story about the doll 
family, verbal and physical, while the latter consisted 
of any aggressive expressions by the child towards 
the experimenter. Frequency of aggression was scored 
for each verbal or physical aggressive response of 
thematic aggression, as well as for each expression 
of nonthematic aggression. Intensity of aggression 
was scored on a scale for verbal and physical 
thematic aggression, and was given the maximum 
score for intensity on nonthematic aggression. A 
further score for aggression consisted of aggression 
latency, or the time elapsed between the start of a 
play session and the initial aggressive response. 

The recordings of aggressive responses and the 
latency of the first aggressive response were made 
by the experimenter. No one-way vision room was 
available in the schools in which this study was 
made, so that a hidden observer could not be used 
for a reliability check as was done in previous 
studies by Pintler (1945), Hollenberg and Sperry 
(1951), and Sears (1951). To rule out any
unconscious bias by the experimenter with regard to
judgments of aggression, the experimenter recorded
all responses which seemed in any way aggressive
and did not participate in the scoring. This procedure
enabled all judgments of aggression, especially
borderline cases, to be made later by two judges
rather than by the experimenter. The two judges
scored all responses for frequency and intensity of
aggression. Interjudge agreement was 96% for
frequency of aggression and 89% for intensity.


Procedure 


The Experimenter Persuasibility (EP) measure
was administered in the classroom to 169 boys in
12 classes by a female experimenter in the standard
manner described by Abelson and Lesser (1959[...]). The
experimenter administered Forms A and B of the
measure on separate occasions, which ranged from
1 to 3 days apart. Persuasibility booklets were handed
out to girls as well as boys, although only the boys'
booklets were used. From the total scores of Forms
A and B of the EP measure, 15 subjects with the
highest scores, 15 subjects with the lowest scores,
and 15 subjects with the scores closest to the [...]
were selected from the total group of 169 boys.
Subjects in the middle range of persuasibility were
included in the study only if their scores on Forms
A and B agreed within three points. The means of
the High, Medium, and Low P subjects are [...],
18.13, and 5.38, respectively. The standard deviations
for the three groups in the same order are 1.86, 1[...],
and 2.25.

Each boy was seen by a male experimenter
four times in individual sessions on successive [...]
days. In the first session, the child was administered
the Mother Persuasibility (MP) measure and
the modified TAT procedure. The MP measure was
administered first as it required less involvement than
the TAT procedure, and was probably less susceptible
to the effects of anxiety. In the next three sessions,
the child participated in the permissive doll play
procedure. All measures were scored afterward to
eliminate any unconscious bias by the experimenter
in his role of recorder.

Each time the child was seen, the experimenter
spent a few minutes talking with the child to
establish rapport before the testing began. The
experimenter told the child in the beginning that they
would be playing some story games together. In
some of them, the experimenter would tell the child
some stories, while in other games the child would
tell the experimenter some stories. The MP measure
was first introduced in a general way as a game
wherein the experimenter would tell the child some
stories, while the TAT procedure and the doll play
procedure were introduced as games wherein the
child, himself, would make up a story. This [...]
theme provided an easy transition from one [...]
of the testing procedure to another.

A few boys selected on the persuasibility measure
were absent from class on days when the other
procedures were administered. As a result, the number
of subjects in the Medium and Low P groups varied
somewhat.


RESULTS 
Validity of the Measures 


Persuasibility. The MP measure was intro- 
duced in this study as a validity criterion 
for the EP measure. High, Medium, and Low 
P groups of subjects, selected on the basis 
of Forms A and B of the EP measure, were 
not differentiated in persuasibility on the MP 
measure. High Ps had an MP mean of 8.47 
and a standard deviation of 5.69; Medium Ps 
had a mean of 8.87 and a standard deviation 
of 4.17; Low Ps had a mean of 7.38 and a 
standard deviation of 5.58. 

Aggressive motivation. Aggressive motiva-
tion was operationally defined as fantasy
aggression on the TAT procedure and aggres- 
sion in the third doll play session. It was 
expected, therefore, that fantasy aggression 
would correlate positively with frequency and 
intensity of aggression and negatively with 
aggression latency in the third play session. 
Because both the fantasy aggression and the 
doll play aggression scores were asymmetri-
cally distributed, phi coefficients were used
to determine the relationship between the two
measures. The results show that fantasy
aggression correlated .68 with intensity of 
aggression and —.67 with aggression latency. 
The chi square test showed these relation- 
ships to be significant at the .01 level. The 
comparison of fantasy aggression with fre-
quency of aggression yielded a correlation of
.32 and a chi square having a p value slightly
less than .05. These findings were taken as
partial evidence for the validity of the
measures.
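
The phi coefficient and its chi square test can be illustrated with a short sketch. It assumes that each measure was dichotomized (for example, split into high and low halves) to form a 2 x 2 table, which the text implies but does not spell out. The cell counts below are hypothetical and are not data from the study.

```python
import math


def phi_coefficient(a, b, c, d):
    """Phi for a 2 x 2 table laid out as [[a, b], [c, d]]."""
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        raise ValueError("a marginal total is zero; phi is undefined")
    return (a * d - b * c) / denom


def chi_square_from_phi(phi, n):
    """Chi square (1 df, no continuity correction) equals N times phi squared."""
    return n * phi ** 2


# Hypothetical counts: rows = high/low on one measure,
# columns = high/low on the other measure.
table = (12, 3, 4, 11)
phi = phi_coefficient(*table)
chi2 = chi_square_from_phi(phi, sum(table))
print(f"phi = {phi:.3f}, chi square = {chi2:.3f}")
```

Because chi square is simply N times phi squared here, the significance of a phi coefficient depends on the number of subjects as well as on the size of the coefficient.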

Inhibitions against aggression. Inhibitions 
against aggression were defined as fantasy 
aggression anxiety on the TAT procedure. 
On the doll play procedure, inhibitions against 
aggression were found by algebraic derivation
to be expressed as aggression in the third 
Play session divided by aggression in the first 
play session. It was predicted, therefore, that 
fantasy aggression anxiety would correlate 
with aggression in the third session divided 
by aggression in the first session, and that
the correlation would be positive for fre-
quency and intensity of aggression and nega-
tive for aggression latency. The results show
that fantasy aggression anxiety correlated
-.23 with the aggression latency ratio, .42
with the frequency of aggression ratio, and
.40 with the intensity of aggression ratio. The
chi square test showed these latter two rela- 
tionships to be significant at the .01 and .02 
levels, respectively. These results suggest that 
the two procedures for measuring inhibitions 
against aggression are valid. 

Aggression conflict ratio. This ratio was 
operationally defined as the ratio of fantasy 
aggression to fantasy aggression anxiety on 
the TAT procedure and aggression in the first 
play session. It was predicted, therefore, that 
the ratio of fantasy aggression to fantasy ag- 
gression anxiety would correlate positively 
with frequency and intensity of aggression 
and negatively with aggression latency in the 
first play session. The ratio on the TAT pro-
cedure correlated .67 with intensity of aggres-
sion, .34 with frequency of aggression, and
—.47 with aggression latency. The correlations 
with intensity of aggression and aggression 
latency were significant at the .01 level, and 
the one with frequency of aggression at the 
.05 level. The validity of the two measures of
the aggression conflict ratio was suggested by 
the data. 


Tests of the Hypotheses 


Hypothesis a. It was predicted that High 
and Low P subjects, selected on the basis of 
the EP measure, would show aggression more 
frequently, with greater intensity, and with 
lower latency during the third doll play 
session than would Medium P subjects. The 
former groups were also expected to have 
more fantasy aggression than Medium P 
subjects. 

The means and standard deviations of the 
three groups on aggression in the third play 
session and fantasy aggression on the TAT 
procedure are presented in Table 1. Because 
of skewed distributions in the measures of 
aggression, the Mann-Whitney U test was 
used to test for significance of differences 
among the groups on both play aggression 
and fantasy aggression. The evaluations of 
the differences are presented in Table 2. 
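
As an illustration of how a z score is obtained from the Mann-Whitney U test, the sketch below uses the usual large-sample normal approximation. It is not a reconstruction of the author's computations: the scores are hypothetical, and the variance formula omits the correction for ties.

```python
import math


def average_ranks(values):
    """Rank values from 1 upward, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_z(x, y):
    """Return (U for group x, z) using the normal approximation."""
    n1, n2 = len(x), len(y)
    ranks = average_ranks(list(x) + list(y))
    rank_sum_x = sum(ranks[:n1])
    u_x = rank_sum_x - n1 * (n1 + 1) / 2
    mean_u = n1 * n2 / 2
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)  # no tie correction
    return u_x, (u_x - mean_u) / sd_u


# Hypothetical frequency-of-aggression scores for two small groups.
high_p = [12, 15, 9, 20]
medium_p = [3, 5, 1, 4]
u, z = mann_whitney_z(high_p, medium_p)
print(f"U = {u:.1f}, z = {z:.3f}")
```

A positive z here means that the first group tends to have the higher scores; for a one-tailed test the z is compared with the normal deviate for the chosen level.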

TABLE 1

MEANS AND STANDARD DEVIATIONS OF HIGH, MEDIUM, AND LOW PERSUASIBLE GROUPS ON
AGGRESSION IN THE THIRD PLAY SESSION AND FANTASY AGGRESSION

                              High P           Medium P         Low P
                              M      SD        M      SD        M      SD
Play Session 3                (N = 15)         (N = 13)         (N = 14)
  Aggression Latency (a)      4.62   5.21      9.85   5.87      3.32   4.46
  Frequency of Aggression     12.73  11.90     3.77   5.63      17.36  11.10
  Intensity of Aggression     27.20  28.52     7.92   13.54     38.64  30.24
Fantasy Aggression            (N = 15)         (N = 14)         (N = 16)
  Total                       7.87   3.65      6.43   2.32      7.31   3.02
  High Relevance              4.73   1.57      4.71   1.60      4.50   1.73
  Low Relevance               3.13   2.31      1.71   1.13      2.80   1.67

(a) In minutes.


As Table 1 illustrates, in the third play
session, High and Low P groups each had
higher frequency and intensity of aggression
and lower aggression latency than the Medium
P group. As indicated in Table 2, these dif-
ferences were significant at the .01 level. The 
High and Low P groups did not differ sig- 
nificantly from each other in any category of 
aggression recorded during the third session. 
The evidence supports Hypothesis a that
High and Low P subjects have more aggres- 
sive motivation than Medium P subjects in 
the doll play situation. 

PERSUASIBILITY AND AGGRESSION


TABLE 2

COMPARISON OF HIGH, MEDIUM, AND LOW PERSUASIBLE GROUPS ON AGGRESSION IN THE
THIRD PLAY SESSION AND FANTASY AGGRESSION

                              High          Low           High and Low   High
                              versus        versus        versus         versus
                              Medium P      Medium P      Medium P       Low P
Play Session 3
  Aggression Latency          2.09**        3.42***       3.40***        .89
  Frequency of Aggression     [illegible]   [illegible]   [illegible]    1.09
  Intensity of Aggression     2.40**        3.11***       3.18***        .99
Fantasy Aggression
  Total                       .98           .91           1.48           [illegible]
  High Relevance              .41           [illegible]   [illegible]    .30
  Low Relevance               1.60*         1.69**        [illegible]    [illegible]

Note. z scores of the Mann-Whitney U test are used throughout.
* .05 < p < .10.
** p < .05, one-tailed.
*** p < .01, one-tailed.


Table 1 also shows that the High and Low
P groups had more total fantasy aggression
than the Medium P group, but as Table 2
indicates these differences were not significant.
However, an analysis of fantasy aggression
scores on the 10 modified TAT cards
indicated that half the cards elicited aggressive
responses from most of the subjects,
while the other half yielded a much greater
range of aggression scores. The former group
of pictures was labeled “high relevance”
(HR) cards because they elicit more aggressive
responses than the latter group, which
has been termed “low relevance” (LR) cards.
This division of the TAT procedure into five
HR and five LR cards has a further basis in
that the rankings of the pictures on aggressive
responses in this study are highly similar to
the rankings reported by Lesser (1958a). The
Spearman rank-difference technique was applied
to the two sets of rankings and resulted
in a correlation of .89, significant at the .01
level.
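
The Spearman rank-difference correlation between two rankings of the same cards can be sketched as follows. The two rankings below are hypothetical stand-ins for the rankings of the 10 cards in the two studies, which are not reported in the text, and the formula assumes there are no tied ranks.

```python
def spearman_rank_difference(rank_a, rank_b):
    """Spearman rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)), for untied ranks."""
    if len(rank_a) != len(rank_b):
        raise ValueError("both rankings must cover the same items")
    n = len(rank_a)
    if n < 2:
        raise ValueError("at least two items are needed")
    sum_d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * sum_d_squared / (n * (n ** 2 - 1))


# Hypothetical ranks of 10 cards on aggressive responses in two studies.
ranks_study_1 = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
ranks_study_2 = [2, 1, 3, 4, 6, 5, 7, 8, 10, 9]
rho = spearman_rank_difference(ranks_study_1, ranks_study_2)
print(f"rho = {rho:.3f}")
```

With only 10 cards, a few adjacent swaps still leave rho very high, so a value such as the reported .89 indicates that the two orderings agree closely but not perfectly.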

If the data are examined separately for HR 
and LR cards, it is apparent that the three 
groups do not differ on the former but do 
differ on the latter. The Low P group had 
significantly more fantasy aggression than the 
Medium P group (at the .05 level), while 
the High P group just missed having signifi- 
cantly more than the Medium group. When 
both the Low and High P subjects were 
grouped together, they had more LR fantasy 
aggression than the Medium P subjects (at 
the .05 level). The evidence from the fantasy 
aggression scores gives partial support to the 
hypothesis that High and Low P subjects have 
more aggressive motivation than Medium P 
subjects. 

Hypothesis b. It was predicted that the 
High P group would have significantly lower 
frequency and intensity of aggression and 
higher aggression latency in the first play 
session than the Low P group. The High P 
group was also expected to have a smaller
ratio of fantasy aggression to fantasy aggres-
sion anxiety than the Low group. The means
and standard deviations of High and Low P
subjects on first play session aggression and
the ratio of fantasy aggression to fantasy
aggression anxiety are presented in Table 3.
The Mann-Whitney U test was again used to
test significance of differences between the 
two groups on both play aggression and the 
ratio scores. The evaluations of the differences 
are presented in Table 4. 

The results show that High P subjects had 
lower frequency and intensity of aggression 
in the first session and higher aggression 
latency than Low P subjects, these differences 
being significant at the .01 level. These find- 
ings support the prediction. Table 3 also 
indicates that the High P group had a lower 
ratio of total fantasy aggression to total 
fantasy aggression anxiety than the Low 
group, but the difference has a p value slightly
greater than .05. When the ratio scores were 
broken down in terms of the HR and LR 
cards, it was only on the HR cards that 
the difference was significant (at the .01 
level). These results provide support for 
Hypothesis b.

Although the Medium P group was not 
involved in Hypothesis b, the data for this
group has been included in Tables 3 and 4. 
The results in Table 3 show that in the first 
play session Medium P subjects had some- 
what higher frequency and intensity of ag- 
gression, and somewhat lower aggression 
latency than High Ps. Compared to Low P 
subjects, the Medium Ps had lower frequency
and intensity of aggression and higher ag-
gression latency. These differences closely
approached the .05 level.


TABLE 3

MEANS AND STANDARD DEVIATIONS OF HIGH, MEDIUM, AND LOW PERSUASIBLE GROUPS ON
AGGRESSION IN THE FIRST PLAY SESSION AND THE RATIO OF FANTASY AGGRESSION TO
FANTASY AGGRESSION ANXIETY

                                High P                    Medium P          Low P
                                M            SD           M      SD         M      SD
Play Session 1                  (N = 15)                  (N = 13)          (N = 14)
  Aggression Latency (a)        12.00        4.91         10.23  4.90       7.22   4.81
  Frequency of Aggression       1.33         1.96         2.31   3.01       4.57   [illegible]
  Intensity of Aggression       1.67         2.49         4.08   6.13       7.14   [illegible]
Ratio of Fantasy Aggression to
Fantasy Aggression Anxiety      (N = 15)                  (N = 14)          (N = 16)
  Total                         1.95         1.80         3.05   2.06       2.89   [illegible]
  High Relevance                [illegible]  [illegible]  2.79   1.58       3.10   2.08
  Low Relevance                 1.72         .95          1.51   1.01       1.88   [illegible]

(a) In minutes.


ALAN ROLAND


TABLE 4

COMPARISON OF HIGH, MEDIUM, AND LOW PERSUASIBLE GROUPS ON AGGRESSION IN THE
FIRST PLAY SESSION AND THE RATIO OF FANTASY AGGRESSION TO FANTASY
AGGRESSION ANXIETY

                                High versus   Medium versus   Medium versus
                                Low P         High P          Low P
Play Session 1
  Aggression Latency            2.40***       .98             1.52*
  Frequency of Aggression       2.62***       .49             1.58*
  Intensity of Aggression       [illegible]   .52             1.45*
Ratio of Fantasy Aggression to
Fantasy Aggression Anxiety
  Total                         1.46*         1.55*           .62
  High Relevance                [illegible]   3.06***         [illegible]
  Low Relevance                 .92           .94             1.27

Note. z scores of the Mann-Whitney U test are used throughout.
* .05 < p < .10.
*** p < .01, one-tailed.

Table 3 indicates further that Medium P 
subjects had a higher ratio of fantasy ag- 
gression to fantasy aggression anxiety than the 
High P subjects on all the cards and on the 
HR cards, but a somewhat lower ratio on 
the LR cards. None of the comparisons be- 
tween the Medium and Low P groups were 
significant. Table 4 indicates that while none 
of the differences between Medium and Low 
Ps approached significance, the difference 
between Medium and High P subjects on the 
HR cards was significant at the .01 level, 
and the difference on all of the cards just 
missed being significant at the .05 level. 


Discussion 


The data generally confirmed the two hy- 
potheses based on the postulated dynamic 
relationship of aggression to persuasibility. 
High and Low P subjects had more aggressive 
motivation than Medium P subjects, and 
High P subjects had a smaller aggression con- 
flict ratio than Low P subjects. Thus, it was 
concluded that low persuasibility is associated 
with an overtly aggressive orientation, high 
persuasibility with a defensive need to inhibit 
strong aggressive motivation, and medium 
persuasibility with a relative absence of ag- 
gressive motivation and inhibitions against 
aggression. 

Both hypotheses were confirmed by the 
data of the doll play procedure, but were 
only partially confirmed by the data of the 


TAT procedure. With regard to aggressive 
motivation, only the LR cards of the TAT
procedure differentiated the groups. The most
parsimonious explanation for this finding is
that the LR cards were less suggestive of
aggressive themes than the HR cards, and
were therefore more sensitive to individual
differences in aggressive motivation. On the
other hand, with regard to the aggression
conflict ratio, the High P subjects were dif-
ferentiated from the Low P group only on the
HR cards. It would appear that since the HR
cards elicited more aggressive responses in
general than the LR cards, they were likely
to elicit a greater degree of inhibitions
against aggression in a given individual.
Thus, since they provide a better measure of
inhibitions against aggression for each indi-
vidual, these cards were more likely to show
individual differences in aggression conflict. 

In a comparison of the Medium with the
High and Low P groups on the aggression
conflict ratio, as manifested in the first doll
play session and the TAT procedure, the
results can best be explained by medium
persuasibility being associated with low
inhibitions against aggression, as well as low
aggressive motivation. The Medium P subjects
in play sessions manifested less aggression
than the Low P subjects because of
less aggressive motivation, and manifested a
higher aggression conflict ratio on the HR
cards of the TAT procedure than the High P
subjects because of less inhibitions against
aggression.

The confirmation of the major hypotheses of this
study suggests some additional research in the
area of drive expression and drive [...].
Two types of investigations should
be made: the relationship of other specific
drive and drive control systems to persuasibility
and the relationship of general levels
of drive expression and drive control to
persuasibility. Crutchfield (1955) has already
reported that general defensiveness is related [...]


[...] play procedure were significantly correlated
in the predicted direction with respect
[...] operations defining aggressive motivation,
[...] aggression conflict ratio. Therefore, the above
comparisons affirm the validity of these
measuring instruments. This use of a
projective test and behavioral procedure in
conjunction for assessing personality variables
is in accord with the suggestion made by
Hovland and Janis (1959b) for improving
the validity of measures of personality
variables in persuasibility research.


SOME EFFECTS OF VERBAL EXPRESSION OF HOSTILITY 1


RICHARD deCHARMS


Washington University

113 Ss participated in simulated interaction in the course of which they
were verbally attacked. The experiment demonstrated (with a Before-After
measure) increased hostility in the Ss at the end of the experiment which
was a positive function of the number of hostile statements made to the
attacker after the attack. A vicarious analogue was consistent in showing a
similar increase. The major findings are inconsistent with a simple catharsis
hypothesis. They are more easily interpreted by viewing hostility as instru-
mental behavior which is unsuccessful (frustrated) and as a result increases
in intensity.


Much of the research stimulated by the 
Frustration-Aggression hypothesis (Dollard, 
Doob, Miller, Mowrer, & Sears, 1939) has 
assumed a basic motive, aggression, which is 
the causal agency responsible for aggressive 
acts. The phenomena of aggression are found 
in responses and, therefore, it is crucial to 
avoid the circularity of explaining aggressive 
behavior by postulating an aggressive motive 
which is not independently defined. The 
danger of circularity is particularly acute in 
dealing with the catharsis hypothesis which 
states that “the occurrence of any act of 
aggression is assumed to reduce the instiga- 
tion to aggression” (p. 50). If a drive reduc- 
tion view is taken one might assume that any 
act of aggression should have drive reducing 
properties, a position taken by Feshbach 
(1955). 

The position will be taken here that in- 
dependent measurement of the aggressive 
drive and aggressive responses is rare. In 
order to avoid the drive response confusion 
the concern here will be entirely with hostile 
behavior with no reference to drive or motive. 

One of the main purposes of the present 
research is to investigate the form of the 
catharsis hypothesis which implies that the 
occurrence of any act of aggression (hostile 
behavior) will reduce the probability of 
further hostile behavior. In addition, this 
research is concerned with a vicarious 
analogue of this hypothesis, namely, that 
observing someone else engage in hostile
activity will affect the level of hostility of
the subject in a way similar to a situation
in which the subject himself engages in hos-
tile activity. The above statement derives
from the definition of vicarious phenomena
proposed by deCharms and Rosenbaum
(1960).


1 This study was carried out under Contract
Nonr-816 (11) between the Office of Naval Re-
search and the Small Groups Research Center, Social
Science Institute, Washington University.

Worchel (1961) has demonstrated that 
status restoration after an attack results in 
lower residual hostility. Status restoration was 
accomplished by assuring the subjects that 
they were performing a task well after at 
first berating their performance. After verbal 
interpersonal attack, status restoration and 
reduced hostility might result from making 
self-laudatory remarks or hearing someone 
else make such remarks. These hypotheses 
which involve self-enhancement will also be 
tested in the present study. 

The experiment to be presented is a direct 
outgrowth of and methodologically similar to 
the study reported by Rosenbaum and 
deCharms (1960). They found a reduction in 
hostility for subjects with low self-esteem, 
in situations in which the subject himself was 
allowed to respond after provocation, and in 
which he merely observed the “vicar” berate 
his provocateur. As in the study by Thibaut 
and Coules (1952) the reduction in hostility 
could not be definitely attributed to catharsis 
since very little actual hostile behavior 
occurred. 

Using the terminology of the earlier experi- 
ments, we will refer to Communication con- 
ditions in which the subject is allowed to 
communicate directly with the attacker after 
the verbal attack. The present experiment 
includes two communication conditions. The
Communication Maximum Restraint (Com
Max) condition is identical to the Com-
munication condition of Rosenbaum and
deCharms (1960). The Communication Me- 
dium Restraint (Com Med) condition derived 
from a study preliminary to the present one 
in which an attempt was made to reduce 
the restraint against hostile statements. This 
condition directs the subject to spend some 
of his time discussing characteristics of the 
attacker. 

In the present experiment the Negative 
Vicarious Communication (— Vi Com) con- 
dition is identical to Rosenbaum and de- 
Charms’ (1960) Vicarious Communication 
condition. The Positive Vicarious Com- 
munication (+ Vi Com) condition is unique 
to this experiment. These vicarious condi- 
tions involve situations where the subject 
cannot respond to the attack, but a third 
person (the “vicar”) does. In the positive
condition he praises the subject; in the nega- 
tive he berates the attacker. The final experi- 
mental condition involves the control of No 
Communication (No Com). 


Hypotheses 


Catharsis hypothesis. This hypothesis 
would predict a change toward reduced 
hostility after an attack upon a sub-
ject’s self-esteem in the Com Med condition
as compared to the No Com condition. Fur- 
ther, it would predict that the amount of 
change (reduction in hostility) is a positive 
function of the amount of hostility displayed 
by the subject during the communication 
period. 

If the manipulation of restraint is suc-
cessful, a crucial test of the catharsis hy-
pothesis will be obtained in comparing the
Com Max condition (little hostility com-
municated) with the Com Med condition
(considerable hostility communicated). The
specific prediction is that there will be greater
reduction in hostility in the Com Med con-
dition than in the Com Max condition.

Self-enhancement hypothesis. Reduction in 
hostility is a positive function of positive 
statements made about self in the com-
munication period. This hypothesis can be
tested in both the Com Max and the Com
Med situations.




Vicarious phenomena hypotheses. Vicarious 
analogues of both the catharsis hypothesis 
and the self-enhancement hypothesis are 
presented by the — Vi Com and + Vi Com 
conditions. The definition of vicarious phe- 
nomena leads to the hypotheses that reduc- 
tion in hostility will be greater in the — Vi 
Com condition than in the No Com condition 
(catharsis analogue) and that reduction in 
hostility will be greater in the +Vi Com 
than in the No Com condition (self-enhance- 
ment analogue). 


METHOD 
Subjects 


One hundred and thirteen subjects participated in 
the experiment. All were male freshmen or sophomores
serving in a campus ROTC unit. Eight subjects 
were eliminated from the analysis for reasons of 
suspicion or failure to complete forms correctly. 
The final analysis included 105 subjects, 21 in 
each of the five experimental conditions. 


Equipment 


The experiment was conducted in four small 
rooms, one of which served as a master control 
room and the others served as experimental rooms 
in which the subject was isolated. It was possible, 
therefore, to have a maximum of three subjects 
at one time. This number was optimal, since every 
effort was made to have the subject believe that 
he was one of a group of three. Care was taken 
so that the subjects did not see each other before 
the experiment. Each experimental room contained 
a desk and chair and the subject was provided 
with a pencil, a set of headphones, and microphone. 
The master control room contained four tape 
recorders, three for recording everything that hap- 
pened in each of the three experimental rooms 
and the fourth which played a master tape which 
was heard over the headphones in all three of the 
experimental rooms. 


Procedure 


As soon as the subject entered his experimental 
room he filled out the Value Profile which con- 
tained the Self-Esteem scale used by Rosenbaum 
and deCharms (1960). The experimenter then 
visited his room briefly to instruct him in the 
use of the equipment and to inform him that 
detailed instructions would be given over the inter- 
communication system. When all subjects were 
prepared they heard taped instructions. These in- 
structions described the procedure in detail and 
stressed that the subject had two tasks. First, he 
was to attempt to give a clear picture of himself; 
and second, he was to try to arrive at an impression 
of the other participants. 

TABLE 1
ROUND 6 PROCEDURES UNDER VARYING CONDITIONS

Condition                                      Person speaking            Topic
No Communication (No Com)                      Vicar (not heard by        Ideal self
                                               the subject)
Maximum Restraint Communication (Com Max)      Subject                    Ideal self (a)
Medium Restraint Communication (Com Med)       Subject                    Comparison (b)
Positive Vicarious Communication (+Vi Com)     Vicar                      Ideal self (c)
Negative Vicarious Communication (-Vi Com)     Vicar                      Ideal self

(a) The instructions were, On this final round the person indicated in the box below is to give a picture of his ideal self. What would you like to be?
(b) The instructions were, On this final round the person indicated in the first box is to compare his personality with that of the person indicated in the second box. These were written in and always directed the subject to compare himself with the attacker.
(c) In the two vicarious conditions the vicar actually ignored the instructions about ideal self and either discussed the subject (+Vi Com) or the attacker (-Vi Com).


After the instructions, the subject was prepared
to describe himself to two other participants fol-
lowing the topics and the order presented to him 
on an instruction sheet. This sheet made it clear 
to him when he was to speak and when the others 
were to speak. The topics were as follows: 


Round 1. Who are you? (This was to include 
relevant background information but no names.) 

Round 2. Describe your social self. 

Round 3. Describe your nonsocial self.

Round 4. Characteristics you dislike in yourself. 

Round 5. Compare your personality with that of
the person indicated. (The letter “A,” which identi-
fied the attacker, was written in except in the Com
Med condition where the subject was to compare
himself with the vicar.2)

Round 6. Varied as to condition as shown in
Table 1.


Brief explanations were given of each of these 
topics to give the subject a start. The instructions 
did not lead the subject to expect any spontaneous 
interaction with the other subjects since each was 
to be talking about himself and was not to ask 
or answer questions. The order in which each person
spoke varied from round to round but the overall
order was constant for all conditions up to and
including Round 5. 

The master tape which each subject heard con- 
tained instructions from the experimenter explaining 
the topic to be discussed and telling who was to 
speak to whom. The tape also contained standard- 
ized speeches for both the attacker and the vicar 
for each of the rounds. When the subject was
speaking the tape was silent, and the other tape
recorders recorded what each subject said about
himself on each round. Five versions of the tape
were used, and these constituted the primary 
experimental variable. The five versions were 
identical up to and including Round 5. On Round 5 
the attacker who had up to this time presented
himself as a self-assured, rather egotistical person,
attacked the subject, calling him, among other
things, an “egotistical deceitful liar,” and alleging that
he was full of “bull —.” This attack lasted for
1 minute as did all the other communication
periods for each person within a single round. The
attack always came at the end of Round 5.


2 This difference in the Com Med condition was
necessary to make the situation plausible since the
subject would be comparing himself to the attacker
in Round 6.

The attack could not, of course, refer specifically 
to anything a subject had said since it was to be 
used for all subjects. Extensive experimentation 
with different attacks resulted in a highly credible 
tape. The fact that subjects in Round 4 were asked 
about negative characteristics of themselves as- 
sured that almost every subject had said both good 
and bad things. This laid the groundwork for the 
attack which started by saying that the subject 
was inconsistent and therefore probably mixed-up, 
unlikable, etc. The effectiveness of this attack on
inconsistency rather than specifics was apparently
strengthened by the fact that most subjects felt 
that they rambled around and were inconsistent. 
This fact gave the cue to the “vicar” who could 
defend the subject by saying that anybody in this 
situation would have such trouble. 

Round 6 in the present experiment was different 
from that used in previous experiments in that 
only one person spoke for 1.5 minutes rather than 
the 1-minute period of the earlier rounds. The 
primary independent variable was manipulated in 
Round 6 by varying the directions as to who was 
to speak and the content if the vicar was to 
speak. Table 1 gives the procedure in Round 6 
as well as clarifying the five experimental conditions 
as presented above. The person speaking and the 
topic varied in Round 6 according to condition. 

A Before-After measure was inserted into the 
procedure which had the advantage of using the 
same After measure (the Structured Personality 
Sketch) as used in the earlier experiments. The 
Before measurement could not be identical with 
the After measure because it took too much time to 
complete. A seven-point check list of 10 items was 
devised as a Before measure. A rough content 
analysis was made of the Structured Personality 
Sketches of the earlier experiments and adjectives 
were chosen on this basis. The form of the check 
list was that of adjectival polar opposites with 
spaces provided for choices of “very,” “moderately,” 
or “slightly” modifying the adjectives. Actually, it 
can be questioned whether the items are polar op- 
posites, but they are clearly positive and negative. 
For half of the items the positive adjective was on 
the right. For the others, this was reversed in a 
random sequence in order to avoid response set. 
The items were: courageous-spineless, sincere- 
insincere, overbearing-cooperative, popular-unpopu- 
lar, antisocial-sociable, friendly-unfriendly, negative- 
positive, conceited-humble, pleasant-unpleasant, ob- 
noxious-attractive. 
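
The scoring of the check list is not spelled out in the text. As a minimal sketch, assuming each item is marked on a 1 to 7 scale running from the left-hand adjective to the right-hand one, and assuming a total in which higher values mean a more negative (hostile) rating, the reverse-keyed items could be handled as follows. The names `ITEMS`, `POSITIVE`, and `hostility_score` are hypothetical, and the item order is simply the order in which the adjectives are listed above, not necessarily the order on the printed form.

```python
# Adjective pairs as listed in the text: (left pole, right pole).
ITEMS = [
    ("courageous", "spineless"),
    ("sincere", "insincere"),
    ("overbearing", "cooperative"),
    ("popular", "unpopular"),
    ("antisocial", "sociable"),
    ("friendly", "unfriendly"),
    ("negative", "positive"),
    ("conceited", "humble"),
    ("pleasant", "unpleasant"),
    ("obnoxious", "attractive"),
]

# Assumption: these adjectives are treated as the positive pole of each pair.
POSITIVE = {
    "courageous", "sincere", "cooperative", "popular", "sociable",
    "friendly", "positive", "humble", "pleasant", "attractive",
}


def hostility_score(marks):
    """Sum ten marks (1 = 'very' left adjective ... 7 = 'very' right adjective).

    Each item contributes 1 (most favorable) to 7 (most unfavorable), so the
    total runs from 10 to 70. The 1-7 coding is an assumption, not the
    authors' documented scoring.
    """
    if len(marks) != len(ITEMS):
        raise ValueError("expected one mark per item")
    total = 0
    for (left, _right), mark in zip(ITEMS, marks):
        if not 1 <= mark <= 7:
            raise ValueError("marks must lie between 1 and 7")
        # Positive adjective on the left: a high mark is already unfavorable.
        # Positive adjective on the right: reverse the mark (8 - mark).
        total += mark if left in POSITIVE else 8 - mark
    return total
```

Under this coding a subject who checks the neutral midpoint on every item receives 40, and reversing half the items keeps a uniform "always mark the right side" response set from producing an extreme total.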

The same check list was administered three times 
during the experimental procedure, after Round 3, 
after Round 5 (immediately after the attack), and 
at the end of the procedure after the subject had 
written the Structured Personality Sketch. The 
subject always filled out the check list for both the 
attacker and the vicar. 

The purpose of administering the check list after 
Round 3 was twofold. Primarily it was an attempt 
to acquaint the subject with the procedure so that 
he would complete it quickly when it was presented 
after the attack. This was successful since the 
average time in filling out the check list after the 
attack was 2 minutes. In addition, this gave a 
preattack measure. 

The introduction of the check lists and the other 
procedural differences increased the power of 
measurement over the earlier experiments 
considerably. 

1. The Round 3 check list provides a baseline 
measure for all subjects. 

2. The change from Round 3 to Round 5 provides 
a measure of instigation. 

3. The final check list after the Structured 
Personality Sketch provides a validation of the 
check list and can be compared with the other 
measure. 

4. A change score can be computed from the 
Round 5 check list to the final check list and can 
be related to amount of hostility shown in the 
Structured Personality Sketch, thus testing the 
catharsis hypothesis. 

5. Primarily, however, a regression analysis of 
the Round 5 check list and the Structured Person- 
ality Sketch yields an actual change score by con- 
ditions which is necessary for precise evaluation of 
the hypotheses. 

Immediately after Round 6, the experimenter 
visited each subject individually, giving him the 
form for the Structured Personality Sketch as described 
by Rosenbaum and deCharms (1960) and 
the check lists on which he was to write a first 
draft of a personality sketch of both the attacker 
and the vicar. Envelopes were supplied and the 
subjects were told that the completed forms would 
be delivered directly to the participant described 
for his comments, after which the subject would 
fill out a final draft for the experimenter. 

When the subject completed the personality 
ratings, the experimenter visited each subject 
individually, telling him that he had finished first and 
would have to wait to receive the ratings from 
the others. A structured interview was then carried 
out and recorded, starting with an inquiry about 
the electrical equipment and leading up to the 
subject’s feelings about the situation, both other 
participants, and any indications of suspicion. 
Throughout the procedure an attempt was made 
to conceal from the experimenter which experimental 
condition was being used. After the experimental 
session special care was taken to explain the procedure 
to the subjects and to obtain cooperation 
in not discussing the experiment. 


Analysis of the Data 


The primary dependent variable in the earlier 
studies was the content analysis of the Structured 
Personality Sketch which formed residual hostility 
scores. In the present study such scores were ob- 
tained in an exactly comparable fashion. The 
Structured Personality Sketches were broken into 
thought units and each unit was judged either 
positive, neutral, or negative with respect to the per- 
son being described. A check was made to be sure 
that the mean number of units in the Structured 
Personality Sketch was comparable between the 
experimental cells. Analysis of variance showed no 
significant differences. 

The taped discussions in Rounds 5 and 6 were 
also content analyzed in this manner. Here, however, 
it was necessary to score with regard to 
either the attacker or the self and to provide an 
irrelevant category for units not relevant to the 
person under consideration. 

Scoring was all done without knowledge of the 
experimental condition and a standard reliability 
check resulted in 95.7% agreement between scores. 

Change scores. The Before-After change score in 
the present experiment was computed between the 
Round 3 and Round 5 check lists (instigation), 
the Round 5 check list and the Structured Per- 
sonality Sketch (change in hostility), and the 
Round 5 and Round 6 (final) check list (change 
in hostility). The change score is computed from 
a regression coefficient, obtaining for each subject 
a predicted score which is then subtracted from 
his actual score giving a residualized change score 
which is uncorrelated with initial score. The tech- 
nique has advantages over simple difference scores 
(DuBois, 1957) and, of course, simple change scores 
could not be computed between the check list and 
the Structured Personality Sketch. 

The regression coefficient, from which change 
scores were computed, was derived from the No 
Com group exclusively. The mean change score in 
this group is therefore arbitrarily set at zero and 
any differences found in other groups are reflected 
in a difference from zero. Thus the No 
Com group is truly used as a control and all 
evidence of change must be attributable to differ- 
ences between the control situation and the other 
conditions. However, one cannot say whether 
factors (such as time) which are constant in all 
groups led to a reduction or increase in hostility. 
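
The residualized change score described above can be illustrated with a short sketch. It assumes an ordinary least-squares line of the After measure on the Before measure, fitted on the control (No Com) subjects only and then applied to everyone. The function names and the numbers are hypothetical and are not data from the experiment.

```python
def fit_line(before, after):
    """Least-squares slope and intercept for predicting after from before."""
    n = len(before)
    mean_b = sum(before) / n
    mean_a = sum(after) / n
    sxx = sum((b - mean_b) ** 2 for b in before)
    sxy = sum((b - mean_b) * (a - mean_a) for b, a in zip(before, after))
    slope = sxy / sxx
    return slope, mean_a - slope * mean_b


def residual_change(before, after, slope, intercept):
    """Actual After score minus the score predicted from the Before score."""
    return [a - (intercept + slope * b) for b, a in zip(before, after)]


# Hypothetical scores: the control group defines the regression line.
control_before = [20, 25, 30, 35, 40]
control_after = [10, 14, 13, 19, 24]
other_before = [22, 28, 36]
other_after = [18, 20, 27]

slope, intercept = fit_line(control_before, control_after)
control_change = residual_change(control_before, control_after, slope, intercept)
other_change = residual_change(other_before, other_after, slope, intercept)

# The control residuals average to zero by construction, so a non-zero mean
# in another group is a difference from the control condition.
control_mean = sum(control_change) / len(control_change)
other_mean = sum(other_change) / len(other_change)
```

Because the line is fitted on the control group alone, the mean change there is zero by construction, which is the sense in which the No Com mean is "arbitrarily set at zero." Within that group the residuals are also uncorrelated with the Before score.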

It is important here to note that there is no 
systematic relationship between the change score, 
which is the dependent measure, and the number 
of units the subject wrote in the Structured Personality 
Sketch (r = −.17, p > .05). This means 
that the length of the person’s Structured Personality 
Sketch does not affect his change score as measured 
by the Structured Personality Sketch and Round 
5 check list. There were no significant differences 
in mean number of units in the Structured Per- 
sonality Sketch by experimental condition. 
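
The significance of a correlation of this size can be checked with the usual transformation t = r·sqrt(n − 2)/sqrt(1 − r²). The sketch below assumes the correlation was computed over all 105 subjects, which the text does not state explicitly. `t_from_r` is a hypothetical helper name.

```python
import math


def t_from_r(r, n):
    """t statistic (n - 2 degrees of freedom) for a Pearson correlation r."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


# Assumption: n = 105, the total number of subjects in the experiment.
t_value = t_from_r(-0.17, 105)
```

With these inputs the absolute value of t is about 1.75, below the tabled two-tailed .05 critical value of roughly 1.98 for about 100 degrees of freedom, which agrees with the reported p > .05.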


RESULTS 
Evaluation of Experimental Procedures 


In order to test the hypotheses dealing with 
catharsis, (a) initial instigation should be 
demonstrable, (b) greater hostility should be 
demonstrable in the communication conditions 
than in the control, (c) other conditions 
should be the same so that differential frustra- 
tion is ruled out, (d) initial level of negative 
statements under no instigation should be 
controlled. These criteria are discussed by 
Berkowitz (1958). In addition, it is main- 
tained that (e) a relationship must be demon- 
strable within experimental conditions between 
the amount of aggressive behavior in the 
interpolated communication period and the 
change in hostility level over that period. 

In the present experiment initial instigation 
is clearly demonstrable. Check list means for 
all cells after the attack show more hostility 
than before the attack. The change scores of 
pre- and postattack check list (Rounds 3 and 
5) are highly significantly different from zero 
(t = 15.1, p < .0005). Analysis of variance 
shows that mean change (instigation) is 
comparable between cells. 

In the two communication conditions 
greatest hostility is shown by subjects in 
responding to the attack (the interpolated 
communication) under Com Med (17 of 21 
subjects made hostile statements). Under Com 
Max 1 subject of 21 made one hostile state- 
ment. In the control (No Com) no statements 
could be made. 

Criteria c, no differential frustration, and 
d, initial instigation, are controlled by the 
experimental design and by using change 
scores, respectively. 

The Before-After measure allows assess- 
ment of relationship between the amount of 
interpolated hostility and the change score 
taken from measures before and after this 
interpolated period. 

Data from Dependent Variables Relevant to 
the Main Hypotheses 


Several measures deriving from the Struc- 
tured Personality Sketch, the Round 5 and 
the final check lists were examined in con- 
nection with the main hypotheses. Inspection 
of the data shows that except for minor dif- 
ferences the results show the same pattern no 
matter which measure is used. The mean 
change scores based on the Structured Per- 
sonality Sketch and the Round 5 check list 
were those designed to comprise the dependent 
measure. Analysis of variance showed that the 
independent variables accounted for the most 
variance in these scores. Table 2 presents 
the mean change scores and Table 3 presents 
the completed analysis of variance of these 
scores. 

Catharsis hypothesis. The major com- 
parison in Table 2 relevant to the catharsis 
hypothesis is that between the No Com and 
Com Med conditions. The catharsis hypothe- 
sis predicted a reduction in hostility in the 
Com Med condition. The results show a sub- 
stantial increase in hostility which, however, 
is not statistically significant. These results 
tend to disconfirm the catharsis hypothesis, 
especially since it has been shown that a 
significant amount of hostility was manifested 
by subjects in Round 6 of the Com Med 
condition. 

The more stringent statement of the ca- 
tharsis hypothesis calls for a negative relation- 
ship between the amount of hostility shown 
in Round 6 in the Com Med condition and 
the reduction in hostility as measured by 
regression of the Structured Personality 
Sketch score on the Round 5 check list. Since 
the distribution of hostile statements made 
in Round 6 was skewed, subjects were split at 
the median into those showing high amounts 
and those showing low. Ties accounted for the 
unequal split. Subjects were further classified 
as to whether the Structured Personality 
Sketch measure showed a relative decrease or 
increase in hostility over the Round 5 check 
list. The table of frequencies and results of a 
chi square test of association are presented in 
Table 4. The data strongly refute the catharsis 
hypothesis. Far from draining off their hostility 
in Round 6, subjects seem to build up 
their hostility in proportion to the amount 
they show in the interpolated period of 
Round 6. 


TABLE 2 

MEAN CHANGE IN HOSTILITY SCORE: 
REGRESSION OF % NEGATIVE UNITS SCORE FOR THE STRUCTURED PERSONALITY 
SKETCH ON ROUND 5 CHECK LIST 

                      No          Direct Communication     Vicarious Communication
                      Communi-    Maximum     Medium       Positive    Negative
                      cation      Restraint   Restraint                             Total
High self-esteem 
  N                     9           9           8           10          16           52 
  M                   00.0       −13.3       +12.5         +4.7       +14.2         +4.9 
  s²                 546.0       540.8       319.0        249.9       248.8 
Low self-esteem 
  N                    12          12          13           11           5           53 
  M                   00.0        00.0        +3.3         +9.8       +23.8         +5.1 
  s²                 451.4       536.7       255.1        346.2       618.5 
Total 
  N                    21          21          21           21          21          105 
  M                   00.0        −5.4        +6.8         +7.4       +16.5         +5.0 

Note. A positive number indicates an increase in hostile responses relative to the No Communication condition. 
Analysis of variance is shown in Table 3. 
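
The chi square test of association on the 2 × 2 frequencies reported in Table 4 (hostility reduced: 2 high, 7 low; hostility increased: 10 high, 2 low) can be sketched as follows. Whether the authors applied Yates' continuity correction is an assumption; the function name is hypothetical.

```python
def chi_square_2x2(a, b, c, d, yates=True):
    """Chi square for the 2 x 2 table [[a, b], [c, d]] with 1 degree of freedom.

    With yates=True, half the total N is subtracted from |ad - bc| before
    squaring (Yates' continuity correction).
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(0.0, diff - n / 2)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Rows: hostility reduced / increased. Columns: high / low volume in Round 6.
corrected = chi_square_2x2(2, 7, 10, 2, yates=True)
uncorrected = chi_square_2x2(2, 7, 10, 2, yates=False)
```

With the correction the statistic comes out near 5.55, close to but not identical with the value printed in Table 4; without it the value is near 7.84. Either figure exceeds 5.02, the tabled critical value for p = .025 with 1 degree of freedom, so the reported significance level is reproduced.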

The expectations derived from the catharsis 
hypothesis posited that reduction of hostility 
would be greater in the Com Med condition 
than in the Com Max condition. This predic- 
tion was based on the expectation that more 
hostility would be shown under Medium 
Restraint than under Maximum Restraint. It 
was shown earlier that the manipulation of 
restraint was highly successful in eliciting 
many hostile statements in the Com Med 
condition (17 of 21 subjects) and practically 
none in the Com Max condition (1 of 21 
subjects). Comparing these conditions for the 
amount of change in hostility results in another 
direct contradiction of the catharsis hypothesis. 
The only condition in the experiment 
which showed a tendency toward reduction of 
hostility was the Com Max condition, and 
this was significantly different from the Com 
Med which showed an increase in hostility 
(p < .05). 


TABLE 3 

ANALYSIS OF VARIANCE OF MEAN CHANGE IN HOSTILITY 
SCORES SHOWN IN TABLE 2 

Source                                        df      MS        F 
Individual comparisons 
  No Communication versus Communication 
    Maximum Restraint                          1     44.2     1.09 
    Medium Restraint                           1     62.4     1.54 
    Positive Vicarious                         1     52.6     1.30 
    Negative Vicarious                         1    361.0     8.91** 
  Communication 
    Maximum versus Medium Restraint            1    200.1     4.94* 
Overall table 
  Level of self-esteem                         1     35.3     0.87 
  Communication conditions                     4    184.5     4.55*** 
  Interaction                                  4     38.6     0.95 
  Residual                                    95     40.5 

* p < .05. 
** p < .005. 
*** p < .001. 
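
The F ratios in Table 3 can be checked by dividing each mean square by the residual mean square. The sketch below reads the two entries printed without a decimal point as 35.3 and 184.5, which is an assumption about the original typesetting. `TABLE_3` and `f_ratio` are hypothetical names.

```python
RESIDUAL_MS = 40.5  # residual mean square, 95 degrees of freedom

# (source, mean square, F as printed in Table 3)
TABLE_3 = [
    ("No Com vs. Maximum Restraint", 44.2, 1.09),
    ("No Com vs. Medium Restraint", 62.4, 1.54),
    ("No Com vs. Positive Vicarious", 52.6, 1.30),
    ("No Com vs. Negative Vicarious", 361.0, 8.91),
    ("Maximum vs. Medium Restraint", 200.1, 4.94),
    ("Level of self-esteem", 35.3, 0.87),
    ("Communication conditions", 184.5, 4.55),
    ("Interaction", 38.6, 0.95),
]


def f_ratio(mean_square, residual_ms=RESIDUAL_MS):
    """F ratio of an effect mean square to the residual mean square."""
    return mean_square / residual_ms


# Rows whose recomputed F differs from the printed value by more than 0.01.
mismatches = [
    source
    for source, mean_square, printed in TABLE_3
    if abs(f_ratio(mean_square) - printed) > 0.01
]
```

Under that reading every printed F agrees with its recomputed value to within 0.01, which supports placing the decimal points where the sketch assumes them.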

TABLE 4 

FREQUENCY OF SUBJECTS SHOWING HIGH OR LOW 
HOSTILITY IN ROUND 6 AND DIRECTION OF CHANGE 
IN HOSTILITY 

                 Volume of hostile statements 
                 in Round 6 
Hostility          High      Low 
Reduced              2         7 
Increased           10         2 

Note. χ² = 5.446, p < .025. 


It might be argued that the amount of 
time in Round 6 is too short a time in which 
to expect catharsis, although no time limitations 
have been set on the hypothesis. It is 
clear that the data strongly call the hypothesis 
into question for the short time span, but 
it is possible to derive another test from the 
data which involves a longer time span. If 
the Structured Personality Sketch itself is 
considered an interpolated activity in which 
varying amounts of hostility are expressed, 
the catharsis hypothesis can be tested by 
relating the measure of expressed hostility in 
the Structured Personality Sketch to the 
change score derived from the Round 5 check 
list and the final check list which was filled 
out immediately following the Structured Per- 
sonality Sketch. The final check list was filled 
out on the average about 10 minutes after 
the attack since the Structured Personality 
Sketch takes about that long to complete. 

The period between the Round 5 check list 
and the final check list includes the 1.5 
minutes of Round 6 devoted to the various 
experimental conditions. For this reason cor- 
relations between the amount of hostility 
shown on the Structured Personality Sketch 
and the change score derived from the check 
lists were computed separately for each cell. 
Within the No Com, +Vi Com, and —Vi 
Com cells, subjects have had no opportunity 
to express hostility in Round 6. In the two 
Com conditions, the correlations may be af- 
fected by the differentially hostile statements 
in Round 6. 

Table 5 presents the correlations. It should 
be clear that the catharsis hypothesis predicts 
positive correlations. This is because the 
change score for this analysis was computed 
so that a high score indicates a large reduction 
in hostility which should be positively 
correlated (according to the hypothesis) with 
a large amount of expressed hostility in the 
interpolated activity. Obviously, these data 
again are significantly in the direction opposite 
to that predicted by the catharsis hypothesis. All 
correlations are negative and all are significant 
except the No Com correlation. 


TABLE 5 

CORRELATIONS BETWEEN AMOUNT OF HOSTILITY IN 
THE STRUCTURED PERSONALITY SKETCH AND THE 
CHANGE SCORE DERIVED FROM THE PRE- AND 
POSTPERSONALITY SKETCH CHECK LISTS 

Experimental condition                      r 
No Communication                          −.214* 
Maximum Restraint Communication           [illegible] 
Medium Restraint Communication            [illegible] 
Positive Vicarious Communication          −.587*** 
Negative Vicarious Communication          −.415** 

Note. Since the change score was computed so that a large 
score indicates a large decrease and the interpolated hostility 
score is high for a large amount of hostility, the catharsis 
hypothesis predicts positive correlations. 
* p > .10. 


RICHARD DE CHARMS AND EDWARD J. WILKINS 

The data in Table 5 represent five inde- 
pendent tests which show that expression of 
hostility in an interpolated activity is associ- 
ated with an increase in hostility. The inevi- 
table conclusion from these data is that 
hostile statements increase rather than de- 
crease the probability that further hostile 
statements will occur under the conditions of 
the experiment. 

Self-enhancement hypothesis. The self- 
enhancement hypothesis postulated a relation- 
ship between reduction in hostility toward the 
attacker and the number of positive state- 
ments about self made during the interpolated 
communication period. The data show a 
change toward increased hostility for subjects 
above the median in percentage of positive 
self statements (+4.5) and a change toward 
decreased hostility for subjects below the 
median (—3.3). These results are the opposite 
of those predicted by the hypothesis. They 
are not significant, however. 

Hypotheses concerning vicarious phenomena. 
The general hypothesis concerning vicarious 
phenomena predicts that when the subject 
observes (hears) the vicar perform responses 
his behavior will be affected as if he himself 
had performed them. It was originally 
postulated (the catharsis hypothesis) that a 
reduction in hostility would occur in the Com 
Med condition. It has been demonstrated, 
above, however, that when the subject shows 
hostile responses during the communication 
period (Com Med) he has a tendency to 
become increasingly hostile toward the at- 
tacker. To be consistent, the vicarious phe- 
nomena hypothesis now must be modified to 
predict that a similar tendency toward in- 
creased hostility will occur when the vicar 
calls down the attacker in Round 6 (—Vi 
Com condition). Reference to Table 3 shows 
a large change in the direction predicted. This 
is the situation which elicited the greatest 
increase in hostility, an increase which is significantly 
different from the No Com condition 
at less than the .005 level. The hypothesis 
has given data the reverse of that predicted 
by the catharsis hypothesis but entirely 
consistent with data from the direct test of 
the catharsis hypothesis. 

Evaluations of the vicar. One might expect 
a radical shift toward liking the vicar in the 
vicarious communication conditions as com- 
pared with the other conditions. In the vicari- 
ous conditions the vicar stood up for the sub- 
ject. This effect of greater liking was not 
significant in the previous experiments. In the 
present experiment it was possible to compare 
a change score from Round 5 to Round 6 
(here differences were used for simplicity). 
In the No Com, Com Max, and Com Med 
conditions, the differences are negligible, but 
in the two vicarious conditions the differences 
show greater liking for the vicar. Analysis of 
variance demonstrates that this is a significant 
effect (p < .01). There is a trend for the effect 
to be stronger with high self-esteem subjects, 
but the interaction term of the analysis of 
variance does not approach significance. 

Data relevant to level of self-esteem. In this 
experiment the self-esteem variable proved to 
have little effect on the results. Neither self-esteem 
alone nor in interaction with the experimental 
variables was significant in the 
analysis of variance. Examination of Table 3 
shows that the most important trend is found 
in the Com Max condition where the high 
self-esteem subjects tend toward reduction of 
hostility, whereas low self-esteem subjects 
show little change. This reduction cannot be 
attributed to catharsis, since no hostility was 
produced in the Com Max condition. This 
result is similar to that found for low self-esteem 
subjects under the same experimental 
condition by Rosenbaum and deCharms 
(1960). 

Not all comments in the Round 6 communication 
were hostile in nature. An impressionistic 
look at these responses to the attack 
reveals that they could be classified under 
Rosenzweig's (1935) categories of intrapunitive 
and impunitive. These were most characteristic 
of subjects with low self-esteem, while 
extrapunitive responses came generally from 
high self-esteem subjects. This results in 
higher hostility means for high self-esteem 
subjects in Round 6. The high self-esteem 
subjects also show a slight tendency to de- 
crease in hostility if they build themselves up 
in Round 6. On the other hand, low self- 
esteem subjects show a hesitancy to attack A. 
However, the most interesting trend in this 
regard is that those low self-esteem subjects 
who run themselves down on Round 6 tend 
to decrease in hostility toward the attacker. 
Conversely, those low self-esteem subjects 
who build themselves up in the Round 6 
communication tend to increase in hostility 
(the opposite trend of that found with high 
self-esteem subjects). This latter trend may 
indicate that when low self-esteem subjects 
build themselves up they find their own ele- 
vating comments difficult to believe and thus 
this response to the attack is ineffectual in 
meeting its threat and in reducing hostility. 


Discussion 


The main results of this experiment clearly 
demonstrate that hostile verbal responses increase 
rather than decrease the probability of 
further hostile verbal responses. It is impor- 
tant to note specific conditions in the experi- 
ment which may have contributed to these 
results. 

1. Hostile responses in this experiment are 
in reaction to a hostile verbal attack. This can 
be considered a blocking of the ongoing ac- 
tivity of presenting oneself in a favorable 
light, but the strictly personal nature of the 
attack probably carries more sting than 
frustration of a task activity. 

2. The measures of hostility in each case 
derive from material specifically directed to 
the attacker by the subject, rather than from 
material where indirect, displaced, or fantasy 
hostility is measured. 

This type of hostility could be perceived 
by the subject as instrumental behavior 
intended to change the attacker's behavior 
(change the threatening stimulus) and reduce 
the threat of further attack. The subject 
receives no information that his instrumental 
behavior has succeeded. It appears that under 
these conditions he increases the intensity 
of the hostility as would be predicted by 
Amsel (1958) when an instrumental act is 
unsuccessful. 
One would predict that if the hostile behavior 
were successful in changing the threatening 
stimulus such behavior would decrease 
in the specific situation since the stimulus 
conditions have changed, but when another 
similar threatening stimulus presents itself the 
probability of hostile behavior should be 
increased because of the past success or re- 
inforcement. A cognitive interpretation would 
also stress the importance of the subject’s 
interpretation of the attacker’s behavior in 
influencing the subject’s feelings of hostility. 
Either analysis would emphasize the impor- 
tance of careful scrutiny of external stimulus 
events which are related to changes in hostile 
behavior. Worchel (1961) has shown that 
status restoration which is bestowed by the 
original attacker leads to a decrease in hos- 
tility. The present experiment failed to find 
a similar effect as a result of self-enhancing 
statements made either by the subject himself 
or by the vicar. Thus it appears that status 
restoration by means of an actual change in 
the attacking stimulus person successfully 
changes hostile behavior toward him. Status 
restoration which leaves the attacking stim- 
ulus unchanged has no apparent effect on the 
hostility toward the attacker. 

The experiment gives unequivocal evidence 
of changes (increase) in hostility which are 
a function of the number of interpolated 
hostile statements. It is unclear why this 
experiment should result in increased hostility 
for all subjects while the Rosenbaum and 
deCharms (1960) study showed decreased 
hostility for low self-esteem subjects only. Of 
the several differences which might have 
led to these discrepant findings, the most 
promising seems to be the status differential 
which is a function of the relatively low status 
(freshman) of subjects in this experiment 
vis à vis the attacker who was presented 
as an upperclassman. The subjects in the 
Rosenbaum and deCharms experiment were 
primarily upperclassmen and hence of equal 
status with the attacker. 

Whatever the variable is which led to this 
difference between the experiments, it is clear 
that its effects are consistent in affecting 
either direct self-communication or the vicarious 
situation. The general prediction for 
vicarious phenomena, namely, that it will 
affect the subject’s responses as if he had 
acted rather than the vicar, is nicely con- 
firmed. Rosenbaum and deCharms (1960) 
show a decrease due to direct communication 
for subjects with low self-esteem and the same 
phenomena for the vicarious situations. The 
present experiment shows an increase for both 
direct and vicarious communications for all 
subjects regardless of self-esteem. 

The term vicarious was originally used to 
connote an inferior substitute for self-action. 
The data, however, indicate that the “vicar’s” 
attack has a stronger effect on the subject’s 
subsequent behavior than his own hostile re- 
sponses. This result can best be explained as 
a result of group support. It again emphasizes 
the importance of variables external to the 
subject, once hostility has been aroused, and 
makes it clear that changes in hostile behavior 
may be primarily a function of such variables 
rather than of self-initiated possibly unsuc- 
cessful aggression which is supposed to drain 
off tension. 


SCALE JUDGMENTS OF TRIPLETS OF 
OPINION STATEMENTS ¹ 


WALTER WEISS 
Boston University 


115 students at Boston University evaluated the attitude positions of 
anonymous sources when different sets of 3 opinion statements were attributed 
to them. The triplets varied in the extent to which their constituent state- 
ments reflected strongly favorable to strongly unfavorable sentiments in 
regard to capital punishment. The results indicate that the scale judgment of 
a triplet and confidence concerning judgment are affected by the mean of 
the scale values of the component statements and by the dispersion of the 
scale values around the mean. The extreme triplets are judged even more pro 
or con than any of their constituents. The judges’ own positions on the issue 
did not affect their evaluations of the triplets. 


Since the appearance of the monograph by 
Thurstone and Chave (1929), many attitude 
researchers have collected people’s judgments 
of the scale positions of opinion statements. 
The primary purposes of most of the studies 
were to develop attitude scales of one kind or 
another or to test assumptions needed to 
permit the use of scales as attitude measures. 
Recently, however, attention has turned to the 
examination of such judgments as exemplifica- 
tions or tests of principles of judgment or 
instances of general phenomena of judgment 
(Fehrer, 1952; Hovland & Sherif, 1952; 
Segall, 1959; Sherif & Hovland, 1953). The 
utility of such principles for an understand- 
ing of attitudinal phenomena and communica- 
tion effects has also been remarked on by a 
number of others (Helson, 1948; Krech & 
Crutchfield, 1948; Volkmann, 1951). Al- 
though significant information can be obtained 
through judgments of single opinion statements, 
the more ordinary circumstances of 
attitude expression and evaluation involve the 
statement of a variety of opinions on an issue 
by the same person. Hence, to apply such 
judgmental phenomena to the more usual 
communication setting, systematic research is 
needed of people’s evaluations of another’s 
attitude when a set of opinion statements is 
attributed to him. A few investigators have 
presented brief or elaborate communications 
for judgment (Hovland, Harvey, & Sherif, 
1957; Manis, 1960; Weiss, 1958). But the 
difficulty of providing a wide range of variations 
with systematic control over the factors 
whose influence is to be examined limits the 
utility of such an approach at present. As an 
intermediate step between judgments of single 
statements and of full communications, the 
method used in this study was adopted. People 
were asked to evaluate the attitude positions 
of anonymous sources who purportedly expressed 
three opinion statements on an issue. 
Admittedly, this is not the same as experiencing 
and judging the customary kind 
of persuasive communication. Nevertheless, 
through systematic variation of the triplets, 
we can examine the relationship between the 
single scale judgment of the triplet and such 
factors as: the mean or median of the scale 
values of the component statements, the 
dispersion of the scale values around the 
average, and the judges’ own attitudes on the 
issue. 

¹ The research reported in this article was supported 
by Grant M-1351 from the National Institute 
of Mental Health, United States Public Health 
Service. 

The current study using opinion statements 
as judgment stimuli can also be viewed as an 
instance of experimental research on judg- 
ments of groups of objects. In this sense, it is 
related to investigations by Gulliksen (1956) 
on food preferences; by Corbin, Reese, Reese, 
and Volkmann (1956) on subjective statistics; 
and by Willis (1960) on ratings of people 
as dates or mates. Willis’ experiment, which 
appeared in print after the current one was 
completed, is the closest in approach. Com- 
parisons between the results of the two will 
be made in the Discussion section. 

METHOD 
Subjects 


Students taking various courses at Boston Univer- 
sity during the Spring semester, 1960, constituted the 
sample. The N was 115. 


Procedure 


Questionnaire booklets were distributed to the 
students during one of their regular class periods. 
On the first inside page, each subject rated his own 
position on the issue of capital punishment on a 
1-9 graphic scale which ranged from strongly op- 
posed to strongly favorable. After this, he read the 
detailed rationale that stated our interest in studying 
people’s judgments of the attitudes of others, when 
the only information available is a few of their 
opinions on an issue: 


To do this, we asked a large number of people 
like yourselves to give us their opinions on the 
topic of capital punishment. They did this by 
picking from a large number of statements the 
three that best expressed their thinking on the 
topic. Now we would like you to read a number 
of these sets of three opinions, and give us your 
judgments concerning how favorable or unfavor- 
able in general each person is toward capital 
punishment. 


To illustrate the procedure, a sample triplet of statements 
was presented. The subject’s judgment of the 
anonymous person’s attitude was to be made on a 
1-11 scale, whose end categories were labeled “very 
strong approval” (1) and “very strong disapproval” 
(11), and whose middle category (6) was designated 
“neither approval nor disapproval.” After making the 
judgment, he was to indicate the degree of confidence 
he felt in his rating by marking a confidence scale 
that ranged from 0% (“none at all”) to 100% 
(“complete confidence”) in steps of 10%. This
sequence of a scale judgment and then a confidence
rating followed all triplets. The subject was told 
that more than one set of statements could be 
assigned the same position on the scale and also
that not all of the 11 steps need be used, for he 
may feel that none of the sets is a reasonable 
example of a particular position on the attitude scale.
He was also cautioned against permitting his own at- 
titude on the issue or his agreement-disagreement 
with the anonymous person to influence his scale 
judgments. Finally, he was told not to take too much 
time in reading or thinking about each set of opinions 
but rather to let the judgment that came naturally 
and readily to mind be the one he gave. Returning 
to pages that had already been turned was explicitly 
proscribed, and the experimenter's close surveillance 
of the students maintained this restriction in force.

A total of 28 triplets was judged. Of these, 25
comprised a 5 × 5 factorial arrangement based on
the combinations of five single opinion statements
with five pairs of statements. Each pair was conjoined
with each of the single statements to form
five triplets, and hence a total of 25 triplets was
obtained. Each of the following opinion positions 
was represented by one of the single statements and 
also by both members of each of the pairs: Strong 
Pro, Moderate Pro, Neutral, Moderate Con, Strong 
Con. These are the five varieties of each factor of 
single statements and pairs of statements. The desig- 
nations of the relative favorableness-unfavorableness 
of the statements reflect the scale values assigned
by other subjects who in a previous experiment 
(Weiss, 1961) had judged the statements individually. 
The full range of scale values was represented by the 
statements used in the current study. All statements 
are from the Peterson-Thurstone scale of attitude 
toward capital punishment. 
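
The crossing of pairs with single statements described above can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not the procedure by which the booklets were actually assembled: the names `POSITIONS` and `build_factorial` are hypothetical, and each statement is represented by its position label rather than by its wording.

```python
from itertools import product

# The five opinion positions named in the text.
POSITIONS = ["Strong Pro", "Moderate Pro", "Neutral", "Moderate Con", "Strong Con"]


def build_factorial(positions):
    """Cross every pair (both members at one position) with every single statement."""
    triplets = []
    for pair_pos, single_pos in product(positions, repeat=2):
        # A triplet is the two statements of the pair plus the single statement.
        triplets.append((pair_pos, pair_pos, single_pos))
    return triplets


triplets = build_factorial(POSITIONS)

# A triplet is homogeneous when the single statement shares the pair's position.
homogeneous = [t for t in triplets if t[0] == t[2]]

print(len(triplets), len(homogeneous))
```

The five homogeneous triplets fall on the diagonal of the 5 × 5 arrangement; the remaining 20 cells differ only in how far the single statement departs from the pair.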

In addition to these 25 triplets, there were three 
of the following kind: compared with a homogeneous
triplet,² each exhibited a wider range of scale
values around an identical median. For example, one 
of these special triplets contained the same state- 
ment whose scale value was the median of the three 
constituting the homogeneous, Moderately Pro triplet, 
but instead of the other two statements had a Strong 
Pro and a Neutral one. Similarly, another triplet 
comprised a Neutral, a Moderately Con, and a Strong 
Con statement. And the third contained a Moderately 
Con, a Neutral and a Moderately Pro statement. 
In all three triplets, the scale value of one statement 
was approximately as far above the median as the
other was below. 

The first page of triplets in each booklet always 
contained two homogeneous ones that comprised the 
three extreme pro and the three extreme con state- 
ments. Each of the remaining 13 pages contained 
two triplets whose median scale values, based on the 
values of their constituents, would not fall near each
other on the attitude scale. These 13 pages were 
reordered from booklet to booklet following a table 
of random permutations of numbers. 
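
The page ordering just described can be illustrated with a short sketch. It assumes a pseudorandom shuffle in place of the printed table of random permutations that was actually used; the function name `assemble_booklet`, the page labels, and the seed are hypothetical.

```python
import random


def assemble_booklet(first_page, other_pages, rng):
    """Keep the first page fixed and randomly permute the remaining pages."""
    shuffled = list(other_pages)
    rng.shuffle(shuffled)  # stand-in for a table of random permutations
    return [first_page] + shuffled


first_page = "page 1: extreme pro and extreme con homogeneous triplets"
other_pages = [f"page {i}" for i in range(2, 15)]  # the remaining 13 pages

# One booklet; a different seed (hypothetical) would give another ordering.
booklet = assemble_booklet(first_page, other_pages, random.Random(0))

# Two triplets per page over 14 pages gives the 28 triplets judged.
triplets_per_booklet = 2 * len(booklet)
```

Fixing the first page gives every subject the two extreme homogeneous triplets as initial anchors, while the permutation of the other pages spreads order effects across booklets.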
RESULTS

The “raw” category assignments of the 25
triplets constituting the basic 5 × 5 factorial
arrangement were examined by the procedure
of analysis of variance. The relevant data are
in Tables 1 and 2. The major sources of
variation are overwhelmingly significant.
When pairs of consecutive columns are examined
by separate analyses of variance, the
same sources of variation are again found to
yield highly significant Fs. Hence, the variable
third members of the triplets influence
judgments, producing scale shifts in accord
with their changing evaluative significance.
As might be expected, changing the pairs of
statements has a greater effect than changing
the single statements. Also, on the average,
changing a statement from Moderately Pro
or Con to Neutral has a greater effect than a
change from Strong to Moderate Pro or Con.
Although the evidence is not conclusive, this
result does not appear to be due to the moderate
statements being closer to their respective extremes.
Rather the particular import of a neutral
assertion seems implicated.

² For ease of reference, the five triplets containing
statements of relatively comparable degree of
favorableness-unfavorableness will be called the
“homogeneous triplets.”

To show that the results are not due to the
use of raw category assignments, the scaling
procedure of Attneave and Chamblis (1957)
was used to transform each subject's judgments
into values that reflect his scale of
judgment. The same types of analyses of
variance were applied to these data as were
used in examining the untransformed judgments.
The mean squares associated with
pairs of statements, single statements, and
interaction were again highly significant.
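
The arithmetic behind the analysis of variance of the raw judgments can be retraced from the mean squares in Table 2. The sketch below assumes that each effect is tested against the residual mean square, as in a subjects-by-treatments layout; the dictionary keys and the function name `f_ratios` are hypothetical labels, and the numbers are those read from the table.

```python
# Mean squares and degrees of freedom read from Table 2.
MS = {
    "Subjects": 9.12,
    "Pairs (A)": 2927.12,
    "Singles (B)": 1068.36,
    "A x B": 24.24,
}
DF = {
    "Subjects": 114,
    "Pairs (A)": 4,
    "Singles (B)": 4,
    "A x B": 16,
    "Residual": 2736,
}
MS_RESIDUAL = 3.05


def f_ratios(mean_squares, ms_residual):
    """F for each source, assuming the residual MS is the error term throughout."""
    return {src: round(ms / ms_residual, 2) for src, ms in mean_squares.items()}


F = f_ratios(MS, MS_RESIDUAL)

# 115 subjects each judged the 25 factorial triplets.
n_subjects, n_triplets = 115, 25
total_df = n_subjects * n_triplets - 1

for source, value in F.items():
    print(f"{source:12s} df={DF[source]:4d}  F={value}")
print("total df:", total_df, "sum of component df:", sum(DF.values()))
```

The same check applies to the residual degrees of freedom, which equal 114 × 24 under this layout.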

Since transformed scale values for the individual
statements based on the Attneave and
Chamblis procedure were available from the
data of a previous study (Weiss, 1961), the
mean of the scale values of the statements
constituting each triplet could be compared
with the similarly scaled judgment of the
triplet as a whole by the subjects in this
study. The product-moment correlation between
the two sets of scale values was .87 for
all 28 triplets and .92 for the basic factorial
set of 25. Based on the means of the raw
judgments (see Table 1), the correlation was
.96. These data suggest a linear relation between
the overall integrative judgments of the
triplets and the values of their constituents.
However, the continued appearance of significant
interactions between pairs and single
statements indicates that the effect of the
variable third statement is not a simple
additive one. Also, as the data in Table 1
reveal, the mean judgment of each nonneutral
homogeneous triplet is more extreme, i.e.,
more pro or con, than the mean of the mean


³ Although the data of the two studies were
gathered about a year apart, the subjects in both
were Boston University students attending the same
types of courses. Also, the mean self-ratings of
attitude on the same type of 1-9 graphic scale were
quite comparable; by t test, the p value of the
comparison exceeded .50, two-tailed.
TABLE 1
MEAN UNTRANSFORMED JUDGMENTS OF TRIPLETS

                                Single statements
Pairs of          Strong    Moderate              Moderate   Strong
statements         Pro        Pro      Neutral      Con       Con

Strong Pro
  M                1.15       2.37      3.61        4.38      4.35
  SD            [illegible]
                  (2.87)*    (3.42)    (3.81)      (4.70)    (5.08)
Moderate Pro
  M                4.12       3.80      5.60        6.03      5.95
  SD               2.08       2.02      2.08        1.82      1.85
                  (3.72)     (4.27)    (4.66)      (5.55)    (5.93)
Neutral
  M                5.12       5.59      6.28        7.31      7.40
  SD               1.70       1.49      1.88        1.80      1.67
                  (4.89)     (5.44)    (5.83)      (6.73)    (7.10)
Moderate Con
  M                6.37       6.92      8.43        9.62     10.11
  SD               1.41       1.66      1.94        1.77      1.56
                  (6.59)     (7.14)    (7.53)      (8.42)    (8.80)
Strong Con
  M                6.50       7.22      9.01        9.78     10.83
  SD               1.56       1.76      1.87        2.01      0.44
                  (7.47)     (8.02)    (8.41)      (9.31)    (9.68)

* Mean of the mean judgments of the three constituents
based on data from Weiss (1961).


judgments of its constituents. Actually, the
mean untransformed or transformed judg- 
ment of each of these triplets is more extreme 
than the comparable mean for any of its 
constituents. In addition, the interquartile 
range of judgments is also smaller than that 
for any of its component statements. Hence, 
while judgments do reflect the evaluative sig- 
nificance of the variable opinion statement, 
they also reveal the patterned effect of the 
triplet as a unit. 
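
The relation between the judged position of a triplet and the mean of its constituents can be illustrated with the entries of Table 1. The sketch below is not a reanalysis: to keep the listing short it uses only the four rows from Moderate Pro through Strong Con (20 of the 25 cells), pairing each mean judgment with the parenthesized constituent mean beneath it, so the coefficient it yields need not equal the reported one. The function name `pearson_r` is hypothetical.

```python
from math import sqrt

# Mean judgments (row M of Table 1), columns ordered Strong Pro ... Strong Con.
judged = [
    4.12, 3.80, 5.60, 6.03, 5.95,    # Moderate Pro pairs
    5.12, 5.59, 6.28, 7.31, 7.40,    # Neutral pairs
    6.37, 6.92, 8.43, 9.62, 10.11,   # Moderate Con pairs
    6.50, 7.22, 9.01, 9.78, 10.83,   # Strong Con pairs
]

# Parenthesized entries: mean of the mean judgments of the three constituents.
constituent_means = [
    3.72, 4.27, 4.66, 5.55, 5.93,
    4.89, 5.44, 5.83, 6.73, 7.10,
    6.59, 7.14, 7.53, 8.42, 8.80,
    7.47, 8.02, 8.41, 9.31, 9.68,
]


def pearson_r(xs, ys):
    """Product-moment correlation computed from deviation scores."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


r = pearson_r(constituent_means, judged)
print(f"r over {len(judged)} cells = {r:.2f}")
```

A high coefficient of this kind is compatible with the significant interaction noted above, since a correlation over cell means is insensitive to modest departures from additivity.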

The clear-cut effect of the varying third 
member indicates that the median of the scale 
values of the triplets is not concordant with 
the judged attitude position of the hypo- 
thetical survey respondent. The dispersion of 
statements around a constant median position 
does affect judgment. This can be tested more 


TABLE 2
ANALYSIS OF VARIANCE OF MEAN SCALE JUDGMENTS

Source                       df         MS          F
Subjects                    114        9.12        2.99*
Pairs of statements (A)       4     2927.12      959.71*
Single statements (B)         4     1068.36      350.28*
A × B                        16       24.24        7.95*
Residual                   2736        3.05
Total                      2874

* p < .001.
TABLE 3
DIFFERENCES BETWEEN JUDGMENTS OF NARROW RANGE, HOMOGENEOUS
TRIPLETS, AND THEIR SPECIAL, WIDE RANGE COUNTERPARTS

                  Pro               Neutral               Con
              Wide   Narrow      Wide   Narrow       Wide   Narrow
M             4.35ᵃ   3.80       6.79    6.28        7.57    9.62
SD diff           2.41               2.43                1.62
pᵇ               <.01               <.05                <.001

ᵃ The higher the mean, the greater the opposition to or the
less the approval of capital punishment.
ᵇ Two-tailed.


critically by comparing the three special 
triplets of wide range with their homogeneous 
counterparts among the factorial set of 25. 
The relevant data are in Table 3. As can be 
seen, an increase in range around the same 
median statement causes the Moderately Pro 
and Moderately Con triplets to seem less 
extreme; this occurs despite the inclusion in 
the more heterogeneous triplets of a state- 
ment that is more extremely pro or con than 
any contained in the homogeneous ones. It is 
not obvious why the homogeneous neutral 
triplet is adjudged less opposed to capital 
punishment than the comparable heterogene- 
ous one; the median raw judgment of the 
latter places it on the con side of the scale, 
while the median judgment of the homogeneous
triplet falls in the neutral interval.

The means of the scale values of the three 
statements constituting each of these comparison
triplets are 1.33 versus 1.54, 2.24
versus 2.26, and 3.33 versus 3.46, respectively.
The differences are slight; and although the
difference between the pro pairs favors a
more extreme judgment for the homogeneous
triplet, on the assumption of a positive rela- 
tion between the mean of the scale values for 
the individual statements and the judgment 
of the triplet as a whole, the difference be- 
tween the con triplets favors a more extreme 
judgment for the wide range one. Also, despite 
almost identical means for the neutral triplets, 
their scale judgments differ. Hence, range of 
scale values is clearly a factor influencing 
judgment, even with mean scale value rela- 
tively constant; additionally, the data suggest 
that for nonneutral triplets the range is inversely
related to extremeness of scale judgment.
Consistent with this evidence are the
differences between the mean confidence
ratings of the compared triplets. Each homogeneous
triplet has a significantly higher
mean confidence rating than does the corresponding
wider range one; t test p values
are <.001 for the pro and con pairs and
<.02 for the neutral one. The average difference
is 12 points on the 0-100 confidence
scale.

Table 4 contains the mean confidence 
ratings that the subjects assigned to their 
judgments of the basic set of 25 triplets. 
Analysis of variance reveals highly significant
Fs for rows, columns, and interaction; the p 
values were all <.005. The uncertainty pro- 
duced in the subjects as the triplets became 
more and more heterogeneous is evident from 
an examination of the table. Except for the 
Moderately Con row, the highest confidence 
rating in each row is associated with the 
homogeneous triplet. And in general, rated 
confidence declines steadily from that point 
as homogeneity decreases. Considering the 
five homogeneous triplets alone, we observe a 
U-shaped relationship between scale value and
mean confidence rating. 

The distribution of the subjects’ self-ratings
on the issue of capital punishment was trichotomized
to form three groups: those indicating
that they were at least moderately
opposed (Intervals 1-3), those indicating that 
they were at least moderately favorable 
(Intervals 7-9), and those placing themselves 


TABLE 4
MEAN CONFIDENCE RATINGS

                                Single statements
Pairs of          Strong    Moderate              Moderate   Strong
statements         Pro        Pro      Neutral      Con       Con

Strong Pro
  M               87.78      81.20     70.09       54.14     55.00
  SD              21.70      20.39     27.63       30.87     32.59
Moderate Pro
  M               72.41      76.92     74.66       67.26     66.24
  SD              21.93      18.17     19.45       23.03     23.33
Neutral
  M               60.00      ?9.40     75.90       73.88     71.62
  SD              23.83      20.3?     20.51       22.??       ?
Moderate Con
  M               55.74      62.70     80.95       85.73     87.18
  SD              29.71      26.60     16.42       14.28     16.23
Strong Con
  M               52.41      64.57     81.22       86.67     89.49
  SD              30.44      26.50     14.82       14.18     17.90
TABLE 5
NUMBER OF OPINION STATEMENTS ALLOCATED TO CERTAIN SEGMENTS
OF THE SCALE BY SUBJECTS DIFFERING IN INITIAL POSITION

                        Initial position
Scale segments       1-3       4-6       7-9

Favorable
  M                 11.13     10.62     10.60
  SD                 3.06      4.47      3.51
Neutral
  M                  3.72      4.19      3.96
  SD                 2.40      4.78      2.84
Unfavorable
  M                 13.04     13.10     13.44
  SD                 3.74      3.24      2.82
N                    46        21        48


at or next to the uncertain position (Intervals
4-6).⁴ The respective Ns were 46, 21, and 48.
By separate analyses of variance applied to
the untransformed judgments, neither the
between-groups variance nor the Groups ×
Stimuli interactions were found to approach
significance. The means for the groups are
6.8, 6.7, and 6.8. Reliable differences between
the judgments of any pair of groups were
found for only a few scattered triplets.
As a further test of the comparability
of the three groups in their judgments of the
triplets, the number of triplets allocated to
the Favorable segment of the scale (1-5), the
Neutral category (6), and the Unfavorable
segment (7-11) were examined. The data
are in Table 5. None of the between-groups
differences for any segment of the scale approaches
significance. These results hold even
when the three extreme pro (1-3) and the
three extreme con (9-11) categories are examined.
The same lack of intergroup differences
in judgment is obtained when the
transformed data are analyzed. All intergroup
product-moment correlations between the
median scale judgments of the 28 triplets were
greater than .95; and those between the
reciprocals of the interquartile ranges of
judgment were at least .85 for any pairing of
groups. Also, none of the comparisons based


⁴ On the rating scale, Intervals 1, 3, 5, 7, and 9
were labeled “strongly opposed,” “moderately opposed,”
“in doubt,” “moderately favorable,” and
“strongly favorable.” The remaining intervals were
unlabeled.
on the subjects’ ratings of felt confidence in
their scale judgments revealed significant 
intergroup differences. Hence, based on a 
variety of assessments, initial position on the 
issue of capital punishment was not found to 
be related to judgments of the triplets. 
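
The grouping and tallying steps of this analysis can be sketched as follows. The cut points are those given in the text (self-ratings of 1-3, 4-6, and 7-9; judgments of 1-5, 6, and 7-11), but the function names, the group labels, and the sample judgments are hypothetical and stand in for one subject's booklet.

```python
from collections import Counter


def attitude_group(self_rating):
    """Assign a 1-9 self-rating to one of the three initial-position groups."""
    if not 1 <= self_rating <= 9:
        raise ValueError("self-rating must lie on the 1-9 scale")
    if self_rating <= 3:
        return "opposed"      # Intervals 1-3
    if self_rating <= 6:
        return "uncertain"    # Intervals 4-6
    return "favorable"        # Intervals 7-9


def scale_segment(judgment):
    """Assign a 1-11 triplet judgment to a segment of the judgment scale."""
    if not 1 <= judgment <= 11:
        raise ValueError("judgment must lie on the 1-11 scale")
    if judgment <= 5:
        return "Favorable"    # categories 1-5 (approval end)
    if judgment == 6:
        return "Neutral"      # category 6
    return "Unfavorable"      # categories 7-11 (disapproval end)


# Hypothetical judgments of the 28 triplets by a single subject.
judgments = [1, 2, 3, 3, 4, 4, 5, 5, 5, 6, 6, 6, 7, 7,
             8, 8, 8, 9, 9, 9, 10, 10, 10, 11, 11, 11, 2, 7]

counts = Counter(scale_segment(j) for j in judgments)
print(attitude_group(2), dict(counts))
```

Tallies of this kind, averaged within each initial-position group, are what Table 5 summarizes; note that the self-rating scale runs from opposed to favorable, whereas the judgment scale runs from approval to disapproval.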


DISCUSSION


As Willis (1960) has observed and as is 
confirmed here, judging the general position 
of a set of social objects on a particular 
attribute dimension involves more than a 
simple conceptual averaging of the values 
of the constituents. Although the individual 
components do contribute separately to the 
overall judgment, the final categorization is 
also affected by the patterned quality of the 
set as a single complex unity. However, it 
should be kept in mind that in both studies 
the judges were under instructions to view 
each set of objects as a psychological unit. 
This enforced binding is perhaps a little more 
reasonable in the current research. The above 
general principle has of course been pressed 
for some time by a number of people (e.g., 
Asch, 1952; Krech & Crutchfield, 1948). 

Although the social objects and the judg- 
ment dimension used by Willis differed from 
those in this study, a number of the formal 
patterns of his data are replicated in outline 
by the results reported here. Hence, empirical 
principles seem implicated by the following 
common findings. The very high relationship 
between the mean of the component values 
and judgment of the triplet, since it was 
exhibited here over a wide range of means, 
extends his results which were based on a 
dichotomous classification of mean scale 
values. The significant effect of heterogeneity, 
which he defined as the mean deviation of 
scale values around the mean scale value, is 
substantiated in particular by the compari- 
sons between the special wide and narrow 
range triplets. Also, his finding that sets of 
low mean scale value are judged below their 
means and sets of high mean scale value above 
their means is exhibited even more sharply 
by the data showing that the nonneutral 
homogeneous triplets are judged more pro or 
con than any of their constituents. The lack
of effect of skewness of distribution of individual
values is neither confirmed nor disconfirmed
by this study; for, although
changing the third member of a triplet while 
holding constant the other two introduces 
increasing skewness, it also varies the mean 
coordinately. When the few triplets of very 
comparable mean values but of widely differ- 
ing skewness are examined, their scale values 
are found to be relatively close to each other. 
A more systematic study of this variable is 
obviously needed. Interestingly, in his study, 
sets composed of three objects revealed a 
major effect more than did sets of two ob- 
jects; in this experiment, only triplets were 
used, and the effects were clearly exhibited. 

A few studies (Cartwright, 1941; Hovland 
& Sherif, 1952; Sherif & Hovland, 1953; 
Weiss, 1959, 1961) have demonstrated that 
judges’ own attitudes can affect their cate- 
gorizations of single opinion statements on 
a dimension of favorableness-unfavorableness 
toward a particular point of view. In fact, 
Weiss (1961) confirmed the relationship, 
using the items that in this study composed 
the triplets for judgment. Yet, the current 
data did not manifest any association between 
attitude and scale judgments. If we take this 
finding to be reliable, then the availability 
of three opinion statements stands as the 
likely procedural difference producing the 
discrepant results. Although post hoc, the
following conjectures are offered as the outline 
of an interpretation of the inconsistency. The 
judgment of any single statement is a re- 
sultant of intertwined and variegated influ- 
ences. Some of them derive from the wording 
of the opinion statement. For instance, certain 
words and phrases are commonly used to
express direction and degree of an evaluation 
and are so understood by members of the 
same linguistic community; or the evaluative 
import of certain words receives uniform 
interpretation. Additional influences can arise 
from assumptions about other statements that 
would be made by a person who utters the 
particular opinion being evaluated or from 
silent hypotheses concerning the kind of indi- 
vidual who would express such a sentiment. 
Also, the imaginative comparison between the 
statement being evaluated and statements
previously judged or held in mind as stand- 
ards may contribute to the judgment. Other 
influences derive from the judge’s own attitude
on the issue. His general position may
act as a standard of judgment, or the scale of 
judgment he applies may be affected by his 
views on the issue, or the attributes taken 
into account and integrated to yield a judg- 
ment on the general evaluative dimension of 
favorableness-unfavorableness may be differ- 
ently influenced. Also, a feeling of agreement- 
disagreement with the opinion statement, 
which is likely to be aroused despite all in- 
structional emphasis to inhibit such a reaction, 
may affect the scale position assigned. 
Obviously, other specific determinants of re- 
sponse can be proposed. But for our purposes, 
the set can be divided into two general cate- 
gories: those deriving from or related to the 
judge’s attitude on the issue and those de- 
riving from nonattitudinal factors. Clearly 
then, any characteristic of the judging situa- 
tion that related to or engaged one type more 
than another would increase the influence of 
such factors on the final decision. However, 
the general or “natural” salience of attitudinal 
factors must also be taken into account; for 
their relative dominance is probably directly 
related to the involvement of the judge in 
the issue and in his position on the issue. 
An extreme, affect-laden attitude would have
its influence excluded or minimized only if the 
nonattitudinal determinants were sufficiently 
engaged and demanded a particular judgment. 

If a single statement contains unequivocal
linguistic cues to position, then the influence 
of attitudinal factors will be reduced and 
the idiosyncratic influence of nonattitudinal 
factors will be restricted, thereby raising 
the dominance of certain relatively uniform 
factors. The result is a narrowing of the range 
of positions that can be assigned the state- 
ment. Such a statement is likely to express 
an opinion of extreme favorableness or un- 
favorableness. When the language is less uni- 
vocal in evaluative implication, factors related 
to attitude will increase in influence. Other 
kinds of determinants will still be involved in 
the final decision. But the range of possible 
assignments will be sufficiently extensive that 
attitudinal influences can now produce dis- 
criminably different scale values. When three 
statements are made available, as in this 
study, the same kinds of nonattitudinal factors
may be evoked by each. Such multiple
engagement would strengthen their influence
on the final decision and reduce the possible
effect of attitudinal factors. However, depend- 
ing on the statements composing the set to 
be judged, personal idiosyncratic influences 
may still be involved and lead to individual 
differences in scale assignments even though 
attitudinal influences are not significantly im- 
plicated. When the statements composing the 
triplet are reasonably homogeneous in import, 
the possible region of assignment is con- 
stricted or narrowed. Hence, individual dif- 
ferences in judgments will be diminished, as 
evidenced by the smaller interquartile ranges 
of judgments associated with the nonneutral 
homogeneous triplets than with any of their 
constituent statements. 

Hovland et al. (1957) found a relationship 
between judges’ attitudes and their evalua- 
tions of a communicator’s general position as 
expressed in a recorded talk. Since several 
opinions were probably given by the com- 
municator, the effect of attitude should have 
been minimized according to the preceding 
discussion. However, several considerations 
suggest that these data are not contrary to
the implications of the theoretical analysis. 
As previously noted, the strength and saliency 
of attitudinally related influences vary with 
the degree of the judge’s involvement in the 
issue and in his position; in their study, the 
issue had been widely debated in the media 
shortly before the experiment was conducted 
and had been the subject of a recent refer- 
endum. Adherents of an extreme “dry” posi- 
tion were drawn from members of groups 
unalterably opposed to the use of alcohol on 
religious or moral grounds. Proponents of an 
extreme “wet” position were however ob- 
tained with difficulty and constitute only a 
small proportion of the total subject group. 
Their data indicate that the effect of attitude 
on the scale judgment of the position of the
communicator was exhibited most clearly by
the extreme “dry” judges. Also, their communication
expressed a moderate position on
the issue and probably contained the usual
elaborative, filler, and connective comments
which round out a recorded communication.
Hence, we would not expect uniform agreement
on what words represented the opinions
(there was no question about this in the
current study), nor on what opinions were
being stated; nor would we expect that converging
nonattitudinal influences would be
overly predominant or clearly relate to a par- 
ticular evaluation. Also, the judges may not 
have been forewarned that they would be 
assessing the communicator’s position and 
hence not alerted to provide an objective or 
realistic appraisal of his viewpoint while they 
were listening to the talk. This probably per- 
mitted attitudinal factors to have relatively 
greater influence than they might otherwise 
have. Most of the preceding comments are 
equally relevant to the experiment by Manis
(1960), who in addition had his judges evaluate
the communicators’ positions through the
semantic differential.

When each item of an attitude question- 
naire has been assigned a scale value based 
on a judgment method, a rule has to be 
adopted to determine the single attitude score 
a person should receive if he has agreed with 
more than one item. Different people have 
recommended the mean or the median of the 
scale values, often neglecting to state a ra- 
tionale for their preferences. For instance, 
Thurstone and Chave (1929) suggest the 
mean; but attitude scales developed by 
Thurstone’s procedure often recommend the 
median. When discussing the method of equal- 
appearing intervals, Edwards (1957) says 
that either the mean or median may be used 
to assign a single attitude score to the re- 
spondent; but when considering the method 
of successive intervals, he suggests the median. 
Loevinger (1948) proposes the median as the 
“appropriate” score for homogeneous differ- 
ential scales; so does Green (1954) who 
justifies this preference on the basis that “the 
median is less sensitive to the particular set 
of scale values in the scale.” When the items 
agreed with have scale values that form an 
unbroken segment of the attitude continuum 
or if only two items are agreed with or if the 
items agreed with have scale values that
approximate an arithmetic series, then there 
is little to choose between the mean and the 
median on the basis of metric considerations. 
However, “gaps” in the scale values of the 
items accepted and a skewed distribution of 
scale values are quite often observed together. 
And when scales of undetermined dimensionality
are used, the interpretation to be given
this pattern is not at all clear or unequivocal. 
Then the decision to use the mean or median 
scale value as the assigned attitude score be- 
comes essentially an arbitrary one. Patently, 
a rule for assigning an attitude score need not 
relate to people’s judgments of the attitude 
positions of others, even though such evalua- 
tions are based on information about several 
opinions. But neither should such empirical 
evidence be ignored. Furthermore, the basic 
values assigned the individual statements and 
entering into the attitude score, however de- 
termined, were derived from just such judg- 
ments of single statements. Hence, the clear- 
cut relation between judgment and the mean, 
rather than the median, of the scale values 
of the statements composing the triplets sug- 
gests that the former be used as the summary 
attitude score. In the previous study by 
Weiss (1961) a high correlation was found 
between the subjects’ self-ratings of position 
on the 1-9 graphic scale and the median of 
the scale values of the opinion statements 
they agreed with. This result would not be 
inconsistent with the metric import of the 
current data, if the mean were found to 
covary with the median. An analysis of the 
data of the previous study substantiated this 
surmise; for the product-moment correlation 
between the attitude scores based on the 
mean and median of the scale values of the 
statements agreed with is .98 for the sample 
of subjects. 
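
The metric point at issue, that the mean and median agree for evenly spaced scale values but part company when the endorsed items show gaps or skew, can be made concrete with a small sketch. The scale values below are hypothetical and are not taken from the Peterson-Thurstone scale; the function name `attitude_scores` is likewise an assumption. The midpoint is included only because it figures in the adaptation-level comparison.

```python
from statistics import mean, median


def attitude_scores(scale_values):
    """Candidate summary scores for the items a respondent agreed with."""
    values = sorted(scale_values)
    return {
        "mean": mean(values),
        "median": median(values),
        # Half-way between the smallest and the largest scale value.
        "midpoint": (values[0] + values[-1]) / 2,
    }


# Hypothetical scale values of three endorsed items.
evenly_spaced = [2.0, 3.0, 4.0]     # approximates an arithmetic series
skewed_with_gap = [2.0, 3.0, 9.0]   # a gap and a skewed distribution

for label, items in (("even", evenly_spaced), ("skewed", skewed_with_gap)):
    print(label, attitude_scores(items))
```

In the first set the three scores coincide; in the second the median stays with the two clustered items while the mean and the midpoint are drawn toward the outlying one, which is the circumstance in which the choice of rule matters.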


⁵ In his study, Willis (1960) used the mean of the
individual scale values as the score to be related to
the single judgment of the set of objects. However,
Parducci, Calfee, Marshall, and Davidson (1960) in
a test of adaptation level theory provide data showing that
the mean of a stimulus series is not a uniformly
adequate estimate of the obtained adaptation level,
measured as the average physical value of the stimuli
placed in the midpoint category. The median and
midpoint (the value of a stimulus that would be
half-way between the largest and the smallest) of
the series are more accurate as predictors of adaptation
level. Although stimuli and procedures, including
the essential task requirements of the studies, are 
grossly different, it is of interest that the median 
values of the components of the triplets were 
relatively insensitive in this experiment, while the 
midpoint values were as adequate as the mean. 
The product-moment correlation between the latter 
two was .96. 
The results of this study also have implications
for the development of persuasive communications.
When addressing himself to a
particular issue, a communicator will custom- 
arily offer a variety of opinion statements 
either in one communication or in a set. If 
the unequivocal communication of a specified 
position is desired, then the range of the 
evaluative implications of the opinions he 
expresses should be narrow. A wide range or 
a skewed distribution that affects the mean 
value may lead to interpretations of his posi- 
tion that diverge slightly or sharply from the 
desired one or are held with great uncertainty. 
(Of course, a wily propagandist may take 
advantage of these effects by varying pur- 
posively the significances of his opinions, ex- 
pressing both extreme and moderate or quali- 
fied views in the same communication or at 
different times.) Besides affecting judgment, 
the pattern of expressed opinions may in- 
fluence the persuasive effectiveness of a 
communication; for research has intimated, 
though not unequivocally, a relationship be- 
tween communication effectiveness and the 
discrepancy between the audience's judgments 
of a communicator's viewpoint and their own 
initial positions (Hovland & Pritzker, 1957; 
Weiss, 1958). 
INTRAINDIVIDUAL INSTABILITY AND CONFLICT ¹


LEONARD WORELL ²
Oklahoma State University 


Concern here is with the prediction of the instability or apparent unpredict- 
ability of an individual's behavior when faced with the same situation on 
repeated occasions. A theory is presented that depicts the variable individual in 
any specific set of situations as a comparatively conflicted one in those 
situations. 3 corollaries of the theory are tested. These are: (a) that strong 
conflict produces greater intraindividual instability than weak conflict, (b) 
that the more variable persons as contrasted to the less variable ones in any 
given situation will behave as though that situation contained greater conflict, 
and (c) that the intraindividually unstable person brings into a new situation 
the instability that was developed in previously experienced similar situations. 
4 experiments are reported which provide supportive evidence for the foregoing
propositions.


It would scarcely seem necessary to docu- 
ment the point that much of an individual's 
behavior over time may be characterized by 
instability or variability, and that individual 
organisms differ from one another in the 
amount of instability shown in both ap- 
parently “old” and “new” situations. Despite 
the widespread occurrence of variable be- 
havior, comparatively little attention has been 
given to this problem, for most psychologists 
have been primarily concerned with the aver- 
age, modal, or relatively consistent aspects of 
the responses of either their individual sub-
jects or groups. This paper is based on the
view that the instability of an individual’s be- 
havior over time is an important and predict- 
able event. Further, it is believed that both 
a theoretical and experimental clarification of 
the phenomenon of instability should con- 
tribute to the adequacy of our predictions of 
the behavior of single individuals. 

1 This study was supported in part by Grant

As conceived here, variability or instability
in its simplest form is the difference in quality,
magnitude, etc. between any two responses,
such that the greater this difference the larger
the variability. This definition makes it clear
that the critical need of any presumed
“theory” of variability is to be able to ac-
count for this difference between responses.
Before presenting such a position, however, it 
is worth taking note of a useful distinction 
between types of variability made by Fiske 
and Rice (1955), who suggest that variable 
behavior may be viewed as occurring either 
in relation to changing environmental stimula- 
tion or to constant and repeated stimulation. 
In relation to the first of these, it is apparent
that psychologists have been working for a 
good many years on the simple proposition 
that behavior changes are correlated with ob- 
jective stimulus changes. This form of vari- 
ability, however, is not our main concern; 
rather our interest here is in the occurrence 
of variable behavior in the face of apparently 
identical and repeated stimulation. Thus, for 
example, we are asking how it comes about 
that some persons show highly unstable time 
performances in the classical reaction time 
situation while others are relatively consistent, 
that some shift their responses in a yes-no 
test given on repeated occasions while others 
tend to reproduce their initial performance, 
and so on. 

3 Fiske and Rice (1955), in their comprehensive
review, actually distinguish three types of variability.
Reference should be made to them for a more
extended treatment of the distinctions.

Our theory has its roots in the phenomenon of
conflict. Conflict is viewed here as the simulta-
neous arousal of incompatible response disposi-
tions. In general, the present formulation
suggests that in any particular situation the un-
stable or variable individual is a relatively con-
flicted one, while the nonvariable individual is
relatively free of conflict. In order to see the 
application of conflict more clearly, let us 
look at what is taking place for both a rela- 
tively variable and nonvariable individual 
when they are exposed to the “same” stimula- 
tion on repeated occasions. In order generally 
to account for the responses, whatever they 
may be, of either individual, we make the 
conventional assumption that any response 
which occurs in relation to any particular 
stimulus at any given time is the strongest 
response in an individual’s repertory. More- 
over, this is assumed to hold irrespective of 
whether strength of response is measured on 
a continuum such as speed or qualitatively 
such as “yes” and “no” alternatives. Taking 
now the behavior of a relatively nonvariable 
individual under the same repeated stimula- 
tion, the present position would see such a 
person as possessing either one or a few rela- 
tively strong response dispositions in re- 
lation to the stimulus context. And al- 
though the situation may arouse some alterna- 
tive and competing dispositions for this indi- 
vidual, these are consistently weaker than the 
few or lone dominant one(s). Thus, in the 
language of conflict, the nonvariable person 
is relatively conflict-free. In contrast, the 
comparatively variable individual is construed 
as one in whom the stimulus arouses two or
more competing response dispositions, such
that on one stimulus occasion one tendency is
stronger and, in others, competing dispositions
assume dominance. It would also follow from
this that the more nearly equal in strength
the competing dispositions aroused by the
stimulus, the more variable would the in-
dividual’s behavior appear. Hence, the un-
stable individual is, in this situation, a com-
paratively conflicted one. Note should be
made of the compatibility of the present view
with that recently expressed by Fiske (1961)
where variability is regarded as being in-
versely related to response strength. Although
admittedly simple, it is believed that the
present formulation in terms of conflict con-
tains a large number of testable implications
and our purpose here is to examine three of 
these. 
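
The reasoning of the preceding paragraph may be illustrated by a small simulation. What follows is only a sketch and not part of the original formulation: it assumes that each of two competing dispositions has a fixed strength to which independent Gaussian noise is added on every trial, and that the momentarily stronger disposition is the response emitted. The function names, the noise model, and the particular strengths are hypothetical and serve only for illustration.

```python
import random


def simulate_choices(strength_a, strength_b, noise_sd=1.0, trials=200, seed=0):
    """Return the winning disposition ('A' or 'B') on each repeated trial.

    Assumption of this sketch: momentary strength is the fixed strength
    plus independent Gaussian noise, and the stronger disposition wins.
    """
    rng = random.Random(seed)
    choices = []
    for _ in range(trials):
        a = strength_a + rng.gauss(0.0, noise_sd)
        b = strength_b + rng.gauss(0.0, noise_sd)
        choices.append("A" if a >= b else "B")
    return choices


def switch_rate(choices):
    """Proportion of consecutive trial pairs on which the response changed."""
    switches = sum(1 for prev, cur in zip(choices, choices[1:]) if prev != cur)
    return switches / (len(choices) - 1)


if __name__ == "__main__":
    # Smaller gaps stand for more nearly equal (more conflicted) dispositions.
    for gap in (4.0, 2.0, 1.0, 0.0):
        rate = switch_rate(simulate_choices(5.0 + gap, 5.0))
        print(f"strength gap {gap:.1f}: switch rate {rate:.2f}")
```

Under these assumptions the response shifts from trial to trial more often as the two strengths approach equality, which is the sense in which the more conflicted individual is expected to appear the more variable one.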

First, if the variable or unstable individual 
is in fact in greater conflict as proposed, then
conflict situations of varying degrees of 
severity should be systematically associated 
with varying amounts of intraindividual in- 
stability. It has been noted on several occa- 
sions (Miller, 1944) that more conflictful as 
contrasted with less conflictful situations do 
produce greater variable behavior in groups 
of subjects. These observations are, however, 
only indirectly relevant to our immediate 
aims, for first, it was not determined whether 
increased variability was also systematically 
characteristic of individual subjects, and sec- 
ond, the presence of avoidance stimuli in the 
situation somewhat complicates the interpreta- 
tion of whether punishment or conflict or both 
produced the group variability. Consequently, 
we have designed an experiment to assess 
intraindividual instability in a discrimination 
conflict setting in which aversive aspects have 
been reduced to a minimum. Within this 
setting, our formulation suggests that greater 
conflict will be associated with more unstable 
intraindividual behavior. 

Second, our characterization of the variable 
individual as more conflicted leads us to 
expect that such an individual should behave 
as though he was in greater conflict than a less 
variable individual even when the objective 
situation does not appear to be a particularly 
conflictful one. Stated in another way, this 
would mean that for a group of individuals 
exposed to the same stimulation, the more 
variable individuals, as determined in that 
situation, should respond as if the situation 
were conflict arousing. In order to get at this 
hypothesis, it is necessary to obtain a response 
measure that has consistently been associated 
with conflict and a task that is relatively non- 
conflictful. With respect to the former, the 
literature on conflict has repeatedly indicated 
that greater conflict produces slower perform- 
ance (Berlyne, 1957; Broadhurst, 1957; 
Brown, 1942; Cartwright, 1941; Castaneda 
& Worell, 1961; Finger, 1941; Worell, 1962b; 
Worell & Castaneda, 1961b). In relation to 
the latter, the classical simple reaction time 
(RT) task in which a single stimulus is 
presented would not appear to be overly 
complicated by very many competing disposi-
tions. Taking task and response measure to-
gether, our theory provides the hypothesis
that the more unstable individuals will mani-
fest slower performance in simple RT.

Finally, accepting the reasonableness of the 
preceding hypothesis, the question arises as 
to how it comes about that one individual is 
more variable than another when confronted 
with the same stimulation repeatedly. Al- 
though it is arguable that the stimulation 
given on successive occasions is not the same 
and thus produces variations in response 
(Guthrie, 1944), this view has not proved 
especially fruitful because of the elusiveness 
of objectively specifying the nature of these 
stimulus changes. As a result, our preference 
is for an approach that rests on the supposi- 
tion that variability is something which an 
organism brings into a situation. The problem 
then becomes one of what is brought in. 
Our general formulation suggests a simple 
solution: if the variable or unstable individual 
is conflicted, then it is conflicting response dis- 
positions which are brought into a situation. 
Elsewhere, and in a related context, several 
studies (Worell, 1962a, 1962b; Worell & 
Castaneda, 1961a) have been reported on the 
general problem of the generalization of con-
flict. The findings have given strong support
to the position that prior exposure to different 
severities of conflict generalizes differentially 
to new situations and that the magnitude of 
these generalization effects is dependent upon 
the similarity of the new situation to the 
original conflict training circumstances. Thus, 
previous exposure to strong as compared to 
weak conflict contexts led subsequent situa- 
tions to be responded to as though they were 
more conflictful. Drawing on these results and 
our general formulation, we anticipate that 
prior exposure to different degrees of conflict 
will be associated with varying amounts of 
intraindividual instability in a newly en- 
countered similar situation. 

In the following, three experiments will be 
reported. Our theory suggests that stronger 
conflict will produce greater intraindividual 
instability than weaker conflict, that the more 
unstable individuals in a situation will respond 
to that situation as though it was more con- 
flictful, and that the more variable individual 
carries into a new situation conflicting dis- 
positions that were developed in previously 
similar contexts. 
GENERAL METHOD


Historically, conflict or the simultaneous arousal 
of incompatible response dispositions has been experi- 
mentally induced by one of two major procedures— 
either training or discrimination conflict. Although 
the distinction between these may be kept descrip- 
tively separate, the separation is not a pure one. 
Thus, with respect to the first, training procedures 
have taken one of two principal forms: that in which 
different amounts or kinds of training are given to 
either different stimuli or the same stimulus, as 
illustrated by the work of Sears and Hovland (1941), 
and that in which the previous learnings of the 
individual are used and instructions induce dif- 
ferential response strengths. The recent work of 
Festinger and associates aptly illustrates this latter 
approach where subjects have been instructed to 
provide a "commitment" or approach response, for 
example, subjects asked to lie (Festinger & Carl- 
smith, 1959), subjects asked to eat a disliked 
vegetable (Brehm, 1959). 

As an outgrowth of work by Pavlov and associates 
and that on so-called “experimental neurosis,” the 
second major type of conflict inducing procedure is 
that in which two or more stimuli, i.e., tones, lights,
etc. of varying similarity to one another are simul- 
taneously or consecutively presented. Apart from 
tradition, however, Miller (1944) has provided a 
rationale for the use of discrimination conflict which 
is firmly based on the empirical phenomenon of 
stimulus generalization. Thus, it has been repeatedly 
shown that when an organism is trained to respond 
to a given stimulus, the strength of response follows 
a declining gradient as stimuli progressively dis- 
similar from the training stimulus are presented. In 
relation to discrimination conflict, when an organism 
is presented simultaneously with two stimuli that are 
very similar, the organism is in greater conflict than 
when confronted with two quite dissimilar stimuli 
because the responses aroused by the more similar 
stimuli are more equal in strength. Thus, greater 
conflict results from the greater generalization be- 
tween the response dispositions to the two more 
similar stimuli. 

The blurring of the distinction between the two 
conflict inducing procedures becomes apparent when 
we note that in the discrimination conflict setting: 
different amounts or kinds of training are given to a 
stimulus with lower organisms, and with human 
organisms instructions are frequently used to induce 
a greater response strength to one of the simul- 
taneously presented stimuli. Nothing that we know 
of at the present time suggests that these basic 
inducing procedures are fundamentally different. 
Thus, such conventionally accepted indices of con- 
flict as vacillation, increased reaction time, hesitancy, 
etc., have been repeatedly found to be associated with 
both procedures. Consequently, in the following we 
have elected to use the discrimination conflict 
paradigm because of the relatively greater control 
over experience factors afforded by such a paradigm 
in dealing with human adult subjects. 




Equipment. The apparatus used in all three experi- 
ments is the same as that described elsewhere 
(Worell, 1962a, 1962b; Worell & Hill, 1962). In 
brief, it consists of two circular, milk glass windows 
which may be illuminated by separate 7-watt lamps 
varying in brightness. The windows are set 11 inches 
from one another in a black panel on a horizontal 
plane. At the base of the panel a start platform is 
located at equal distances from each window, and 
5 inches above each of the right and left edges of 
the start platform is a protruding circular, concave 
push button. Centered to the platform and slightly 
above the windows is a small ready-signal light. 

This apparatus permits the presentation of either 
simple RT or discrimination conflict. The lighting of 
either window simultaneously activates a Standard 
Electric timer which is stopped by a subject lifting 
his hand from the start platform. The lights in the
windows are extinguished by the exertion of slight 
pressure on either push button. In all experiments 
the response measure obtained was response initiation 
latency which consisted of the time from the onset 
of illumination to removal of the hand from the 
start platform. All latencies were recorded to the 
nearest 1/100 of a second. 


EXPERIMENT I: CONFLICT AS A DETERMINANT 
OF INTRAINDIVIDUAL INSTABILITY 


Method 


Subjects and procedure. One of two levels of dis- 
crimination conflict was presented to 48 female sub- 
jects, half of whom were given strong conflict and 
the remaining half weak conflict. The stimuli for 
the conflict conditions were made up of some com-
bination of two brightness levels, 2.8 and 291.4
foot-candles. These values were determined with a
Macbeth illuminometer and the exposure of the ap-
propriate illumination in either window was ac-
complished by the setting of variable transformers
in circuit with each lamp. The weak conflict con-
dition consisted of the simultaneous presentation of
each brightness value, while strong conflict was com-
posed of the exposure of the 291.4 foot-candle
illumination in both windows.

Each subject, participating individually, was in-
formed that the experiment was concerned with the
accuracy of visual discrimination. The subject was
further told that she should designate the "brighter"
of the two exposed lights by raising her hand from
the start platform and pressing the button below
the window with the correct stimulus. Each subject
used her preferred hand throughout and was in-
structed to keep her hand on the start platform
until a decision had been made about the location
of the correct stimulus. Since previous work (Worell
& Worell, in press) suggests that the typical instruc-
tions used in RT situations seem to possess energizing
or motivational properties which could complicate
interpretations here, the present instructions did not
contain any implication that speed was considered
important.

All subjects were presented with 16 trials of the
relevant conflict condition. In the case of the weak 
conflict condition where the two stimuli were dis- 
similar, the location of the correct stimulus was 
varied such that it did not appear in the same posi- 
tion more than three consecutive times. 


Results 


Each subject's response latencies for each 
conflict trial were grouped, following a log- 
arithmic transformation, into two  8-trial 
blocks. Within each trial block, a separate 
standard deviation was computed for each 
subject in order to obtain a measure of intra- 
individual variability. The mean intraindivid- 
ual variability by trial block is presented in 
Table 1. A Lindquist (1953) Type I analysis 
of variance, summarized in Table 2, was per- 
formed on the variability data and indicated 
that all three effects were significant. The 
main effect of Conflict was highly reliable 
(p < .001) and reveals that strong conflict 
produces far greater intraindividual variability 
than weak conflict. While the Trial block find-
ing and Trials × Conflict interaction are both
significant (p < .05), these trial results do 
not appear to be particularly stable as noted 
below. 
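
The variability measure just described can be sketched in a few lines. The sketch below is illustrative only: the base of the logarithm and the choice of the sample (n - 1) standard deviation are assumptions, since neither is stated in the text, and the function name and the example latencies are hypothetical.

```python
import math
import statistics


def block_variabilities(latencies, block_size=8):
    """Return one standard deviation per trial block for a single subject.

    Latencies are log-transformed first (base 10 is assumed here), then
    grouped into consecutive blocks of `block_size` trials.
    """
    logs = [math.log10(x) for x in latencies]
    blocks = [logs[i:i + block_size] for i in range(0, len(logs), block_size)]
    return [statistics.stdev(block) for block in blocks]


if __name__ == "__main__":
    # Hypothetical latencies for the 16 conflict trials of one subject.
    example = [0.52, 0.61, 0.48, 0.75, 0.66, 0.58, 0.90, 0.49,
               0.55, 0.57, 0.53, 0.60, 0.56, 0.54, 0.59, 0.58]
    first_block, second_block = block_variabilities(example)
    print(first_block, second_block)
```

Each subject thus contributes two scores, one per 8-trial block, and it is these scores that enter the analysis of variance.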

Previous research (Miller, 1944) indicates 
that different strengths of conflict are as- 
sociated with different amounts of group vari- 
ability. In order to separate the contribution 
of conflict to intraindividual instability from 
that of group variability, we have initially 
made an analysis of the differences in group 
variance (Table 3) between the two conflict 
groups. An F test on these group variances was 
significant (F = 2.66, df = 23/23), supporting
the view that strong conflict leads to increased 
group variability. In order to eliminate these 


TABLE 1

MEAN INTRAINDIVIDUAL VARIABILITIES UNDER STRONG
AND WEAK CONFLICT OVER TRIAL BLOCKS

                Original groups           Revised groups
Conflict             Trial block               Trial block
group           N     1      2            N     1      2
Strong          24   .149   .149          20   .155   .158
Weak            24   .088   .057          20   .085   .058

TABLE 2
ANALYSIS OF VARIANCE OF INTRAINDIVIDUAL
VARIABILITIES FOR STRONG AND WEAK
CONFLICT GROUPS

                   Original groups            Revised groups
Source             MS       df    F           MS       df    F
Conflict (C)       .13922    1   26.22**      .14500    1   22.66**
Error (b)          .00531   46                .00640   38
Trial blocks       .00574    1    4.16*       .00224    1    1.32
T × C              .00572    1    4.14*       .00522    1    3.07
Error (w)          .00138   46                .00170   38

* p < .05.
** p < .001.


group variance differences, we removed the 
two fastest and two slowest (mean latency) 
subjects from the Strong Conflict group and
randomly discarded 4 subjects from the Weak 
Conflict group. Thus, with 20 subjects remain- 
ing in each group (Table 3), the mean dif- 
ference in decision latency performance be- 
tween Strong and Weak Conflict is still highly 
significant (F = 101.31, df = 1/38), but the 
difference in group variance associated with 
these means is now insignificant (F = 1.56, 
df = 19/19). Hence, significant mean per- 
formance differences exist between the two 
conflict groups but no group variance differ- 
ences are present. Further, with these new 
groups, an analysis was performed on the dif- 
ferences in intraindividual variability between 
strong and weak conflict. The F of 22.66 
(df = 1/38) in Table 2 is still significant well 
beyond the .001 level, while neither trials nor 
its interaction with conflict is significant. The 
conflict finding reveals that although no group 
variability differences are present, intraperson 
variability is highly affected by conflict. The 
unreliability of the individual is therefore 
sharply influenced by increased conflict. 
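
The F ratios cited in this section can be checked against the tabled values by simple arithmetic. The following is a brief sketch of that check, with hypothetical function names: the group variance test is the ratio of the larger to the smaller squared SD in Table 3, and each analysis of variance F in Table 2 is a mean square divided by its error mean square.

```python
def variance_ratio(sd_larger, sd_smaller):
    """F for a difference in group variance: larger variance over smaller."""
    return sd_larger ** 2 / sd_smaller ** 2


def f_ratio(ms_effect, ms_error):
    """F for an analysis of variance effect: effect MS over error MS."""
    return ms_effect / ms_error


if __name__ == "__main__":
    # SDs as printed in Table 3 (original and revised groups).
    print(round(variance_ratio(4.13, 2.53), 2))
    print(round(variance_ratio(3.12, 2.50), 2))
    # Mean squares as printed in Table 2 (Conflict effect over between-subjects error).
    print(round(f_ratio(0.13922, 0.00531), 2))
    print(round(f_ratio(0.14500, 0.00640), 2))
```

The four printed values agree with the F of 2.66 and 1.56 reported for the group variances and with the Conflict F of 26.22 and 22.66 in Table 2.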


TABLE 3

MEAN LOG LATENCIES AND SDs FOR STRONG AND
WEAK CONFLICT GROUPS OVER TRIALS

Conflict        Original groups          Revised groups
group           N     M       SD         N     M       SD
Strong          24   38.91   4.13        20   38.82   3.12
Weak            24   29.33   2.53        20   29.64   2.50
EXPERIMENT II: SITUATIONAL INTRAINDIVID-
UAL INSTABILITY AND CONFLICTLIKE
PERFORMANCE


Method 


Subjects and procedure. A total of 102 subjects, 
divided equally according to sex, were exposed to a
simple RT situation. This involved the use of only 
one of the windows in the discrimination apparatus. 
Left-handed subjects were presented with the visual 
stimulus in the window to their left and right- 
handed subjects in the window on their right. The 
stimulus in either case had a brightness value of 
90.9 foot-candles. Each subject was requested to 
press the button below the light following its ap- 
pearance. A total of 20 trials was administered to 
each subject with an interval of 12 seconds interven- 
ing between the flashing on of the ready signal and 
the appearance of the light in the appropriate 
window. As before, no indication was given that 
speed was considered important. 


Results 


Our analysis is based on the response 
latencies of each subject during the second 
block of 10 trials. Again, a separate standard 
deviation was computed for each subject de- 
scribing his performance on these trials. Then, 
each of the male and female groups was 
divided into equal thirds of high, medium, and 
low intraindividual variability according to 
the magnitude of the SD and a 2 x 3 analysis 
of variance was performed on the response 
latencies of the subjects in the various groups. 
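
The grouping step described above may be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually coded for the study: the function name and input layout are hypothetical, the sample standard deviation is assumed, and any subjects left over when the group size is not divisible by three are simply dropped from the low end of the ranking. At least three subjects are required.

```python
import statistics


def tercile_group_means(latencies_by_subject):
    """Split subjects into thirds by their own SD and return mean latency per third.

    `latencies_by_subject` maps a subject label to that subject's trial latencies.
    """
    per_subject = [
        (statistics.stdev(trials), statistics.mean(trials))
        for trials in latencies_by_subject.values()
    ]
    # Rank subjects from most to least variable.
    per_subject.sort(key=lambda pair: pair[0], reverse=True)
    third = len(per_subject) // 3
    groups = {
        "high": per_subject[:third],
        "medium": per_subject[third:2 * third],
        "low": per_subject[2 * third:3 * third],
    }
    return {
        name: statistics.mean([mean for _, mean in members])
        for name, members in groups.items()
    }


if __name__ == "__main__":
    # Hypothetical data for six subjects, three trials each.
    data = {
        "a": [400, 500, 600], "b": [380, 460, 540],
        "c": [350, 400, 450], "d": [330, 370, 410],
        "e": [300, 310, 320], "f": [295, 300, 305],
    }
    print(tercile_group_means(data))
```

In the experiment this split was made separately within the male and female groups, so the function would be applied once to each sex before the 2 x 3 analysis.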

Reference to the mean latencies of each group
for the 10-trial period in Table 4 will show
that there is a progressive increase in speed of
responding as one moves from the most vari-
able to the least variable groups for both
males and females. Our analysis of variance
revealed only one significant effect; the main
effect for variability was significant well
beyond the .001 level (F = 10.73, df = 2/96).
Neither the main effect of sex, nor its inter-
action with variability, was significant. In
order to assess the more specific nature of the
differences in latency performance between
the three variability groups, Duncan’s (1955)
multiple range test was used. This analysis
showed that the performance differences be-
tween the low variable group and both of the
moderately and highly variable groups were
significant at the .001 level, while that be-
tween the latter two groups themselves failed
of significance. Thus, relatively performance
stable individuals tended to behave as though
they were conflict-free, while both moderately
and highly variable persons manifested sig-
nificantly more conflictlike behavior.

4 The data collected here is part of a larger project
on the relationship between personality variables and
performance.

TABLE 4
MEAN LATENCIES AND SDs FOR HIGH, MEDIUM, AND LOW INTRAINDIVIDUAL VARIABILITY
GROUPS OVER TEN TRIALS

                         Experiment II
            Males                  Females                Experiment IV
Group       N    M       SD        N    M       SD        N    M       SD
High        17   366.71  89.36     17   382.65  73.77     10   401.80  93.66
Medium      17   360.47  69.34     17   377.41  62.87     10   314.90  47.80
Low         17   321.53  56.66     17   286.29  47.26     10   286.60  62.88

A further examination was made here of 
possible differences in group variances as- 
sociated with the mean latency differences of 
our high, medium, and low intraindividual 
variability groups. Bartlett’s test provided an
uncorrected χ² of 7.65 (df = 5) which is not
significant. Thus, although there is no sys-
tematic significant relationship between group
means and group variances, intraperson in-
stability is highly associated with the ap-
pearance of conflictlike behavior.
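
The uncorrected Bartlett statistic can be recomputed from the group SDs and Ns alone. The sketch below assumes the usual form of the statistic, the total error df times the log of the pooled variance minus the df-weighted sum of the log group variances, without the correction factor; the function name is hypothetical, and the SDs are those printed in Table 4.

```python
import math


def bartlett_uncorrected(sds, ns):
    """Uncorrected Bartlett chi-square for homogeneity of k group variances.

    Computed from summary statistics only; df for the test is k - 1.
    """
    dfs = [n - 1 for n in ns]
    variances = [sd ** 2 for sd in sds]
    pooled = sum(df * v for df, v in zip(dfs, variances)) / sum(dfs)
    weighted_logs = sum(df * math.log(v) for df, v in zip(dfs, variances))
    return sum(dfs) * math.log(pooled) - weighted_logs


if __name__ == "__main__":
    # Six groups of Experiment II (three male, three female), 17 subjects each.
    exp2_sds = [89.36, 69.34, 56.66, 73.77, 62.87, 47.26]
    print(bartlett_uncorrected(exp2_sds, [17] * 6))
    # Three groups listed under Experiment IV, 10 subjects each.
    exp4_sds = [93.66, 47.80, 62.88]
    print(bartlett_uncorrected(exp4_sds, [10] * 3))
```

With the rounded SDs of Table 4 the results fall close to the reported values, the small discrepancy for Experiment II presumably reflecting rounding of the tabled SDs.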


EXPERIMENT III: GENERALIZATION OF INTRA-
INDIVIDUAL INSTABILITY 5


Method 


Subjects and procedure. A three-stage design, em-
ployed in former work (Worell, 1962a, 1962b), was
used here to assess the generalization of intraindivid-
ual instability. This design consists of a preconflict
training stage during which all subjects are exposed
to the same stimulus conditions for a predetermined
series of trials; a conflict training stage where sub-
jects are subjected to varying severities of conflict;
and finally a postconflict training stage in which
the original preconflict training situation is rein-
troduced. The preconflict phase serves the purpose
of insuring that individuals who are later exposed
to differing treatments are initially comparable in
performance.

5 The data here are a portion of a study on the
role of conflict in learning. The same analysis that
is presented here was performed on the data obtained
in two previous comparable studies (Worell, 1962a,
1962b). Since the results of these analyses duplicated
those that are presented here, we have omitted their
more detailed presentation.

Here, 24 female college subjects were initially 
exposed to a weak conflict situation for a series of 
16 trials. Following this, half of the subjects ex- 
perienced strong conflict while the remaining half 
continued under weak conflict for an additional 24 
trials. Finally, all subjects were again exposed to the 
original weak conflict situation for 24 trials. As in 
Experiment I, weak conflict consisted of the exposure 
of the 2.8 and 291.4 foot-candle brightnesses in the 
windows, while strong conflict was made up of the 
brighter value being presented in both windows. 
In all other respects the procedure was the same 
as that in the first experiment. 


Results 


The three stages of the experiment were 
analyzed and will be discussed separately. 

Preconflict. The mean intraindividual vari- 
abilities for the strong and weak conflict 
groups on the 8 trials of weak conflict im- 
mediately preceding the conflict training series 
are presented in Table 5. Since we attempted 
to make the two groups comparable by match- 
ing intraindividual variabilities, it is not sur- 
prising that no significant difference is found 
between the groups in preconflict.6 Thus,
before entering the differential conflict train- 
ing condition, the strong and weak conflict 
groups were comparable in the intraindividual 
variability of their response to weak conflict. 

6 Twenty-eight subjects had initially participated
in this experiment; four of these subjects, two from
each conflict group, were discarded in order to
equalize the preconflict individual variabilities.

TABLE 5
MEAN INTRAINDIVIDUAL VARIABILITIES

Conflict    Preconflict      Conflict training   Postconflict
group       (Trials 9-16)    (Trials 17-40)      (Trials 41-64)
Strong      7.29             142.57              18.53
Weak        7.29             6.96                7.46

Conflict training. The middle portion of
Table 5 shows the mean intraindividual vari-
abilities obtained by the strong and weak con-
flict groups during the 24-trial conflict train-
ing series. A simple analysis of variance re-
vealed that strong conflict was associated with
heightened intraindividual instability at better 
than the 1% level (F = 9.63, df = 1/22).
This result parallels that obtained in Experi- 
ment I and indicates that the two groups dif- 
fered significantly from one another in terms 
of intraperson variability at the termination 
of the conflict training period. 

Postconflict. Our major interest lies in the 
results obtained here. The mean intraindivid- 
ual variabilities for the strong and weak con- 
flict training groups during the 24-trial post- 
conflict period are found in the right-hand 
portion of Table 5. A simple analysis of 
variance again revealed that the conflict effect 
was significant beyond the 1% level (F = 
13.95, df = 1/22). It seems clear then that
strong conflict training provided for signifi- 
cantly greater intraperson variability during 
the posttraining period. 


Discussion 


Our concern here has been in predicting 
the unpredictability of the individual. We 
have framed our general approach within a 
theory of conflict. Compatible with this for- 
mulation, we have found that strong conflict 
situations produce reliably greater intrain- 
dividual instability than weak conflict con- 
texts. Moreover, even when a situation does 
not appear to be especially conflict arousing, 
the more variable individuals in that situation 
behave as though the situation did contain 
greater conflict than do individuals who are 
less variable. And finally we have noted that 
persons who are exposed to strong conflict 
experiences generalize the greater intraindivid- 
ual instability developed in these situations 
to new, but similar situations. Thus, prior 
exposure to conflict leads other related situa-
tions to arouse more intraindividually un-
reliable behavior.

One aspect of our findings requires addi- 
tional attention. Given the early stage of our 
formulation of intraindividual instability, we 
have made the simplifying assumption that 
there is a direct linear, or at least a monotonic, 
relationship between the degree of intra- 
individual instability shown in a particular 
situation and the appearance of conflictlike 
behavior in that situation. It will be recalled 
that our second experiment which dealt with 
simple RT provided results in line with a 
monotonic position, in that there was a pro- 
gressive increase in speed (or decrease in 
conflictlike behavior) as one moved from the 
most to the least intraindividually variable 
subjects. However, the performance difference 
between the moderately variable and most 
variable subjects was not significant. In order 
to arrive at some clue as to how this came 
about, we have looked more closely at the 
distribution of individual variabilities in our 
three variability groups. This examination 
revealed that the moderately variable group 
was rather strongly skewed toward the high 
variable subjects, so that this group was more 
like the most variable subjects than the least 
variable subjects in terms of intraindividual 
variability. With this state of affairs, our 
general formulation would lead to the expecta- 
tion of a greater similarity in performance be- 
tween the moderately and highly variable sub- 
jects and this of course is what was found. 

In order to check this out further we have 
analyzed the data from a portion of a previous 
study (Worell & Worell, in press) where sub- 
jects were exposed to the same simple RT 
situation as was used here By using a 
tripartite division of individual variabilities as 
before, we again obtained significant latency 


7 A total of 32 subjects had been used in this study. 
These were divided into high, medium, and low 
groups in terms of their individual variabilities with 
the 11th and 22nd ranked individuals being dis- 
carded for purposes of analysis. The mean latency 
performance in .01 second for the 10 subjects in 
each of the high, medium, and low variability groups 
over the 10-trial series is shown in the right-hand 
portion of Table 4 under Experiment IV. Analysis of 
variance indicated that the effect of individual 
variability was significant beyond the 1% level 
(F — 648, df — 2/21). 
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performance differences as a function of vari- 
ability. In contrast to the former analysis, 
however, the moderately variable group here 
differed significantly from the highly variable 
subjects, but they did not differ reliably from 
the least variable subjects. Turning again to 
an inspection of the individual variabilities in 
these new groups, we found that the moder- 
ately variable group was skewed in this case 
toward the least variable subjects in terms 
of intraindividual variability. It would appear 
then that the magnitude of conflictlike be- 
havior appearing in moderately intraindivid- 
ually variable subjects strongly depends upon 
how closely these individuals approach the 
intraindividual variabilities of either extreme 
of variable subjects. In any event, the find-
ings of both studies are consistent with the
proposal of a monotonic relationship, and very 
probably a linear one as is shown in Figure 1, 
between the amount of intraindividual in- 
stability and conflictlike behavior. 

While we have confined our attention thus 
far to relatively simple experimental situa- 
tions, our findings may be extended to broader 
areas of behavior. We assume that all conflicts
(whether they be called attitude conflicts,
discrimination conflicts, belief conflicts, drive
conflicts, opinion conflicts, etc.) fundamen-
tally consist of competition among simulta-
neously aroused incompatible response dis-
positions, and the necessity to make a dis-
crimination between different strengths of
response alternatives. In terms of the present
formulation, the smoker who repeatedly ver-
balizes the intent to stop smoking but con-
tinues to smoke, the voter who expresses a
clear-cut political preference but repeatedly
remarks about the “good” qualities of both
candidates, the patient who states that his
boss is a great guy and also considers the
possibility of taking another job, etc. would
all be viewed as relatively intrapersonally 


5 As in Experiment II, a further analysis was per-
formed on the group variances (Table 4) associated
with the mean latency differences between the high,
medium, and low intraindividually variable groups.
Bartlett’s test provided an uncorrected χ² of 4.12
(df = 2), indicating that no significant group vari-
ability differences were present. This absence of
group variance differences further emphasizes the
importance of intrapersonal instability in the ap-
pearance of conflictlike behavior.
Fig. 1. Mean response initiation latencies over 10
trials of simple reaction time as a function of the 
magnitude of intraindividual variability. 


conflicted individuals. These individuals would 
be expected to behave in similar, and often 
apparently nonconflictful, situations as though 
they were in comparatively strong conflict— 
by delaying a response, refusing to respond, 
etc. Moreover, they would also be expected 
to manifest relatively unreliable behavior in 
repeated encounters with these situations. In 
view of this it would seem fitting to attempt 
to identify and measure the sources of con- 
flict aroused for an individual in any specific 
situation in order to maximize the prediction 
of that individual's behavior. Finally, it is 
worth remembering that the conflict condi- 
tions used here were of relatively short dura- 
tion and not especially aversive, so that future 
work will need to determine whether more 
long-term and more avoidant conflicts affect 
the individual's performance instability in the 
same way. 


EXTENDED AND TELESCOPED PHANTOM LIMBS IN
UNILATERAL AMPUTEES 1


SAMUEL A. WEISS AND SIDNEY FISHMAN
New York University 


The incidence of extended (same length) and telescoped phantom limbs was 
studied by interview in 239 adult unilateral amputees, above and below elbow 
and knee (AE, BE, AK, BK). Differences were analyzed by nonparametric 
statistics. Conclusions were as follows: (a) AEs reported the greatest, and 
BKs the smallest proportions of shortened phantoms; (b) BE and AK groups 
were approximately similar; (c) telescoped phantoms of proximal amputations 
were mainly partly telescoped—of distal amputations, fully telescoped; (d) 
phantom presence, persistence, strength, shortness, and tactile pressure 
sensitivity were positively related, paralleling the cephalocaudad principle; 
(e) limb image (established by cortical and learning factors); and (f) intensity 
of stump cues (dependent on the cephalocaudad gradient, etiology, and 
sensitization by amputation), affect perception of phantom length and related
sensations.


Following the traumatic or surgical ampu- 
tation of a limb, the illusion of its presence is 
often experienced although the amputee knows 
it is gone. Some amputees experience a whole 
or continuous limb. Usually, however, the 
phantom limb has gaps. Only the distal parts 
of the extremities—hand, foot, or digits—are 
felt and the intervening portions are missing. 
While some amputees experience the phantom 
“most,” or “all the time,” the greater majority 
feel its presence only “sometimes” or “in-
frequently.” The phantom limb may be ex-
perienced as of the same length as the missing
limb—extended; as somewhat shortened—a 
retracted or partly telescoped phantom; as 
completely shortened, the digits may be felt 
within the stump or protruding from it—a 
completely telescoped phantom; or, on rare 
occasions the phantom may be experienced as 
longer than the limb it represents. 

1This investigation was supported in part by a 
Research Special Project grant from the Office of 
Vocational Rehabilitation, Department of Health, 
Education, and Welfare. 

The authors are especially grateful to the personnel 
of the New York University-Prosthetic and Orthotic 
for the assistance offered in the conduct of the study.

We also wish to acknowledge the help of our 
Both central and peripheral factors appear
to be related to these experiences (Haber, 
1954, 1956). Various conditions or procedures 
affecting the stump have been found to 
elicit, modify, or abolish phantom experiences 
(Gallinek, 1939; University of California, 
1952). Nevertheless, central factors play an 
important role as is evident in the occasional 
feeling of superadded sensations such as sores, 
bandages, and clothing on the phantom; the 
occasional influence of preamputation experi- 
ence, such as disease, on the position main- 
tained by the phantom parts; the more fre- 
quent reporting of sensation of those distal 
parts populated more densely with receptors 
as well as having greater cortical representa- 
tion on the Penfield-Boldrey (1937) ho- 
munculus; the consistent finding in a num- 
ber of studies, from those of Head and 
Holmes (1912) to Hécaen, Penfield, Bertrand, 
and Malmo (1956), that abolition of some 
phantom limbs occurs following excision of 
the postcentral gyrus; and the finding that 
phantom limb experience is not related to 
any pattern of sensation that might be in- 
itiated by the major peripheral nerves (Living- 
stone, 1945). Moreover, elimination of peri- 
pheral neuromata often fails to modify phan- 
tom limb symptoms of persistence. In brief, 
both central and peripheral factors (neuro- 
physiological and psychogenic experience) 
play important roles in eliciting phantom 
sensation.




As one aspect of the phantom phenomena, 
the telescoping or shrinking phantom has not 
been clearly understood. At one time this 
occurrence had been regarded as the natural 
course of all phantoms (Henderson & Smyth, 
1948; Katz, 1920, 1921), but recent reports 
have found extended phantoms in many 
amputees long periods after amputation 
(Simmel, 1956; Weiss, 1958). 

The purpose of the study was to investigate 
the incidence of extended and shortened 
phantom limbs in the four types of unilateral 
amputations, above elbow (AE), below elbow 
(BE), above knee (AK), and below knee 
(BK); to examine the relationship between 
phantom length and various premorbid and 
postamputation experiences; and to determine 
whether a theoretical conceptualization can 
be formulated to account for these findings. 


METHOD
Subjects 


The sample consisted of 239 adult above-elbow 
(AE), below-elbow (BE), above-knee (AK), and 
below-knee (BK) unilateral amputees. These were 
individuals participating in various prosthetic studies. 
Forty-three of the subjects were non-Caucasian. 
Nineteen amputees were women. Most of the ampu- 
tees had lost their limbs in war combat or in trau- 
matic accidents, with the upper extremity (UE) 
amputations almost exclusively so. In the lower 
extremities (LE), circulation problems, diabetes, 
osteomyelitis, infections, and cancer were also rep- 
resented. For the entire sample the ages ranged from 
15 to 76, with a mean age of 42.8 years. The number
of years since amputation ranged from 1 to 59, with
a mean of 13.9 years.


Procedure 


Each subject was interviewed for about 20-30 
minutes. Haber’s (1954) questionnaire on phantom 
limb sensation, modified to include information on 
LE amputees, was employed to determine the 
presence and incidence of same length and shortened 
phantom limbs in the four amputation types. Vari- 
ables relating to presence, persistence, intensity of 
phantom, affective tone, tactile pressure sensitivity,
and amputation etiology were also studied as to 
their effect on phantom length. 

While the subjects were encouraged to talk freely, 
care was taken not to influence their responses. The
term “phantom limb” was avoided unless it was 
introduced by the subject. If the interviewer was 
pressed for an opinion on the subject’s phantom 
experience, the latter was told that there were no 
correct or incorrect answers and that whatever his
experience, other amputees had similar ones. Often,
the subjects spontaneously related their experiences
and provided relevant information even before 
specific questions were asked. As the interview 
progressed subjects generally responded to the non- 
judgmental and permissive atmosphere by com- 
municating their experiences freely. Some individ- 
uals who initially seemed somewhat hesitant about 
describing particular qualities of phantom experi- 
ence later relaxed and spoke of their experience of 
strong phantom sensation, pain, or pleasure. Draw- 
ings of both the phantom and intact limbs were also 
obtained. After the interview a few subjects who had 
originally reported same length phantoms indicated 
on their drawings that they had a shortened 
phantom. 


Response Categories 


Response categories for certain of the variables 
require specification. 

Phantom length. As already indicated, phantom 
limbs were described as same length, retracted or 
partly telescoped, and completely telescoped. A few 
amputees experienced a "longer" phantom because 
it was painful or because of a premorbid dropped 
foot condition. These were included with the same 
length phantoms. A few AE and AK amputees 
experienced their shortened phantoms as bent, or 
retracted and bent, with the phantom midjoint, 
elbow, or knee, present. These were included in the 
retracted category. 

Persistence. Phantom sensation was described as 
felt, “all,” “most,” or “some” of the time. 

Phantom intensity. For the purposes of analysis 
the descriptions of phantom intensity were divided 
into five categories: 


Strong or very strong (consistently strong) 

Strong and less strong (inconsistently strong) 

Mixed reactions (The “mixed” category con- 
sists of those amputees who characterized their 
phantom intensity as mixtures of the four other 
categories without specifically mentioning the word
“strong.”)

Moderate and less 

Faint or vague only 


Affective tone. Descriptions of phantom affective 
tone were divided into four main categories: 


Neutral (Unaccompanied by any mention of 
positive or negative affective tone.) 

Positive (Phantom sensation was experienced as 
a “pleasant” or “good” feeling.) 

Negative (These included such descriptions as 
“painful” ; “occasionally painful”; “unpleasant, but 
not painful”; and “annoying, but not painful.”) 

Mixed (This category included the experience 
of both painful and pleasurable sensation or un- 
pleasant and pleasurable sensation.) 


Nonparametric statistical tests, such as the chi 
square and Fisher exact tests were employed. Refer- 
ences to levels of significance are two-tailed and 
refer to the 5% level unless otherwise indicated.


RESULTS
Experiences Associated with Amputation 


Seventeen amputees denied ever having 
experienced phantom sensation. Eight others 
admitted the experience in the past but were 
unable to provide any information. Five
others could not specify length of phantom, 
and six amputees reported that the length 
varied, leaving 203 who described their phan- 
toms as same length or shortened. 

None of the 25 AE amputees interviewed
denied ever having phantom sensation, as
compared to 12 of the 92 BK amputees
(13%). One BE (3%) and four of the AK
amputees (4%) denied ever having any
phantom.

The AE amputees reported the strongest,
and the BK amputees the weakest, phantom 
sensations. The BE and AK amputees re- 
ported the same proportions. 

The UE amputees as a group experienced 
phantom sensation more persistently, i.e.,
“all” or “most of the time,” to a greater
extent than the LE group. 

A study (in progress) of tactile sensitivity 
(light touch threshold) in the stumps of four 
amputation types showed a gradient of de- 
creasing sensitivity from the AE to the BK, 
as has previously been found in nonamputees. 
This parallels the proximo-distad principle of 
development. 

When the four amputation types were 
ranked on the variables of phantom presence,
persistence, intensity, increased tactile pres-
sure sensitivity (light touch threshold), and
shortened phantoms, a high degree of rela-
tionship was noted which generally parallels
the cephalocaudad or proximo-distad gradient. 


Same Length (Extended) and Shortened 
(Telescoped) Phantom Limbs 


Contrary to some reports of the gradual
shrinking of phantoms after a few years,
numerous amputees experiencing extended
phantoms had been amputated many years
earlier. No relationship was found between
phantom length and the variables of age,
time since amputation, or sex.

For the entire population the distribution 
of same length and shortened phantom limbs
differed significantly in relation to amputee 
types (see Table 1). 


TABLE 1


DISTRIBUTION OF SAME LENGTH AND SHORTENED (RETRACTED
AND COMPLETELY TELESCOPED) PHANTOM LIMBS IN THE
FOUR UNILATERAL AMPUTATION TYPES


                     Same length   Shortened
Amputation type         Total        Total     (Retracted + Completely telescoped)

Entire Sample a
  Above Elbow            11           12       ( 8 +  4)
  Below Elbow            19            8       ( 0 +  8)
  Above Knee             54           27       (22 +  5)
  Below Knee             60           12       ( 2 + 10)
  N                     144           59       (32 + 27)
Caucasian Subsample b
  Above Elbow             3            8       ( 7 +  1)
  Below Elbow            14            8       ( 0 +  8)
  Above Knee             44           23       (18 +  5)
  Below Knee             49           11       ( 2 +  9)
  N                     110           50       (27 + 23)


a Same length X Shortened, χ² = 12.06, df = 3, p < .01.
b Same length X Shortened, χ² = 14.01, df = 3, p < .005.


The AE amputees reported the greatest, 
and the BK amputees the smallest proportion 
of shortened limbs. The BE and AK amputees 
had approximately similar distributions. When 
the Caucasian population was considered 
separately, higher levels of significance were 
obtained. The non-Caucasian group, lower in 
socioeconomic status, showed a nonsignificant 
trend to report extended phantoms. 

The proximal amputations as a group (AE 
and AK), had significantly greater propor- 
tions of shortened phantom limbs than the 
distal amputations. The UE group (AE and 
BE) had greater proportions of shortened 
phantoms than the LE group (AK and BK) 
although the difference was beyond the 5%
level only in the Caucasian subsample (see 
Table 2). 
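
The 2 X 2 entries of Table 2 can be checked against the counts in Table 1. The sketch below is an illustration only: the function name is hypothetical, and the use of Yates' continuity correction is an assumption (the text does not say which form of chi square was computed), adopted here because it reproduces the tabled values for the AE X BK and AK X BK comparisons in the entire sample.

```python
def yates_chi_square(a, b, c, d):
    """Chi square with Yates' continuity correction for the 2 x 2 table
    [[a, b], [c, d]] (rows = amputation types, columns = same length, shortened).
    Hypothetical helper; assumes the correction was used in the source."""
    n = a + b + c + d
    # Corrected numerator: N * (|ad - bc| - N/2)^2, floored at zero.
    numerator = n * max(abs(a * d - b * c) - n / 2, 0.0) ** 2
    # Product of the four marginal totals.
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Entire sample, counts taken from Table 1 (same length, shortened).
ae = (11, 12)
ak = (54, 27)
bk = (60, 12)

print(round(yates_chi_square(*ae, *bk), 2))  # AE X BK, tabled as 9.84
print(round(yates_chi_square(*ak, *bk), 2))  # AK X BK, tabled as 4.73
```

Each of these comparisons has df = 1, so the first value exceeds the 1% critical value of 6.63 and the second exceeds the 5% critical value of 3.84.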

As regards differences within the shortened 
phantom limb category (retracted versus fully 
telescoped), the distal amputations as a group 
were characterized by telescoped limbs, while 
the proximal amputations showed a greater 
proportion (p < .001) of retracted limbs
(Table 3). Highly significant differences in 
this respect (range: p < .01 to < .001) were 
TABLE 2


CHI SQUARE VALUES OF COMPARISONS OF SAME
LENGTH X SHORTENED PHANTOM LIMBS IN THE
FOUR UNILATERAL AMPUTATION TYPES


Comparison and population          χ²         df

Proximal X Distal
  Total                           6.55**       2
  Caucasian                       4.37*        2
Upper X Lower Extremities
  Total                           3.18         2
  Caucasian                       4.78*        2
AE a X AK X BE X BK
  Total                          12.06***      3
  Caucasian                      14.01****     3
AE X (AK + BE) X BK
  Total                          11.93****     2
  Caucasian                      13.98****     2
AE X BK
  Total                           9.84***      1
  Caucasian                      11.39****     1
AK X BK
  Total                           4.73*        1
  Caucasian                       3.36         1
AE X AK
  Total                           1.97         1
  Caucasian                       4.32*        1
AE X (AK + BE)
  Total                           2.42         1
  Caucasian                       4.42*        1
BK X (AK + BE)
  Total                           4.76*        1
  Caucasian                       4.04*        1


Note.—See Table 1 for N in each group.
a AE = Above Elbow, AK = Above Knee, BE = Below
Elbow, BK = Below Knee.


also found in the individual comparisons of 
the distal BE versus the proximal AE, as 
well as between the distal BK and the prox- 
imal AK. The amputees with proximal ampu- 
tations reported a significantly greater amount 
of gradual shortening than those with distal 
amputations who usually claimed that their 
phantoms had always been telescoped. 

The UE amputations as a group reported a 
greater proportion of telescoped limbs, and 
the LE group a greater proportion of retracted 
limbs, although the difference was not sig- 
nificant. 

Persistence and phantom length. Shortened 
phantoms were experienced more persistently, 
i.e., “all” or “most of the time,” than were
extended phantoms, although these differences
were not statistically significant.

Phantom intensity and length. In the LE 
amputation group, and especially in the AK 
group, the reported length of phantom limb
was significantly related to the strength or
intensity of phantom sensation. 

The experience of phantom sensation as 
consistently strong or very strong (Intensity 
I) was predominantly associated with same
length phantoms rather than with shortened ones (see
Table 4). This was not the case for moderate 
sensation (Intensity IV). These differences 
between consistently strong or very strong 
sensation and moderate sensation were at the 
.01 level. 

It should be noted, however, that 12 of the 
13 BK amputees (in comparison to 8 of the 
13 AK amputees) who experienced only faint 
or vague sensations also reported extended 
phantoms. 

Affective tone and phantom length. Painful 
and pleasurable qualities (positive-negative 
affective tone) were consistently associated 
with extended phantoms to a greater extent 
than were purely neutral qualities in both 
AK and BK, but not in UE amputees. These 
differences (see Table 5) between neutral and 
positive-negative affective tone (also including 
“mixed” reactions) in the LE amputation 
group were significant (p < .05). It should be 
pointed out that the BK amputees in par- 
ticular were markedly affected by “painful” 
phantom sensations. Painful tone was associ- 
ated with 24 same length phantoms and only 
1 shortened phantom while the figures for 
neutral sensation were 14 and 6, respectively 
(p < .05).

Thus, both phantom strength and positive- 


TABLE 3


RETRACTED AND TELESCOPED PHANTOM LIMBS IN
PROXIMAL AND DISTAL AMPUTATIONS


Amputation level          Retracted    Telescoped

Entire Sample a
  Proximal                   30            9
  Distal                      2           18
Caucasian Subsample b
  Proximal                   25            6
  Distal                      2           17


Note.—Proximal = AE + AK; Distal = BE + BK.
a Proximal X Distal, χ² = 21.24, p < .001.
b Proximal X Distal, χ² = 20.58, p < .001.
TABLE 4


RELATIONSHIP BETWEEN LENGTH OF PHANTOM LIMB AND INTENSITY OF PHANTOM SENSATION
IN THE LOWER EXTREMITY UNILATERAL AMPUTATIONS
(Total sample)


                                      Length of phantom limb
Phantom sensation               Same length   Shortened
experienced as:                    Total        Total     (Retracted + Completely telescoped)

Above Knee a
  I   Strong or very strong         23            4       (illegible)
  II  Strong and less               13            7       (illegible)
  III “Mixed”                        4            3       (illegible)
  IV  Moderate                       5            8       ( 5 +  3)
  V   Faint or vague                 8            5       (illegible)
  N                                 53           27       (22 +  5)

Below Knee a
  I   Strong or very strong         17            2       (illegible)
  II  Strong and less               12            2       (illegible)
  III “Mixed”                        5            0       ( 0 +  0)
  IV  Moderate                      13            7       ( 1 +  6)
  V   Faint or vague                12            1       (illegible)
  N                                 59           12       ( 2 + 10)


a No information was available for one AK and one BK amputee.


negative affective tone were associated with 
same length, rather than with shortened
phantoms in the lower extremities. 

Phantom intensity and affective tone. Posi-
tive-negative affective tone (pleasure-pain)
was significantly associated with stronger
phantoms in the AK (range: < .001 to
< .01) and BK groups (range: .01 to .02).
Nonsignificant trends in the same direction 
were found in the BE and, to a lesser extent, 
in the AE amputations. 


Amputation Etiology 


Comparison of mechanical trauma and cir- 
culatory disease. As reported, our sample of 
UE amputees consisted almost exclusively of 
individuals amputated because of mechanical 
trauma, occurring from automobile or in-
dustrial accidents, missile wounds, and the
like. In this sample the proportions of
shortened phantoms were greater in the AE
than the BE amputee group.

In contrast to the UE group, a substantial 
number of LE amputees lost their limbs be-
cause of circulatory problems such as vascular
disease, osteomyelitis, or diabetes, or had
these circulatory diseases in addition to the
mechanical trauma. The greater proportions
of shortened phantoms in the AK as compared 
to the BK amputation are primarily attribu- 
table to this disease subgroup. When those 
amputations solely involving mechanical 
trauma were considered, the AK and BK 
amputees had the same proportions (25%) 
of shortened phantoms (13 of the 53 AK, 
and 11 of the 44 BK).2 Specifically, none of
the 23 BK amputees in the disease subgroup 
had a shortened phantom while 7 of the 19 
AK amputees did (p < .01). The differences 
between these two LE amputation types were 
still significant when the diabetics were dis- 
regarded (p < .01), or even when only the
vascular cases were compared (p < .05). The 
differences between the BK disease and 


2As previously reported, the BK phantom 
traverses a shorter distance to become fully tele- 
scoped, while the AK phantom traverses a greater 
distance and is usually partly telescoped. 

As regards stump lengths, a longer amputation 
stump facilitates more satisfactory leverage, pros- 
thetically. AK stumps are made as long as feasible. 
“Long” BK stumps, however, are susceptible to 
circulatory problems, leading to possible reamputation 
at a considerably higher level. The BK stump is 
therefore made relatively shorter than that of the 
AK. Nevertheless, the difference between AK and 
BK amputees, as regards amount of the LE re- 
tained, is considerable. 
TABLE 5


RELATIONSHIP BETWEEN LENGTH OF PHANTOM LIMB
AND AFFECTIVE TONE OF PHANTOM SENSATION
IN THE LOWER EXTREMITY UNILATERAL
AMPUTATIONS
(Total sample)


                         Length of phantom limb
Affective          Same length   Shortened
tone                  Total        Total     (Retracted + Completely telescoped)

Above Knee a
  Neutral               7            9       ( 7 +  2)
  Positive              4            1       ( 1 +  0)
  Negative             37           16       (13 +  3)
  Mixed                 5            1       ( 1 +  0)
  N                    53           27       (22 +  5)
Below Knee
  Neutral              14            6       (illegible)
  Positive              5            0       ( 0 +  0)
  Negative             37            6       (illegible)
  Mixed                 4            0       ( 0 +  0)
  N                    60           12       ( 2 + 10)


a No information was available for one AK amputee.


mechanical trauma subgroups are similarly 
significant and cannot be attributed to differ- 
ences in stump length, phantom intensity, or 
affective tone.

Amputees with circulatory involvement are 
usually older individuals. Nevertheless, age
did not appear to be significantly related to 
extended and shortened phantoms in this or 
any other etiology. 

Cancer and diabetes. Eleven LE amputees 
with cancer as etiology reported phantom 
sensation. Seven of the nine AK subjects and
one of the two BK subjects reported shortened 
phantom limbs—a predominance of phantom 
shortening. The difference in phantom lengths 
between these LE subjects and those ampu- 
tated for other reasons was significant (p 
< .05). All eight diabetic amputees were LE
subjects and had extended phantoms. The
differences in phantom length between the
diabetic and cancer cases were significant
(p < .01).
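
The diabetic versus cancer comparison can be illustrated with a Fisher exact test on the counts given in the text: 8 diabetic amputees, all with extended phantoms, against 11 LE cancer amputees, of whom 8 reported shortened phantoms. The sketch below is only an assumption about how such a value could be obtained; the report names the chi square and Fisher exact tests in general but does not say which was applied to this comparison, and the function name is hypothetical.

```python
from math import comb


def fisher_exact_two_tailed(a, b, c, d):
    """Two-tailed Fisher exact probability for the 2 x 2 table [[a, b], [c, d]].
    Sums the hypergeometric probabilities of every table with the same margins
    that is no more probable than the observed one. Hypothetical helper."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2

    def prob(x):
        # Probability that the first row contributes x cases to the first column.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    tolerance = 1 + 1e-9  # guards against floating-point ties
    return sum(prob(x) for x in range(lowest, highest + 1)
               if prob(x) <= p_observed * tolerance)


# Rows: diabetic, cancer (LE). Columns: extended, shortened.
p_diabetic_vs_cancer = fisher_exact_two_tailed(8, 0, 3, 8)
print(p_diabetic_vs_cancer < 0.01)  # True, in line with the reported p < .01
```

The same function applied to the disease subgroup described earlier (0 of 23 BK versus 7 of 19 AK amputees with shortened phantoms) also gives a two-tailed probability below .01.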

In addition to cases cited above, three other 
amputees were amputated because of cancer.
An AE subject reported a retracted and bent
phantom. One AK and one BK amputee
reported no phantom sensation. 

Missile and vehicle groups. Forty-nine AK 
amputees, without circulatory disease, who 
had incurred amputation as a result of vehicle 
accidents or missile wounds, were studied.
Seven of the 13 amputees in the vehicle group
reported shortened phantoms in comparison
to only 5 of the 36 missile subjects (p < .02).
These differences might be attributed to the
greater prevalence of positive-negative affec-
tive tone in the phantoms of the missile group
(p < .01). However, when only those ampu-
tees reporting pain were considered separately,
the vehicle group had greater proportions of
shortened phantoms than the missile group
(p < .01). A comparison of the vehicle and
missile groups on phantom intensity found the
missile group reporting significantly greater
proportions of persistently strong phantom
sensation (Intensity I) than the vehicle group 
(p < .05). Apparently, the stronger phantom 
intensity experienced by the subjects in the 
missile group also accounted for the greater 
prevalence of extended phantoms in this 
group. 

Discussion 


It is proposed that the relationship between 
phantom limb length and sites of amputation 
can be understood in terms of the interaction 
between stump cues and the limb image of 
the missing distal part. Stump cues are af- 
fected by the cephalocaudad gradient, stump 
conditions, etiology, and current psychogenic 
experiences. Limb image is related to cortical- 
central representation and associated sensa- 
tions, learning factors, preamputation psy- 
chogenic experiences, and etiology. 

As regards the cephalocaudad or proximo- 
distad gradient, this principle postulates that 
physical growth, especially embryological 
growth, tends to begin in the head end and to 
progress toward the tail end and is usually
generalized to include also the tendency for
growth to proceed from the shoulder and
thigh toward fingers and toes (proximo-distad
trend) (English & English, 1958, p. 1).


Studies have consistently shown that ampu- 
tation sensitizes the stump, with the distal 
end becoming more sensitive than the proxi-
mal part. Also, proximal and distal stump
tissues are more sensitive than the homologous 
areas of the intact limb. The increased sensi- 
tivity of the amputated stump seems to pro- 
vide cues which compete with the cortically 
activated phantom hand or foot sensations. 
This discrepancy between the cues initiated 
by the sensitive stump and the image of the 
“same length” phantom activated by central 
processes is greater in the proximal amputa- 
tions (AE or AK). These amputations create 
a larger gap between stump and distal por- 
tions of the phantom than is true for the 
distal amputations (BE or BK) since more 
space exists for the phantom hand or foot to
“move” closer to the stump. 

Strong stump cues may tend to influence
the amputee to perceive a moderately strong
distal image at, or nearer to, the stump (see 
Table 4 and Figure 1). If the distal image is 
more than moderately strong, it can compete 
effectively with the stump cues, and be per- 
ceived in extended position. On the other 
hand, especially in the BK where stump cues 
are weak, if the distal image is faint or vague, 
the discrepancy and conflict between it and 
the stump cues is less pronounced and in- 
volves a lesser need for perceptual reorganiza- 
tion. The tendency to perceive the weak distal 
image at the stump is, therefore, considerably 
diminished. 

Fig. 1. Same length and telescoped phantom limbs in an AK amputee. (Panel A: Brain (1) projects
distal phantom limb image and related sensations to 1a. Relatively strong AK stump cues conflict with the
distal image sensations at 1a. “Gap” (3), or the discrepancy between stump cues and distal phantom,
encourages perceptual reorganization. Panel B: Strong distal image (4) is able to compete against stump
cues and maintain extended limb image established before amputation. Panel C: If distal image is not
sufficiently strong (5) to compete against stump cues, foot tends to be perceived nearer to stump, such as
positions 6 and 7 or higher (partly or fully telescoped phantoms). Stronger phantom sensation is associated
more with a painful or pleasurable phantom than with a neutral phantom.)

The interaction between central and stump
cue factors proceeds as follows: The longer
BE stump, lower on the cephalocaudad
gradient, is less sensitive and provides weaker
cues than that of the shorter AE. In addition, 
the BE involves a smaller gap and con- 
sequently a smaller discrepancy between 
stump and distal phantom hand. Therefore, 
the BE has a lesser tendency to be character- 
ized by shortened phantoms than the AE. 
Similar reasoning obtains in a comparison of 
the AK and the BK amputations. In this 
analysis, the proximo-distad principle relating 
to the difference in gradient sensitivity be- 
tween the UE and LE, and between proximal 
and distal amputations, is operative. 

The results show that strength and affec- 
tive tone (pleasure-pain) are related to the 
length of phantom in the LE, but not in the 
UE. It is suggested that strength and promi- 
nent affective tone produced by central proc- 
esses, reinforce the extended phantom foot 
and allow it to compete more effectively with 
the cues in the stump, thus facilitating and 
maintaining an extended phantom. This is 
truer for the LE than the UE because the 
strength of the stump cues is less in the LE 
than in the UE because of the proximo-distad 
gradient; in the preamputation state the more 
limited motility of the LE and the foot’s role 
in body support, establishes a more durable 
extended limb image than is true for the 
highly motile UE. That is, if a stable extended 
limb image is established before amputation, 
as in the lower limbs, the strong distal image 
can compete with stump cues and produce a 
same length phantom. If, as in the upper 
limbs, no stable extended limb image was 
established, then even a strong distal image 
cannot compete against the very intense stump 
cues present in the UE stump, due to the 
stump’s proximal position on the proximo- 
distad gradient.

Etiological considerations. This discussion
has related the role of stump cues to the 
proximo-distad gradient. The stump is also 
influenced, however, by disease and injury, 
which modify or accentuate the intensity of 
the cues. Studies (Selye, 1962) have shown 
that mechanical trauma or irritation, however 
slight, leads to the deposition of calcium in 
the injured soft tissues (followed by inflam- 
mation, sclerosis, and finally to hyalinization). 
The sensitizing calcium is transposed to the 
soft tissue, sometimes after a latent period,
from the adjacent bones, the skeleton, and
circulatory system. Selye has coined the term 
“calciphylaxis” for this condition of calcium 
induced tissue hypersensitivity. The reverse 
of “calciphylaxis” is desensitization or 
“anergy” which may be produced by large 
doses of some toxic and other chemical 
substances. 

As was previously pointed out, the greater
proportion of shortened phantoms in the AK
as compared to the BK amputations is primarily
attributable to the disease subgroup.
When those amputations solely involving 
mechanical trauma were considered, the AK 
and BK amputees had the same proportions 
of shortened phantoms. Circulatory problems 
apparently minimize “calciphylaxis” or mag- 
nify “anergy” so that only the greater sensi- 
tivity of the AK stump, due to its position 
on the proximo-distad gradient, is reflected in 
its larger proportions of shortened phantoms. 

In AK amputees without circulatory in- 
volvement, amputations caused by missile 
wounds were significantly associated with 
same length phantoms while the reverse was 
the case as regards the vehicle group. The 
greater prevalence of positive-negative affec- 
tive tone and strong phantom sensation in the 
missile group apparently accounts for these 
findings. Nearly all amputees in the missile 
group incurred their wounds in war combat. 
The heightened tension in soldiers, who constantly
expect death or severe injury, results
in the subsequent increase in phantom intensity
and positive-negative affective tone,
especially pain. (The prevalence of pain
in the war veterans in the University of 
California, 1952, study is relevant here.) In 
contrast, limb loss in the vehicle accident 
group was unexpected, with consequently less 
emotional involvement in the limb before its 
loss. 

The relationship between central-cortical 
and stump cue factors in affecting phantom 
length is supported by the following con- 
siderations: 

1. Nearly every diabetic seen in this sample 
and our other studies reported an extended 
phantom. Diabetics suffer from neuropathy 
(poor nerve conduction) (Lawrence & Locke, 
1961; Weiss, 1961) and their stumps are less 
sensitive, as regards tactile and pain sensation,
than those of other amputees. Diabetics also
tend to have painful phantom sensation. Thus, 
the cues which might effect a shortening of 
the phantom are diminished with the extended 
cortical image also reinforced by pain. The 
impaired or depressed perceptual functioning 
of aged amputees with diabetes and circula- 
tory disease has been described (Weiss, 1961, 
pp. 220-221). 

2. Amputees reporting cancer as amputa- 
tion etiology have a great predominance of 
shortened phantoms. These subjects were 
probably concerned with possible recurrence 
of symptoms and may therefore focus atten- 
tion on the amputation stump, thus increasing 
its cue producing sensitivity and leading to 
the great prevalence of shortened phantoms in 
these cases. The phantom limb in cancer may 
be a “dangerous reminder” resulting in either 
suppression of the phantom or its foreshorten- 
ing. Another possibility is heightened phys- 
iological sensitivity in cancerous stumps. 

3. Paraplegics with extensive spinal cord 
injury have phantoms which do not become 
shortened. Amputees who simultaneously be- 
come paraplegics also retain their extended 
phantoms. The retracted phantoms of amputees
do not continue to shorten if the amputees
later become paraplegics (Bors,
1951). Apparently, the stump loses its sensi- 
tivity after extensive spinal cord injury and 
cannot provide the cues which tend to retract 
the phantom. 

4. In Haber's (1954, 1956) study, the
stumps of 12 amputees with fully telescoped
phantoms were significantly more sensitive on
the tactile measure of point localization than
the 10 subjects with retracted, and the 2 with
same length phantoms (p < .01). This finding
points to a relationship between stump sensitivity,
as regards the measure of point localization,
and complete phantom telescoping.


Subsample Differences 


The tendency for non-Caucasians in our
sample to experience more extended phantoms
may be due to etiological factors such as the
prevalence of circulatory disease in these individuals
of lower socioeconomic status. The
lower intelligence and educational level of this
subsample may also make them less aware of
cues in their stumps.
CRITIQUE AND NOTES


INFLUENCE OF THE STIMULUS OBJECT UPON THE COMPLEMENTARY
AND SUPPLEMENTARY PROJECTION OF FEAR 1


SEYMOUR FESHBACH 2 AND NORMA FESHBACH 3


University of Pennsylvania 


2 studies were conducted bearing on the hypothesis that frightened boys
attribute malice to male adults but attribute fear to other boys. In the 1st
experiment, a Halloween setting was used to elicit fear. The experimental
group, in comparison to a matched control group, manifested a reliable increment
in attribution of maliciousness to pictures of men and boys but no
significant change in attribution of fear. Because of possible complications due
to hostility arousal, a 2nd experiment was carried out in which judgments
were elicited from boys who thought they were going to participate in a
study involving frightening equipment. Under these conditions, the experimental
Ss attributed reliably more maliciousness to men and reliably more
fear to boys.


Clinical observation as well as experimental 
studies (Feshbach & Singer, 1957; Murray, 1933; 
Singer & Feshbach, 1962) indicate that a fright- 
ened individual tends to distort his judgments 
of other people. The factors that determine the 
degree and type of distortion that will take place 
have yet to be specified. The present study pro- 
poses to investigate the influence of the stimulus 
person upon the projective process under condi- 
tions in which fear has been aroused and in which 
social interaction between the perceiver and the
stimulus is minimized.

At least two different kinds of changes in social 
judgment have been observed in response to fear 
arousal. Murray (1933), in his pioneer study, 
found that frightened girls perceived pictures of 
adults, primarily male, as malicious and threaten- 
ing. Murray described this effect as "comple- 
mentary projection" to distinguish it from the 
more classic form of "supplementary projection" 
in which characteristics of the perceiver are 
directly attributed to the stimulus. In a study 
by Feshbach and Singer (1957), using college 
males who were subjected to electric shocks as 
the subjects and a film of an adult male taking 
some psychological tests as the stimulus, evidence 
of supplementary as well as complementary projection
was obtained; that is, under conditions
of fear arousal, the subjects tended to attribute
both fear and hostility to the stimulus person.


1 This study was supported by Research Grant
M3213 from the National Institute of Mental Health,
United States Public Health Service. The authors
wish to thank Samuel Cameron and Louise Irwin
for their assistance in carrying out these experiments.

2 Currently a Public Health Research Fellow at
Stanford University.

3 Currently a Visiting Lecturer at University of
California, Berkeley.


Although the above cited studies are concerned 
with the problem of projection, the processes 
involved are probably quite distinct from that 
mediating psychoanalytic projection (Murstein, 
1959). In the case of the latter, the attributed 
trait is unacceptable and the function of the 
projection is the reduction of anxiety. In the
case of the fear arousal studies, one appears to 
be dealing with a process which is more akin 
to stimulus generalization in that the perceiver 
generalizes from his own state to that of the 
person being judged. In terms of this process 
which has been previously labeled “infusion” 
(Feshbach & Singer, 1957), the degree of 
“projection” will be a function of the degree of 
similarity between the perceiver and the stimulus 
person. From this point of view, egocentricity of 
the perceiver or the extent to which he is pre- 
occupied with his own feelings should exaggerate 
the influence of his feelings upon his judgment. 
While these assumptions can account for the 
phenomenon of supplementary projection, they 
do not explain the attribution of maliciousness 
found by Murray (1933) under conditions of 
fear arousal. A comparison of the differences 
between the Murray and the Feshbach and Singer 
studies provides some suggestions as to the 
conditions which elicit a particular type of 
“projection.” There are a number of differences 
in procedure and subject population between 
these two experiments which may have been 
responsible for the differences in results including 
the very obvious fact that Murray had not obtained
ratings of the degree to which the stimuli
appeared anxious or fearful. However, we suggest 
that even if Murray had obtained judgments of 
fearfulness, changes in the ratings of this 
dimension would have been less marked than 
the observed changes in the ratings of malicious- 
ness, because the children judged only adults. 
It is the principal contention of this paper that a 
critical variable determining the kind of distor- 
tion in judgment that is likely to take place under 
conditions of fear arousal is the relationship 
between the perceiver and the stimulus person. 

More specifically, a frightened child who is 
observing an adult, in the absence of information 
about the adult, is likely to attribute to him 
those feelings and attitudes which adults dis- 
played in previous situations in which he had 
been frightened. If the past experience of fear 
has been associated with a punitive adult, then, 
under subsequent conditions of fear arousal, the 
child will tend to attribute maliciousness to 
similar adults; that is, he will manifest comple- 
mentary projection. However, when a child judges 
another child, the psychological situation is 
markedly different. The similarity of the stimulus 
person to the perceiver is likely to foster the 
direct attribution of the affect of the perceiver 
to the stimulus. Also, the variable of past experi- 
ence, although less clear in its consequences than 
in the case of interaction with adults, would 
probably facilitate the direct attribution of the 
child’s feelings to the stimulus child; that is, 
when children judge others who are similar to 
them in age and sex, supplementary projection 
should be manifested. 

On the basis of these considerations, it is 
hypothesized that frightened boys tend to per- 
ceive other boys as fearful and tend to perceive 
adults, particularly males, as malicious. The pre- 
diction regarding adult males is based upon 
Murray’s previous findings and upon the assump- 
tion that, in general, boys in our culture are 
likely to be more frightened of male figures,
particularly their fathers, than of female figures.
This assumption receives some support from 
Kagan’s (1956) finding that 6-10 year old 
children view their mothers as friendlier and 
less punitive than their fathers. 


EXPERIMENT I 
Method 


The first experiment to be described which bears 
upon these hypotheses is essentially a replication 
of various features of Murray’s (1933) original study 
with some important modifications. Ten neighbor- 
hood boys, ranging in age from 9 to 12, were invited 
to a Halloween party at the home of the experi- 
menters. The parents of the children had previously 
been informed of the purpose of this “party” and
had consented to their children participating, being 
reassured by the fact that the experimenters’ son was 
also to be a subject in the study. 

When the boys arrived, they found themselves in 
a completely darkened house except for one room 
which was illuminated by a jack-o’-lantern. The 
eeriness of the atmosphere was further enhanced by 
having the children form a circle and relate ghost 
stories to each other. The second author sat in the 
center of the circle and initiated the stories. (Al- 
though the diameter of the circle was about 11 feet 
at the beginning of the story telling, by the time 
the last ghost story was completed, it had been 
spontaneously reduced to approximately 3 feet.) 
After the ghost stories, she introduced the boys to 
a game called “murder” in which they were required 
to hide alone and conceal themselves from the indi- 
vidual who, without their knowledge, was designated 
as the murderer. Immediately after he was caught, 
the child was taken to the “testing” room where 
he was presented a series of 16 pictorial stimuli 
consisting of an equal number of men, women, boys, 
and girls, arranged in a random sequence with the 
restriction that no two pictures within the same 
age-sex category should follow each other. Three 
such random sequences were employed. The pictures 
were first rated for fearfulness and then for ma- 
liciousness. Each rating scale consisted of seven 
alternatives which in the case of fearfulness ranged 
from “fearless” to “extremely scared” and in the 
case of maliciousness ranged from “extremely good 
and kind” to “extremely bad and cruel.” 

A control group of 10 boys was obtained from the 
same school which the experimental subjects at- 
tended. The controls and experimental subjects were 
matched on the basis of age, intelligence, school 
performance, religion, and race. Each control subject 
was seen individually and after rapport was estab- 
lished, the subject rated the standard set of stimuli. 
Both control and experimental subjects were subse- 
quently retested under relaxed conditions with the 
same stimuli after an interval of 4 weeks. 


Results 


Scores ranging from one to seven were assigned 
to each judgment of a photograph, a higher score 
reflecting a higher rating of maliciousness or 
fearfulness, according to the scale being used. 
Ratings of a particular characteristic for each 
set of four photographs within an age-sex 
grouping were separately summed for each child.
For every subject, then, judgments of men, 
women, boys, and girls were determined for 
each of the two testing sessions. Change scores 
were obtained by subtracting the second set of 
ratings from the corresponding initial set so that 
for the experimental group, for example, a posi- 
tive score reflects a greater degree of attribution 
of the trait or affect in question under conditions 
of fear arousal than under relaxed conditions. 


TABLE 1
MEAN CHANGE SCORES IN ATTRIBUTION OF MALICIOUSNESS AND FEARFULNESS AS A FUNCTION OF
EXPERIMENTAL TREATMENT AND SEX AND AGE OF THE STIMULUS
(Experiment I)

                Attribution of maliciousness                          Attribution of fear
Group           Boys          Men     Girls         Women             Boys          Men           Girls         Women
Experimental    [illegible]   +2.6    [illegible]   -1.0              [illegible]   [illegible]   [illegible]   [illegible]
Control         -2.7          -2.3    -2.3          -3.5              [illegible]   [illegible]   [illegible]   [illegible]
p               <.01, <.02, <.10, <.10 [column positions illegible]


Mean changes in attribution of maliciousness and
fear by the experimental and control groups are
reported in Table 1. With respect to attribution 
of fear, although the experimental group displays 
larger positive increments than does the control 
group, the differences are not significant. How- 
ever, there are marked, statistically significant 
differences in the attribution of maliciousness. 
The increments in perception of maliciousness are 
significantly greater for the experimental than 
for the control group in response to male stimuli, 
but are not significant for the judgments of girls 
and women. 
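
The scoring procedure just described can be illustrated with a short sketch. It is not part of the original study: the function names and the sample ratings are hypothetical, and the only details taken from the text are the 1 to 7 scale, the four photographs per age-sex category, and the convention that a change score is the initial (fear arousal) sum minus the retest (relaxed) sum.

```python
from typing import Dict, List

CATEGORIES = ("boys", "men", "girls", "women")
PHOTOS_PER_CATEGORY = 4  # four photographs per age-sex category, as in the text


def category_sums(ratings: Dict[str, List[int]]) -> Dict[str, int]:
    """Sum one subject's 1-7 ratings within each age-sex category."""
    sums = {}
    for category in CATEGORIES:
        scores = ratings[category]
        if len(scores) != PHOTOS_PER_CATEGORY:
            raise ValueError(f"expected {PHOTOS_PER_CATEGORY} ratings for {category}")
        if any(score < 1 or score > 7 for score in scores):
            raise ValueError(f"ratings for {category} must lie between 1 and 7")
        sums[category] = sum(scores)
    return sums


def change_scores(initial: Dict[str, List[int]],
                  retest: Dict[str, List[int]]) -> Dict[str, int]:
    """Initial-session sum minus retest sum, per category.

    A positive value means more of the trait was attributed in the
    initial session than in the later, relaxed retest.
    """
    first = category_sums(initial)
    second = category_sums(retest)
    return {category: first[category] - second[category] for category in CATEGORIES}


# Hypothetical maliciousness ratings for one subject (not data from the study).
initial_session = {"boys": [5, 4, 5, 4], "men": [6, 5, 6, 5],
                   "girls": [3, 3, 4, 3], "women": [3, 4, 3, 3]}
relaxed_retest = {"boys": [4, 4, 4, 4], "men": [4, 5, 4, 4],
                  "girls": [3, 3, 4, 3], "women": [4, 4, 3, 3]}

print(change_scores(initial_session, relaxed_retest))
```

For this invented subject the change scores are +2 for boys, +5 for men, 0 for girls, and -1 for women; the group means of such scores are what a table of mean change scores would report.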

The data then provide evidence of comple- 
mentary projection only and fail to support the 
prediction of supplementary projection when
judging similar age boys. In addition, the results 
are not consistent with a simple model of com- 
plementary projection since distortions of this 
type were evidenced in judgments of boys as 
well as adult males. The data can be accounted 
for within the proposed theoretical framework 
if one assumes that the experimental situation 
aroused hostile impulses as well as fear. The 
telling of ghost stories and the murder game 
may well have aroused aggressive fantasies. As
an incidental but pertinent note, the boys were 
rather aggressive and unrestrained when dessert 
was served following completion of the experiment.
(The experimenters, at least, interpreted as
aggressive the use of the white dining room walls 
as targets for chocolate ice cream balls fired 
from improvised sling shots.) The attribution 
of malice to the male stimuli may then have been 
a function of generalization of hostile impulses 
as well as, in the case of the adult males, the 
redintegration of negative percepts associated 
with previous fear arousal situations. 


EXPERIMENT II 
Method 


* All p values presented in the tables are based on a two-tailed Mann-Whitney U test.
Dash entry indicates p value greater than .10.


Because of the ambiguous nature of the findings
in this initial study, it was felt desirable to obtain
similar data but under conditions in which fear
arousal would be maximized and hostility arousal 
minimized. This second study was conducted in a 
YMCA setting. Twenty-seven volunteers from several 
boys’ clubs were assigned at random to an experi- 
mental or control condition. These boys ranged in 
age from 8 to 12. The subjects in the experimental 
or fear arousal condition entered a room in which 
in one corner was situated a black stool to which 
were attached a number of metallic extensions. 
Several wires led from the stool to an adjacent black 
box which had elaborate dials and a series of 
flickering colored lights. A long hypodermic needle 
placed on a white napkin near the box was also 
visible to the subjects. Midway in the experiment, 
a male assistant dressed in a white coat entered the 
room and conspicuously filled the hypodermic with 
a milky white fluid. The experimental subjects were 
initially told by the senior author that there were 
two phases to the experiment, the first consisting of 
judgments of a series of pictures and the second 
having “something to do with the equipment in the 
corner.” As in the first study, the subjects were 
given a series of stimuli to rate on scales of fear- 
fulness and then maliciousness, except that in this 
second experiment, the photographs were projected
onto a screen. The photographs used in this second 
study were the same as those initially employed 
except that a new, male adult photograph was sub- 
stituted for one used in the Halloween study. There 
was an additional variation in procedure: before 
either of the ratings was made, the subjects were 
asked to select an adjective from the following list
(relaxed, mean, pleased, scared, kind, unhappy) which
best fitted the person in the picture. Thus, the photographs
were presented three times: once for the adjective
description, once for the ratings of
fearfulness, and finally once more for the ratings of
maliciousness. It is important to note that the judgments
of the experimental group were elicited while
the fear arousing stimuli were in clear view. The 
“second part” of the experiment involving these 
stimuli was, of course, never carried out and a party 
was substituted instead. The control group followed 
the same procedure as the experimental group except 
for the absence of the fear inducing objects that 
were present in the experimental room.


TABLE 2
A COMPARISON OF EXPERIMENTAL AND CONTROL MEAN RATINGS OF FEARFULNESS AND
MALICIOUSNESS AS A FUNCTION OF SEX AND AGE OF THE STIMULUS
(Experiment II)

                Attribution of maliciousness      Attribution of fear
Group           Boys    Men     Girls   Women     Boys    Men     Girls   Women
Experimental    19.0    17.6    13.2    14.1      14.8    12.1    10.8    15.1
Control         18.0    13.4    12.4    13.5      11.6    10.1     9.1    12.0
p *             -       <.01    -       -         .053    -       -       <.10


Results


The ratings obtained in the YMCA study were
summed and analyzed according to the age and
sex of the stimulus person. The pertinent data
are presented in Table 2. These results are much
more consistent with the experimental hypotheses
than was the case for the Halloween data. As
Table 2 indicates, the experimental subjects attributed
significantly more maliciousness to the
adult male photographs than did the control subjects.
The attribution of maliciousness to adult
males is quite specific since in no other instance
did the differences in ratings of maliciousness
approach statistical significance. One notes that
under control conditions, a markedly greater
degree of malice was attributed to boys than
to any of the other stimuli. While it is possible
that the absence of an experimental effect in the
judgment of boys is due to these high ratings,
the fact that the Halloween subjects, who also
attributed greater maliciousness to boys under
control conditions, manifested a reliable increment
under experimental conditions argues against this
possibility.

When the fearfulness ratings are examined, a
quite different pattern emerges. The experimental
group perceives boys as more frightened, the
p value of the difference being .053. In addition,
the experimental subjects tend to attribute more
fear to women than do the controls, the difference
being significant at the .10 level.


TABLE 3
A COMPARISON OF MEAN FREQUENCIES OF ATTRIBUTION OF “SCARED” AND “UNHAPPY”
DESCRIPTIONS TO STIMULI OF VARYING AGE AND SEX BY THE EXPERIMENTAL
AND CONTROL GROUPS
(Experiment II)

Group           Boys    Men     Girls   Women
Experimental    2.4     1.2     1.2     2.4
Control         1.4     1.1     1.5     1.6
p               <.05    -       -       -


Inspection of the adjective choice data revealed
that the only substantial differences occurred in
the attribution of the adjectives "scared" and 
"unhappy" to boys. The experimental subjects 
used these adjectives significantly more fre- 
quently than did the controls, a finding which 
supports the difference obtained in the direct 
ratings of fearfulness of boys. The mean fre- 
quency with which "scared" and "unhappy" were 
employed for the various age and sex groups is 
presented in Table 3. Although the difference is 
unreliable, one notes that the experimental sub- 
jects tend to attribute more of these same 
adjectives to women than do the controls, a trend 
consistent with the rating results. 
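
The report states that its p values are based on a two-tailed Mann-Whitney U test. A minimal sketch of that statistic follows. It is an illustration, not the authors' computation: the function names are hypothetical, the example scores are invented, and the p value uses the large-sample normal approximation without tie or continuity correction, which is an assumption made here for brevity (with groups this small, exact critical values would ordinarily be consulted).

```python
import math
from typing import List, Sequence, Tuple


def midranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_u(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Return (U, approximate two-tailed p) for two independent samples.

    U is the smaller of the two U statistics. The p value comes from the
    normal approximation (assumption: no tie or continuity correction).
    """
    n1, n2 = len(x), len(y)
    ranks = midranks(list(x) + list(y))
    rank_sum_x = sum(ranks[:n1])
    u_x = rank_sum_x - n1 * (n1 + 1) / 2
    u_y = n1 * n2 - u_x
    u = min(u_x, u_y)
    mean_u = n1 * n2 / 2
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean_u) / sd_u
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2))
    return u, p_two_tailed


# Invented summed ratings for two small groups (not data from the study).
experimental = [19, 21, 17, 22, 20]
control = [14, 16, 13, 18, 15]
u_value, p_value = mann_whitney_u(experimental, control)
print(u_value, round(p_value, 3))
```

Because the test uses only the rank order of the pooled scores, it suits summed rating-scale data of this kind, where equal intervals between scale points cannot be assumed.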


Discussion 


The data of the YMCA study conform rather 
well to the experimental predictions. Fear arousal 
is seen to result in qualitatively different dis- 
tortions depending upon characteristics of the 
stimulus being judged. Complementary projec- 
tion is observed in the judgments of adult males; 
that is, the frightened boys tended to perceive 
men as malicious. Supplementary projection is 
observed in the judgments of the young males 
similar in age to the perceiver; that is, the 
frightened boys tended to perceive other boys as 
similarly frightened and unhappy. The greater 
attribution of fear to boys was reflected by both 
the adjective choice measure and the ratings of 
fearfulness. Significant differences in the attribu- 
tion of malice to men were obtained for ratings 
of maliciousness but not for adjective choice. 
The lack of sensitivity of the adjective choice 
measure in the latter instance may have been 
due to the fact that only one adjective (“mean”)
out of the six provided could be used as an
index of complementary projection while two
adjectives (“scared” and “unhappy”) were available
as indices of supplementary projection, thus
providing a more reliable datum. 

Although an attempt was made in the second 
experiment to maximize fear arousal and reduce 
hostility arousal, the conditions for arousing fear 
are so similar to the conditions under which
hostility is elicited that one cannot be certain
that the boys, in addition to being frightened, 
were not also aggressive. However, the possibility 
that the complementary projection observed was 
actually a case of supplementary projection— 
namely, the boys directly attributing their ag- 
gressive impulses to the adult male figures—seems 
unlikely for the following reason. If supple- 
mentary projection of aggressive impulses was 
the essential process, then one might have 
expected the attribution of maliciousness to boys 
as well as to men. In contrast to the results 
from the Halloween study, this effect did not take 
place. The results of the YMCA study are more
readily accounted for by the predicted interaction 
of fear arousal with the type of stimulus person 
being judged. 

The interpretation of this interaction which
is offered here stresses past rather than current 
sources of threat. The latter may also be a 
relevant variable and, under some circumstances, 
could override the influence of past experience. 
A child recently frightened by a very hostile 
female may tend to generalize and attribute 
malice to other females rather than to males.
One could argue that the perception of male 
adults as malicious by the experimental subjects 
in the YMCA experiment was due to the fact 
that while a female associate was present during 
part of the experimental session, a male acted 
as the primary experimenter. The force of this 
argument is weakened when one considers that 
the primary experimenter in the Halloween 
experiment was female and yet there was an 
increase in attribution of maliciousness to males 
and not to females.

Still another interpretation of the data could 
be offered in terms of Heider’s (1958) balance
theory. From this point of view, the primary 
mechanism would be the need to maintain 
cognitive consistency. The child’s perception of
other children and adults balances, as it were, his
perception of his own emotional state. Since 
what constitutes an appropriate match for the 
child's affective state may well depend upon past 
experience, the implications drawn from balance
theory could be quite similar to those based upon
the redintegration hypothesis.


4 The issue could be readily resolved by repeating
the YMCA study, using a female as the primary
experimenter. However, it is the opinion of the
present authors that the procedure used in the YMCA
study may be too disturbing for some children. The
authors feel rather fortunate in that they were readily
able to reduce the anxiety of two boys who were
quite upset by the experimental situation. In investigating
future problems in this area, one could take
advantage of natural fear producing situations such
as children awaiting a polio shot or some form of
surgery.

The tendency of the frightened boys to at- 
tribute a greater degree of fear to women, while 
never reaching the .05 level, seems sufficiently 
consistent to warrant some comment. It may be 
that women, particularly mothers, are more likely 
than men to have occasion in which they share 
dysphoric feelings of their children; that is, in 
the past, when a child has been frightened and 
upset he has perceived similar affects in his 
mother. If the present analysis is correct, it 
should be possible by obtaining prior information 
as to the perception by the child of his parents— 
whether he perceives his mother as more sym- 
pathetic than his father, whether he views his 
father as a more severe punitive agent than his 
mother—to predict more precisely the specific 
kind of distortion that a particular child will 
manifest when he is frightened. 

In conclusion, perhaps the most significant 
implication that one can draw from this study 
is that the influence of fear upon the cognitive 
functions involved in judging the motives and 
feelings of other people is specific and ordered,
rather than pervasive and erratic. Under conditions
of fear arousal the perception of only
certain classes of people reflects the influence of
complementary and supplementary projection. 
The subjects' perception of girls, for instance, 
in both the Halloween and YMCA studies was 
unaffected by the experimental conditions. These 
results point up the importance of the role of 
the stimulus in determining the influence of sub- 
jective factors upon social judgments. The sub- 
ject’s judgment represents the outcome of the 
interaction of so-called subjective and objective 
variables and both must be considered in order 
to adequately predict his behavior. 
CULTURAL INFLUENCE ON CONSCIENCE DEVELOPMENT:
RESISTANCE TO TEMPTATION AND GUILT AMONG SAMOANS
AND AMERICAN CAUCASIANS 1


ROBERT E. GRINDER 2 AND ROBERT E. McMICHAEL


University of Wisconsin     Michigan State University


In this study (a) the socialization effects of a shame and a guilt culture on
conscience development are compared with Samoan and American Caucasian 
children and (b) the relationships between resistance to temptation and guilt 
are investigated. Inferences about conscience strength were made from behavior 
in a real-life temptation situation and from responses to projective story comple- 
tion items depicting transgressions. The results show that the Samoan Ss were 
significantly less likely than the mainland Ss to resist temptation and to show 
susceptibility toward remorse, confession, and restitution after transgression. 
The projective measures of guilt are shown to be highly interrelated but
inconsistently related to temptation behavior.


The methods of social control relied upon by 
societies often have been classified as either 
“shame” or “guilt” (Benedict, 1946; Mead, 
1955; Piers & Singer, 1953), and in essence the 
present study is an exploratory attempt to as- 
sess the relative effects of a shame and a guilt 
culture on conscience development. Shame cultures
generally stress external agents who in
various ways coerce, threaten, or otherwise arouse 
objective fear. On the other hand, guilt cultures 
emphasize both self-control in the face of temp- 
tation and self-initiated responsibility for one’s 
actions if transgression should occur. If conscience
be defined in terms of internalized controls,
individuals in general would be expected
to develop weak consciences in a shame oriented 
culture, and conversely, strong consciences in a 
guilt oriented culture. The investigation was 
conducted in Hawaii where Samoan and American
Caucasian 3 children could be compared
readily. Inferences about conscience strength
were made from behavior in a real-life temptation
situation (Grinder, 1961) and from responses
to projective story completion items
depicting transgressions (Allinsmith, 1960).


1 A portion of this paper was presented at the
1962 annual meetings of the American Psychological
Association.

2 This study was carried out while the senior
author was a member of the staff of the University
of Hawaii and the junior author was affiliated with
the Division of Mental Retardation, Honolulu.

3 The term “American Caucasian,” as used here,
refers to Caucasian children from the mainland
United States.


On the basis of anthropological observations,
Mead (1955) has attempted to show that the
Samoans depend upon external sanctions entirely,
and that in Samoa, consequently, "there is no
room for guilt" (p. 655).4 Presumably, among
other factors, the Samoan child would be ex- 
pected to develop a weak conscience because of 
his parents’ practices of living in an extended 
family, delegating their disciplinary roles to the 
child’s older siblings, and stressing age-graded 
socialization sanctions that rely upon fear of 
external reprobation as a major mechanism of 
social control (Pierce, 1956). In contrast, a 
mainland child ought to develop a stronger con- 
science by the greater severity of his parents’ 
socialization practices (Whiting & Child, 1953) 
and by their practices of maintaining exclusive 
family living arrangements, demanding that he 
account for his transgressions to them personally, 
and expecting adherence to adult standards of 
conduct as the primary goals of socialization 
(Whiting, 1959). Accordingly, in comparisons 
between Samoan and American Caucasian chil- 
dren on degree of conscience development, the 
Samoan families’ stress of external sanctions 
(shame) and the American families’ emphasis 
upon internalized sanctions (guilt) lead to the 
expectation that a greater proportion of Samoan
children should demonstrate weak consciences.

⁴ Although in describing cultures the term “guilt”
denotes aspects of conscience including both resistance
to temptation before transgression and the
capacity to experience guilt after transgression, in
describing an individual's conscience in the present
study, the term will refer only to behavior after
transgression.

The present study also aims to test a social
learning hypothesis concerning resistance to
temptation and guilt after transgression. One
possible interpretation of the relationship between
the two attributes of conscience is that
they may be independent. Children whose parents 
concentrate all their attention upon breaking up 
approach behavior should learn resistance to 
temptation but not guilt, and children whose 
parents ignore approach behavior but punish 
transgressions by demanding restorative behaviors 
should learn greater capacity for guilt than self-control.
The practical exigencies of socialization
suggest, however, that the two measures should 
be positively related. The indeterminateness 
associated with the contingencies of reinforce- 
ment in child rearing probably prevents the child 
from learning one of the behavior patterns to 
the exclusion of the other. Further, once guilt 
is learned, anxiety anticipatory of guilt should 
serve as a cue for resistance to temptation. In 
a variety of studies (Allinsmith, 1960; Grinder, 
1962; MacKinnon, 1938; Rebelsky, 1961), con- 
ducted with relatively homogeneous samples, 
positive relationships between resistance to temp- 
tation and guilt have been reported. The pre- 
diction follows that these results should be cor- 
roborated in the cross-cultural context of this 
investigation.
METHOD 


Subjects 


Subjects were 34 sixth- and seventh-grade pupils, 
15 American Caucasians (4 boys, 11 girls) and 19 
Samoans (12 boys, 7 girls), enrolled in a rural, 
public school on the northeast coast of Oahu, 
Hawaii. The community in which they lived was 
comprised of approximately 300 Samoans, 100 Cau- 
casians, and 600 Hawaiians, Japanese, Chinese, etc. 

Employed chiefly by local sugar and pineapple 
plantations, the Samoan fathers were either unskilled 
or semiskilled laborers (Pierce, 1956). Most of the 
Samoan subjects were born in American Samoa; 
all had been in Hawaii since 1952; after arriving, 
few had traveled outside the community. Although 
the Samoan subjects could speak English, they 
restricted their use of it to classroom recitation. 
The number of siblings in each of their families 
ranged from 3 to 15; the median was approximately 8.

The Caucasian fathers were employed in various 
supervisory and administrative positions by the 
plantations or by a Protestant church, which main- 
tained regional offices and a small college in the 
community. All of the Caucasian subjects were 
born on the United States mainland. Because most 
of the families were temporary residents, few had 
been in Hawaii for longer than a few years. The 
number of children in each of the Caucasian families 
ranged from two to seven siblings; the median was
three.


Measures 


The apparatus for the assessment of resistance to 
temptation has been described in detail elsewhere 
(Grinder, 1961). Subjects operated a “ray-gun”
shooting gallery individually in a realistic game 
situation. Seated 7 feet from a target box, subjects 
were asked to shoot the ray-gun pistol 20 times 
at a rotating rocket. With each pull of the ray-gun 
trigger, prearranged scores from zero to five were 
registered by score lights, also housed in the target 
box. High scores were rewarded with either a 
marksman, sharpshooter, or expert badge (cf.
Procedure below). Subjects cumulated their scores 
on a paper score sheet. 

Subjects were judged to have resisted temptation 
if the scores recorded on their score sheets indicated 
that they had not earned a badge. They were 
judged to have yielded to temptation if their score 
sheet showed that they had falsified their scores in 
order to earn one of the badges. 

Measures of guilt were obtained from five story 
completion items patterned after those developed 
by Allinsmith (1960) and Allinsmith and Grinder 
(Rebelsky, 1961). In each story the protagonist 
violated socially expected behavior patterns of either 
honesty, trustworthiness, or self-control. The trans- 
gressions were directed against peers or adults. Ap- 
propriateness of the stimuli, vocabulary, and im- 
plied age of the protagonist were controlled, and 
the themes were drawn from experiences that boys 
and girls of both ethnic groups had in common.
Each story explicitly ruled out the danger of detec- 
tion. By using either a boy’s or girl’s name and 
changing personal pronouns, each story was adapted 
for both sexes.

Allinsmith’s (1960) analyses, which employed free 
response items, revealed that subjects were very 
likely to depict remorse, confession, and restitution 
in their story responses; therefore, it was decided 
that the evaluations of guilt to be made in this 
study should be based upon these dimensions. In
order to insure that every subject would be com- 
pared in terms of each dimension, subjects were 
given multiple-choice items rather than free response 
items. Three of the five stories were provided with 
three multiple-choice items, which assessed remorse, 
confession, and restitution, respectively. In the two 
other stories the transgressions were irreparable, 
i.e., the protagonist either violated a verbal prohibition
or betrayed a trust; thus, on these, assessment
of restitution was precluded and only two items 
were provided. Each multiple-choice item contained 
three alternatives, which were weighted either one, 
two, or three points in terms of the degree to which 
the alternative represented a guilt response. A sub- 
ject’s susceptibility to guilt was inferred from the 
alternatives he chose. His responses of remorse, 
confession, and restitution were summed across 
stories separately to obtain three measures of his 
disposition toward guilt. On the five stories then, 
scores ranged from 5 (low) to 15 (high) for remorse
and confession and from 3 (low) to 9 (high)
for restitution. 
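
The scoring scheme can be made concrete with a short sketch. The layout follows the description above (five stories, restitution items on only three of them, alternatives weighted one to three points). The names `scale_totals` and `make_subject` and the uniform response patterns are hypothetical illustrations, not the study's materials or data.

```python
# Hypothetical sketch of the guilt-scale scoring described above.
# Each story is a dict mapping a dimension to the weight (1-3) of the
# alternative the subject chose for that dimension.
DIMENSIONS = ("remorse", "confession", "restitution")


def scale_totals(stories):
    """Sum the chosen weights across stories, separately per dimension."""
    totals = {d: 0 for d in DIMENSIONS}
    for story in stories:
        for dimension, weight in story.items():
            if weight not in (1, 2, 3):
                raise ValueError("weights must be 1, 2, or 3")
            totals[dimension] += weight
    return totals


def make_subject(weight):
    """Responses of an imaginary subject who always picks the same weight."""
    # Three stories carry all three items.
    reparable = [{d: weight for d in DIMENSIONS} for _ in range(3)]
    # Two stories with irreparable transgressions carry no restitution item.
    irreparable = [{"remorse": weight, "confession": weight} for _ in range(2)]
    return reparable + irreparable


lowest = scale_totals(make_subject(1))
highest = scale_totals(make_subject(3))
```

Under these assumptions the extreme response patterns give the limits stated in the text: 5 to 15 for remorse and confession, and 3 to 9 for restitution.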


Procedure 


The behavioral test of temptation was admin- 
istered in accordance with procedures developed 
previously (Grinder, 1962). Only Samoans and Caucasians
played the game; subjects were randomly
drawn from class by sex and ethnic background.
About 10 subjects at a time were shown how to
play the game by the first experimenter (E₁). They
were instructed to play the game for 20 shots and
were told that if they obtained a total of 35 points
they would be awarded a marksman badge; 40
points, a sharpshooter badge; and 45 points or
more, an expert badge. However, only 32 points
appeared on the score lights during the 20-shot
sequence. To win a badge, therefore, it was necessary
to falsify scores. After the group had been returned
to the classroom, each subject was called from the
class individually, led to the experimental room,
and at the door was told that E₁ was busy working
on some things down the hall. The subject was
asked to play the game alone and to bring his
score sheet to E₁ when he finished.
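
The logic of the temptation test can be sketched briefly. The badge thresholds and the 32-point ceiling are taken from the procedure above; the function names are hypothetical. Following the stated criterion, a subject is counted as yielding only when the reported total earns a badge, so a reported total of 33 or 34 would still be classed as resisting under this reading.

```python
# Badge thresholds as announced to the subjects, highest first.
BADGES = [(45, "expert"), (40, "sharpshooter"), (35, "marksman")]
# Total that actually appeared on the score lights over 20 shots.
MAX_HONEST_TOTAL = 32


def badge_for(reported_total):
    """Return the badge earned by a reported total, or None."""
    for threshold, badge in BADGES:
        if reported_total >= threshold:
            return badge
    return None


def classify(reported_total):
    """'yield' if the reported total earns a badge, otherwise 'resist'."""
    return "yield" if badge_for(reported_total) is not None else "resist"
```

Because the honest total falls below the lowest threshold, any badge-earning score sheet implies falsification, which is what allows yielding to be inferred from the sheet alone.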

Two weeks after the presentation of the shooting
gallery, a second experimenter (E₂) administered
the story completion items to the subjects in their
classrooms. The subjects were unaware that the
game and stories were related. The subjects were
asked by E₂ to pretend that they were the boy or
girl in each of the stories, and were shown on the
chalkboard exactly how to choose between the
multiple-choice alternatives. Stories were read out
loud while the subjects read them silently. After
reading a story, E₂ read each of the multiple-choice
items for that story, pausing between them so that
subjects could respond. In reading the stories, E₂
read alternately a boy's or girl's name. Stories and
alternatives were read only once.


RESULTS 


Examination of the sex distribution in relation
to resistance to temptation on the behavioral
measure and to responses of guilt on the projective
stories revealed no differences approaching
statistical significance; consequently, the male
and female data were treated collectively in the
analyses that follow.

Table 1 lists the obtained intercorrelations between
remorse, confession, and restitution and,
as can be seen, substantial relationships were
found. Although the correlation coefficients are
what would be expected if the guilt scales were
measuring a simple response set, the findings also
are consistent with the interpretation that responses
of guilt may be highly interrelated, even
though they may be manifested in several distinct
ways. The data do not justify treating guilt
as a general factor, however, inasmuch as
analyses with a summary guilt scale might mask
high relationships with one or two of the measures
and a complete lack of relationship with the
other. Therefore, in subsequent analyses the
three guilt measures were treated separately.


TABLE 1

INTERCORRELATIONS BETWEEN THE
THREE GUILT SCALES

              Confession     Restitution
Remorse       [illegible]    .69**
Confession                   [illegible]

Note.—Correlation coefficients are Spearman rho's, corrected
for ties.
** p < .001, two-tailed.


TABLE 2

MEAN GUILT SCORES FOR THE SUBJECTS GROUPED
BY ETHNIC BACKGROUND AND DEGREE
OF RESISTANCE TO TEMPTATION

                                             Guilt scale
Condition           Groupingᵃ         Remorse   Confession   Restitution
Ethnic background   Caucasian (15)    13.67     13.53        8.53
                    Samoan (19)       11.58*    11.21*       6.79*
Temptation test     Resist (7)        14.43     13.43        8.29
                    Yield (27)        12.00*    11.93        7.37

Note.—Ns in parentheses.
ᵃ Differences between the Caucasian and Samoan groups and
between the resist and yield groups were tested with the Mann-Whitney
U test, corrected for ties.
* p < .05.

The expectation that a higher proportion of 
Samoan than American Caucasian children would 
manifest relatively weak consciences is strongly 
supported by the findings. Among the Samoans 
none of the 19 subjects resisted temptation, but 
among the Caucasians 7 of the 15 subjects 
resisted temptation. Application of Fisher’s exact 
test for a 2 × 2 contingency table to these
temptation behavior data resulted in p < .01. The
raw score means of remorse, confession, and 
restitution are listed for both ethnic groups in 
the top half of Table 2. Because the latter three 
distributions were extremely skewed and the 
groups were small, analyses were conducted using 
a two-tailed Mann-Whitney U test, corrected for 
ties. Although on each of the guilt scales the 
magnitude of difference between the means is 
slight, the U test analyses revealed that the 
Samoan group was significantly less likely than 
the mainland group to depict remorse, p < .05; 
confession, p < .05; and restitution, p < .05.⁵


⁵ Mann-Whitney analyses between the ethnic
groups separated by sex revealed that both sexes
contributed approximately equally to the obtained
differences, e.g., on the three scales, remorse, confession,
and restitution, respectively, two-tailed p values
were < .15, < .20, and < .05 for boys, and < .10,
< .15, and < .05 for girls.
As expected, the proportion of Samoan children
with strong consciences was significantly less 
on both the real-life and projective measures. 
However, the magnitude of the mean scores ob- 
tained by the Samoans indicates that they have 
moderately strong dispositions toward guilt and 
suggests that they have, albeit rudimentarily, 
internalized certain aspects of conscience. 
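
The Fisher exact test on the temptation data can be checked by hand from the counts given above (7 of 15 Caucasians and 0 of 19 Samoans resisted). The sketch below is an illustration rather than the authors' computation: the report does not say whether the test was one- or two-tailed, so the two-sided convention used here (summing all tables no more probable than the observed one) is an assumption, as is the function name.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p for the 2 x 2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2
    denom = comb(n, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell,
        # with all marginal totals held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    observed = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more probable than the observed one;
    # the small factor guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1)
               if prob(x) <= observed * (1 + 1e-9))


# Rows: Caucasian, Samoan. Columns: resisted, yielded.
p_value = fisher_exact_two_sided(7, 8, 0, 19)
```

With these counts the observed table is the most extreme one possible, and the resulting probability falls below .01, in agreement with the level reported in the text.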

The predicted relationship between resistance 
to temptation and guilt after transgression is 
partially confirmed. As shown in Table 2, bottom
half, Mann-Whitney analyses comparing the 
guilt responses of subjects who resisted tempta- 
tion with those who yielded to temptation show 
that the resist group differed significantly from 
the yield group only on the remorse measure. The 
differences on confession and restitution
are, however, in the predicted direction.


DISCUSSION


The absence of sex differences on both the 
temptation and guilt measures is noteworthy 
inasmuch as other studies (e.g., Burton, Mac- 
coby, & Allinsmith, 1961; Grinder, 1962; Rebel- 
sky, 1961), employing similar type measures, 
have reported differences between the sexes. The
previous studies were conducted with consider- 
ably larger Ns, however, and in the present 
study, on the basis of the relatively small 
sample size, Type II errors were more probable. 

The results support the major expectation 
that in the comparisons between the Samoan and 
the mainland children proportionately fewer of
the Samoan children would demonstrate strong 
conscience development. The confirmation is ap- 
parent whether the behavioral measure of resist- 
ance to temptation or the projective stories are 
used in making estimates of conscience strength. 

The tentative findings suggest the desirability 
of further research with representative samples 
to determine the usefulness of classifying societies
into either shame or guilt cultures. In
the present study the Samoan subjects’ responses 
of guilt show that they are not as free from the 
effects of conscience as Mead (1955) has im- 
plied. The data presented here corroborate 
Ausubel’s (1955) belief that the Samoan social- 
ization sanctions should engender some suscepti- 
bility to guilt. Furthermore, they agree with 
those of Whiting and Child (1953), who rank 
the Samoans just below the median among 35 
societies on a cross-cultural index of guilt.

The postulated social learning hypothesis pre- 
dicts that being susceptible to guilt should serve 
as a cue for resistance to temptation, yet of 
the three guilt scales, only remorse shows a significant
relationship with resistance to temptation.
Since the scales are highly interrelated,
the discrepancies in the relationships between 
the three scales and temptation behavior appear 
as a surprise. It may be that the remorse scale 
tapped an aspect of conscience which is of a 
different nature than either confession or restitu- 
tion. Both Allinsmith (1960) and MacKinnon 
(1938) observed that individuals who acknowl- 
edge the pain of self-reproach after transgression 
are more likely to resist temptation. Perhaps the 
affect associated with remorse serves as a cue 
for the instigation of other behaviors indicative 
of conscience strength. Under certain circum- 
stances, of course, the affect may be avoided if 
a short latency exists between the environmental 
cues and either resistance to temptation on the 
one hand or reparative behaviors on the other. 
Also, extensive affect might be aroused if activa- 
tion of guilt relieving responses appears impos- 
sible. But for many individuals, anticipation of 
remorse may be a cue for resistance to temptation,
and the presence of remorse after transgression
may motivate efforts to make amends.
Thus, both resistance to temptation and certain
guilt behaviors may be associated with remorse;
however, if neither resistance to temptation nor 
these guilt behaviors are very reliable response 
patterns, there is no reason to suppose that 
strong, positive relationships should emerge be- 
tween them. 

ROLE OF DREAD IN SUICIDAL BEHAVIOR


DONALD E. SPIEGEL 
The proposition that inhibition of the experience of dread ordinarily evoked
by suicidal intention is a necessary condition for suicidal action was evaluated
by comparison of genuine and faked suicide notes. 3 judges, unaware that
some notes were simulated, independently rated 33 matched pairs of genuine
and faked notes in terms of 5 variables. On the basis of a combined χ² analysis,
confirmation was achieved for 4 of 5 hypotheses (p < .004) derived from a
theory of suicide. Genuine notes were less explicit in expressing suicidal
intention, contained fewer suicide synonyms, had more instructions to the
reader, and were more disorganized than faked notes. The findings suggest
that just prior to suicidal action, an individual’s attention may be diverted
from the imminent self-destructive act or its potentially dread-inducing
negative ramifications.


Research interest in suicidal behavior has
increased markedly in the past few years. For 
the most part, research effort has concentrated 
on psychological test findings and presuicidal 
behavior clues in terms of their correlations 
with future suicidal activity. Only a few investigations
such as those reported by Shneidman
and Farberow (1957a, 1957b, 1959) reveal
much concern with the affective states of in- 
dividuals prior to suicide, and these have been 
mostly descriptive accounts. 

The present investigation derived from a 
theory of suicide by the senior author which
may be briefly summarized as follows. An
individual does not necessarily commit suicide
when the urge to die becomes stronger than the
urge to live, for not everyone who has an overriding
urge to die is able to take his own life.
Most individuals find themselves overcome by
dread as they approach suicidal action and are
unable to carry out their wish to die. This
intense experience of dread which arises in as- 
sociation with imminent personal destruction 
serves as a natural brake against impulsive 
suicidal action. In order for suicidal action to 
become possible, some process must occur in 
the individual which reduces this experience of 
dread. This diminution of dread may be, in part, 
a function of psychological defenses which 
prevent the individual from a full realization of 
the negative ramifications of the suicidal act, and 
in part, a function of the natural decay of 
intense emotional experience. 

In this study an attempt was made to deter- 
mine whether prior to genuine suicidal action 
some process occurs in an individual which re- 
sults in a reduction in the intensity of the 
experience of dread associated with the im- 
minent suicidal action or its anticipated ramifi- 
cations, to explore and assess the nature of such 
a process, and to discuss the feasibility of
employing therapeutic procedures to counteract 
such a process as a suicide prevention measure. 

It was anticipated that analysis of suicide 
notes might reveal the operation of various 
defenses which would tend to minimize the 
experience of dread. In general, the notes were 
expected to reflect a process of self-deception 
with regard to the imminent suicidal event or a 
decathexis of the intended suicidal action. It
was expected that this might be manifested in 
a tendency to avoid discussing or alluding to 
the suicidal intention or act, in a tendency for 
the writer to concentrate attention upon events 
other than suicide, or in a relatively bland or 
unemotional suicidal communication. An extremely
disorganized note might reflect a psychotic
process which serves to protect the individual
from realizing the full implications of
his imminent action, thus rendering dread un- 
necessary. 

From these theoretical considerations, a set
of testable hypotheses was developed concerning
differences in the content of genuine suicide
notes and notes written by matched individuals
asked to simulate suicidal intention. As compared
to the simulated notes, the genuine notes
would (a) be less explicit in stating suicide as
an intention, (b) contain fewer words or terms
synonymous with suicide, (c) contain more instructions
with regard to such things as disposal
of property or the assuming of responsibilities
by others, (d) have a less dramatic
impact upon the reader, and (e) contain more
instances of extreme disorganization.


METHOD 


A series of 33 matched pairs of genuine and 
faked suicide notes had been developed by Shneid- 
man and Farberow (1957b). The real notes were 
collected from the files of the Los Angeles County 
Coroner’s office, whereas the faked notes were 
gathered from normal individuals who were asked 
to write the suicide note that they would write if 
they were going to commit suicide. The notes 
were matched for age and occupation of the note
writer. All the note writers were male, Caucasian,
Protestant, married, native-born, and between the 
ages of 25 and 59. In addition, the normal subjects 
had been rigidly screened for the presence of psy- 
chopathology. 

The three independent judges¹ used in the main
experiment were first instructed to rate each of
10 practice “suicide notes” in terms of five variables.
A five-point rating scale was used for Variable 1
(relative explicitness of expressed suicidal intention),
Variable 4 (dramatic quality of the note), and
Variable 5 (disorganization of thought processes).
Dichotomous judgments were required for Variable 2
(presence of the word "suicide" or suicide synonyms)
and for Variable 3 (presence of instructions to
the reader with respect to the disposal of property
or the assuming of responsibilities). Rating criteria
were explicitly stated for each variable and scale
points defined for variables using the five-point
scale.

¹ The authors wish to thank Charles W. Acker,
Linda C. Miller, and Luann Campeau for contributing
their time as judges for this study.

After satisfactory completion of the training 
session, which involved discussions between experi- 
menters and raters regarding the practice ratings, the 
experimental set of 66 randomly ordered coded 
notes was given to the judges independently along
with a set of written instructions and a rating 
sheet. Judges were requested to rate each of the 
66 "suicide notes" in terms of the same five variables. 
They were unaware that half the notes were 
simulated. 

After the ratings were completed, the results 
were tabulated and evaluated by the chi square 
method for Hypotheses b and c. The Mann-Whitney 
U method was used to evaluate Hypotheses a, d, 
and e, since the data were in ordinal form. The 
combined probabilities from the three judges were 
then evaluated by the chi square method suggested 
by Gordon, Loveland, and Cureton (1952). This 
is a method by which n independent rating samples 
can be evaluated to determine the presence of a 
statistical trend. 
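
The chi square values for the dichotomous variables can be illustrated from the frequencies reported in Table 1. The text says only "chi square"; the continuity (Yates) correction used below is an assumption, adopted because it reproduces several of the tabled values to within rounding. The function name and argument layout are hypothetical.

```python
def yates_chi_square(genuine_yes, simulated_yes, n_per_group=33):
    """Continuity-corrected chi square for a 2 x 2 table of note counts.

    Rows are genuine and simulated notes; columns are notes rated as
    containing the feature and notes rated as lacking it.
    """
    a, b = genuine_yes, n_per_group - genuine_yes
    c, d = simulated_yes, n_per_group - simulated_yes
    n = a + b + c + d
    # Yates correction: shrink |ad - bc| by n/2, never below zero.
    corrected = max(0.0, abs(a * d - b * c) - n / 2)
    numerator = n * corrected ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Counts taken from Table 1 (genuine, simulated).
synonyms_judge1 = yates_chi_square(4, 16)        # tabled value 8.680
synonyms_judge3 = yates_chi_square(8, 15)        # tabled value 2.402
instructions_judge1 = yates_chi_square(21, 11)   # tabled value 4.913
instructions_judge3 = yates_chi_square(15, 11)   # tabled value .571
```

Each of these four results agrees with the corresponding entry in Table 2 to about the third decimal place, which lends some support to the assumed correction.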


RESULTS 


The results of this study confirmed four of the
five derived hypotheses in a statistically significant
manner. Less explicitness, fewer mentions of
suicide or its synonyms, greater number of
instructions to the reader, and greater disorganization
were found for the genuine notes. No
difference was observed in the dramatic quality of
the genuine and faked notes.

An examination of the 330 ratings by the three 
judges revealed that Judge 1 agreed with Judges 
2 and 3, 79% and 78% of the time, respectively, 
and Judge 2 agreed with Judge 3, 71% of the 
time. 

A summary of the tabulations of the ratings 
made by the three judges on the 33 matched pairs 
of genuine and simulated suicide notes for the 
five test variables is found in Table 1. The results 
for all judges were similar in terms of the direc- 
tion of the ratings and were in the predicted 
direction except for Variable 4 (mean amount of 
drama in the notes). The genuine notes were
judged to be less explicit in terms of intention 
to die than the faked notes. Suicide or its syno- 
nyms were mentioned more often in the faked 
notes than in the genuine notes. Instructions to
the reader were found more often in the genuine
notes than in the faked notes. The judges rated
the genuine notes as being more disorganized and
dramatic than the faked notes.


TABLE 1

MEAN RATINGS FOR THE FIVE VARIABLES MADE BY THE THREE JUDGES ON THE 33 MATCHED
PAIRS OF GENUINE AND SIMULATED SUICIDE NOTES

                                                    Judge 1              Judge 2              Judge 3
Variable                                        Genuine  Simulated   Genuine  Simulated   Genuine  Simulated
                                                notes    notes       notes    notes       notes    notes
1. Mean amount of explicitness of expressed     2.27     3.12        2.03     3.42        1.45     3.00
   suicidal intent
2. Number of notes in which suicide or a        4        16          6        13          8        15
   synonym is mentioned
3. Number of notes mentioning instructions      21       11          21       11          15       11
   to the reader
4. Mean amount of drama in the notes            2.90     2.69        2.84     2.51        3.18     2.93
5. Mean amount of disorganization in the notes  3.27     2.75        2.54     1.78        2.96     2.57

Note.—Data for Variables 1, 4, and 5 are mean ratings along a five-point scale ranging from small to large amounts of the
variable.


TABLE 2

RESULTS OF THE STATISTICAL ANALYSES OF THE RATINGS FOR THE FIVE VARIABLES MADE BY THE THREE JUDGES
ON THE 33 MATCHED PAIRS OF GENUINE AND SIMULATED SUICIDE NOTES

                                                            Judge 1             Judge 2             Judge 3
Variable                          Statistical method        Statistical         Statistical         Statistical
                                                            value       p       value       p       value       p
1. Explicitness of expressed      Mann-Whitney U test       332.5       .0024   294.5       .0005   263.5       .0001
   suicidal intention
2. Frequency of suicide synonyms  Chi square test           8.680       .0016   7.922       .0025   2.402       .0618
3. Frequency of instructions      Chi square test           4.913       .0136   4.913       .0136   .571        .2266
4. Dramatic quality               Mann-Whitney U test       485.5       .2177   525.5       .4013   465.5       .1469
5. Disorganization                Mann-Whitney U test       399.0       .0262   373.5       .0013   441.5       .0853
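
The relation between a Mann-Whitney U value and its probability can be sketched with the large-sample normal approximation. This is an illustration under stated assumptions, not the authors' procedure: no tie correction is applied, the lower-tail (one-tailed) probability is taken, and the function name is hypothetical. Because ratings on a five-point scale contain many ties, the uncorrected figures should be expected to fall near, but not exactly on, the tabled p values.

```python
from math import erf, sqrt


def u_to_one_tailed_p(u, n1=33, n2=33):
    """Lower-tail normal approximation to the Mann-Whitney U distribution."""
    mean_u = n1 * n2 / 2
    # Standard deviation of U under the null hypothesis, ignoring ties.
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean_u) / sd_u
    # Standard normal cumulative probability at z.
    return 0.5 * (1 + erf(z / sqrt(2)))


# U values taken from Table 2.
p_explicitness_judge1 = u_to_one_tailed_p(332.5)   # tabled p .0024
p_drama_judge2 = u_to_one_tailed_p(525.5)          # tabled p .4013
```

For the dramatic quality rating of Judge 2 the approximation gives a probability of about .40, close to the tabled .4013; for the explicitness rating of Judge 1 it gives roughly .003 against the tabled .0024, a gap of the size a tie correction could plausibly account for.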

These results were then evaluated by the chi
square and Mann-Whitney U methods in order
to test the statistical significance of the differences
between genuine and simulated notes shown
in Table 1. The results of the analyses are to
be found in Table 2. The derived hypotheses were
confirmed by Judges 1 and 2 for four of the
five test variables at p < .03. For Judge 3,
Variable 1 was significant at p < .0001, Variables
2 and 5 at p < .10; however, the distribution
for Variable 3 might easily have occurred by
chance. No statistically significant differences
were found by any of the judges for Variable 4
(dramatic quality).


TABLE 3

CHI SQUARES AND COMBINED PROBABILITIES ASSOCIATED
WITH THE RATINGS MADE BY THE THREE
JUDGES ON THE 33 PAIRS OF MATCHED GENUINE
AND SIMULATED NOTES FOR THE FIVE VARIABLES

Variable                          χ²
1. Explicitness of expressed
   suicidal intention             40.0602***
2. Suicide synonyms               31.8385***
3. Frequency of instructions      20.3456**
4. Dramatic quality               8.7316*
5. Disorganization                26.0450***

Note.—df = 6.
* p = .192.
** p < .004.
*** p < .001.

A summary of the overall probabilities associ- 
ated with the ratings made by the three judges 
is presented in Table 3.²

² With this statistical method (Gordon, Loveland,
& Cureton, 1952), the chi square test is used to
determine the probability that the p levels achieved
by all three independent judges for each variable
might have occurred by chance.

It would appear that when the ratings made
by the three judges were combined statistically,
three of the previously confirmed hypotheses
(Variables 1, 2, and 5) were further substantiated
at p < .001, and a fourth hypothesis (Variable 3)
at p < .004. Only the hypothesis concerning
the dramatic quality of the notes (Variable 4)
failed to achieve confirmation.
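
The combination of the three judges' probabilities can be illustrated as follows. The sketch assumes that the method cited is the familiar procedure of summing -2 ln p over the independent samples and referring the total to chi square with two degrees of freedom per sample (here 6); that reading, and the function name, are assumptions. The inputs are the rounded p values printed in Table 2, so the results approximate rather than reproduce the entries in Table 3.

```python
from math import exp, log


def combine_p_values(p_values):
    """Return (chi square, df, combined p) for independent p values."""
    statistic = -2 * sum(log(p) for p in p_values)
    df = 2 * len(p_values)
    half = statistic / 2
    # For even df the chi square upper tail has a closed form:
    # exp(-x/2) * sum_{i < df/2} (x/2)^i / i!
    term, tail = 1.0, 0.0
    for i in range(df // 2):
        if i > 0:
            term *= half / i
        tail += term
    return statistic, df, exp(-half) * tail


# Rounded p values from Table 2 for Judges 1, 2, and 3.
drama = combine_p_values([0.2177, 0.4013, 0.1469])
instructions = combine_p_values([0.0136, 0.0136, 0.2266])
```

On these inputs the dramatic quality variable yields a chi square of about 8.71 with a combined probability near .19, and the instructions variable yields about 20.16 with a combined probability below .004, both in line with the pattern reported for Variables 4 and 3.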


DISCUSSION 


Although the theoretical propositions advanced 
appear to have received general support from the 
results of this study, it is recognized that the 
rating of genuine and simulated suicide notes 
can hardly be considered crucial in establishing 
or rejecting the validity of any theory of suicide. 

It should be understood that since only a small 
proportion of the individuals who commit suicide 
leave notes (Shneidman & Farberow, 1957b), 
it would be inappropriate to generalize interpreta- 
tions of the present findings to the thought proc- 
esses or feeling states of suicidal individuals who 
do not leave notes. 

Insofar as the present study is concerned, it 
would seem that the most convincing argument in 
behalf of the notion that some process inhibiting 
the experience of dread occurs in the individual 
prior to suicide comes from the confirmation of 
Hypotheses a and b. With regard to the “suicide 
synonyms" variable, some discussion of the 
results of Judge 3 is in order. An actual count
of the number of times "suicide" is mentioned 
reveals that this word does not appear at all 
in the genuine notes. Even explicit allusions 
to suicide, such as “electrocuting myself," “take 
my life,” “end my life,” “end my suffering forever,”
or “end it all,” appear only in the simulated
notes. The failure of Judge 3 to achieve
significance appears to have resulted from a 
different interpretation of the instructions con- 
cerning suicide synonyms than used by Judges 1 
and 2. Nevertheless, a p level of .06, combined
with p levels of .01 for the other judges, yields 
a significant combined probability of < .001. 

Even though no support was found for the 
hypothesis that a difference in dramatic impact 
would occur between genuine and simulated 
notes, it should be mentioned here that the 
majority of both the genuine and simulated 
notes were judged to be no more than moderately 
dramatic by all judges.

The proposed rationale for the more frequent 
concrete instructions to the reader and for the 
greater disorganization present in the notes 
seems reasonable. It is interesting that Osgood
and Walker (1959), using different measures of
disorganization, found no significant differences 
for this variable in comparing the two sets 
of genuine and simulated notes used in this 
study, nor in comparing genuine suicide notes
with ordinary letters to friends and relatives, 
although they had anticipated that greater dis- 
organization would occur in suicide notes as 
a manifestation of the disruptive effect of 
heightened drive level. The difference between 
results obtained in the present study and in 
the study by Osgood and Walker might easily 
be a function of the differences in the criteria 
used to assess disorganization. Their criteria 
involved a frequency ratio of errors in the struc- 
ture of the composition as well as length of 
independent clauses, whereas a global rating of 
disorganization on a five-point scale was used in 
the present study. 

It has been proposed that suicide becomes 
possible when dread fails to arise because it has 
somehow been rendered unnecessary through 
self-deceptive maneuvers which may take the 
form of an individual conceiving of himself as 
intending something other than suicide or avoid- 
ing the confrontation of negative ramifications 
of a recognized suicidal intention. For example, 
his thoughts may focus upon going away, upon 
the end state of release from torment, or upon 
freedom from pain or responsibility. If self- 
deception becomes completely successful one 
might conjecture that the individual might even 
appear relaxed and content and be readily considered
by others to be out of danger of
suicide. Under such circumstances it would not be 
surprising that many patients have been re- 
leased from hospitals as improved and shortly 
found dead to the amazement of their relatives 
and hospital personnel (Shneidman & Farberow, 
1957a). 

If one is to take seriously the possibility that
the successful suicide must somehow have 
managed to prevent dread from arising in 
sufficient strength to block suicidal action, two 
questions are immediately cogent concerning 
therapy for individuals who have threatened or 
attempted suicide. 

Is it possible, for example, that certain “therapeutic
procedures” actually enhance the likelihood
of eventual suicide? It would seem that
if an individual brings about the inhibition of 
dread through some self-deceptive maneuver, as 
has been suggested, this process may be reinforced
by relatives, therapists, or hospital ward
personnel who may feel that it is best to avoid 
discussing the suicide issue with the patient, 
fearing that discussion might serve to perpetuate 
disturbing and obsessive thoughts of suicide, 
and who, instead, concentrate their efforts only 
upon helping the patient to find renewed interest
in possibilities for living. Without discouraging
the effort to stimulate new interests,
might it not be more therapeutic to be persistent 
in helping the patient confront in detail the 
more dreadful personal ramifications of suicide 
with the assumption that a healthy amount of 
dread may be revived (or conditioned) in as- 
sociation with thoughts of suicidal action which 
might then later be aroused by suicidal impulses 
in sufficient strength to block suicidal action? 

Is it possible that the administration of 
tranquilizers to suicidal patients not only re- 
lieves certain depressive symptoms, but simul- 
taneously reduces the very anxiety or dread in 
relation to suicidal notions which may stand 
as the principal barrier to suicidal action when 
all life interests have faded? It seems a little 
surprising that although it has been generally 
publicized that a high positive correlation exists 
between suicide rate and the use of tranquilizers 
such as reserpine or phenothiazine derivatives, 
investigators have failed to pay more atten- 
tion to this possible drug effect. Beisser and 
Blanchette (1961), for example, while calling 
attention to this correlative phenomenon, com- 
ment that “the reasons for this correlation of 
high suicide rate and the use of tranquilizers
are not evident” (p. 369).


DEFENSIVE STYLE IN THE FAMILIES OF SCHIZOPHRENICS
AND CONTROLS 1

Biological parents of groups of Good and Poor Premorbid schizophrenics and
neurotic controls were administered the Rorschach test in order to evaluate
the relative level of ego maturity between parental groups. A rating schedule
designed to reflect styles of defensive behavior was devised in an effort to
objectify the dimension of ego maturity. It was found that this scale correlated
significantly with age, intelligence, and Rorschach response total of
the parents employed in the sample. The level of maturity of defensive
behavior manifested by the parents was found to differ significantly. Parents
of Poor Premorbid schizophrenics produced a greater amount of immature
defensive behavior than parents of Good Premorbids, with the parents of
neurotic controls falling intermediate between the schizophrenic groups. The
difference between parents of “Poors” and neurotics approached significance,
while the difference between parents of “Goods” and neurotics was clearly
unreliable.


A number of recent investigations of the 
families of schizophrenic patients have demon- 
strated the presence of severe disturbances in 
these families as compared to controls. An inordinate
degree of interpersonal conflict has been
found in the home (Frazee, 1953; Lidz, Corneli- 
son, Fleck, & Terry, 1957), and the relationships 
between family members have been described as 
distant, stereotyped, and unsatisfying (Bowen, 
1959, 1960; Lidz & Lidz, 1949; Wynne, Ryckoff, 
Day, & Hirsch, 1958). Patterns of disproportionate
dominance and submission have been found
in the daily transactions of the parents (Gerard 
& Siegel, 1950; Kohn & Clausen, 1956), and the 
entire family tends to use styles of communica- 
tion which are inefficient, contradictory, and 
conflict arousing (Bateson, Jackson, Haley, & 
Weakland, 1956; Weakland, 1960). 

The deviant relationships and intense discord 
observed in these families have suggested to some 
investigators that the ego maturity of the parents 
of schizophrenics is grossly inadequate relative to that of
control parents. Lidz, Cornelison, Fleck, and
Terry (1958) have characterized the schizophrenic
disorder as a folie à famille phenomenon
in which serious pathology is present in the family
as well as the identified patient. Brodey (1959)


1 This study was supported in part by a grant
from the North Carolina State Hospitals Board of
Control, and in part by funds from Grant OM-124,
“Family-Centered Treatment in Schizophrenia,” National
Institute of Mental Health, United States
Public Health Service.

2 Now at the University of Kentucky.

3 Now at the University of Illinois.


has also described these families as pathological, 
proposing the mechanisms of externalization and 
narcissistic gratification as constructs which offer 
a partial explanation for the disturbances. Bowen 
(1960) has suggested that the schizophrenic dis- 
order may be the culmination of at least three 
generations of psychopathology. The present 
study was aimed at evaluating such family con- 
ceptions of schizophrenia by comparing the ego 
maturity of parents of schizophrenics with parents 
of control patients.

Another group of investigators has drawn 
attention to differences between subgroups of 
schizophrenics in family relationships (Farina, 
1960; Garmezy, Clarke, & Stockner, 1961; Gar- 
mezy & Rodnick, 1959; Rodnick & Garmezy, 
1957). According to these studies, male schizo- 
phrenics whose prepsychotic adjustment has 
reflected a comparatively adequate level of social 
participation and heterosexual activity (Good 
Premorbids) tend to come from homes dominated
by a harsh, tyrannical father, with the
mother occupying a relatively weak, passive, and 
ineffectual role. Poor Premorbid patients, on the 
other hand, whose prepsychotic adjustment has 
reflected a greater degree of social isolation, and 
whose sexual activity has been markedly di- 
minished, tend to come from homes containing 
more interparental conflict and maternal domina- 
tion. In accord with these studies, it was also 
conjectured that the relative ego maturity of 
parents of Good and Poor Premorbid schizo- 
phrenics should differ, with the parents of Poors 
reflecting greater immaturity than parents of 
Goods. 

The concept of ego maturity has received wide
usage as an explanatory construct in the clinical
literature. Efforts to objectify the variable have
been somewhat slow in appearing, however.
Several investigators have cited differences in 
patterns of ego defensive behavior as an indica- 
tion of the level of maturity of ego function 
(Freud, 1954; Horney, 1939; King & Schiller, 
1960; Klein, 1948; Miller & Swanson, 1960). 
According to these reports, many of the defense 
mechanisms can be ascribed to a continuum of 
developmental maturity, with the relatively 
primitive defenses, such as denial, defining one 
pole of the dimension and the relatively more 
mature defenses, such as displacement and subli- 
mation, defining the other extreme. The present 
study compared the defensive styles of parents 
of schizophrenics and controls as an index of
the relative maturity of their ego function. 


METHOD 
Subjects and Procedure 


Biological parents of groups of male and female 
schizophrenics and neurotic control patients were 
identified according to the following plan. Patients
were chosen from consecutive admissions to a large 
state psychiatric facility in order to fulfill the fol- 
lowing criteria: age between 18 and 45 years; a 
psychiatric diagnosis of either schizophrenia or
psychoneurosis, uncomplicated by clinical indications 
of organic brain pathology or mental deficiency; and 
the possession of both biological parents alive and 
under the age of 70 years. Ratings of the adequacy 
of premorbid adjustment were made from the case 
histories of all schizophrenic patients using the 
Phillips (1953) Scale of Premorbid Adjustment. 
Balanced groups of Goods and Poors were defined 
for both males and females by dichotomizing patients 
at the midpoint of the distributions for each sex 
separately. Thus, six groups of patients were 
identified: male and female patients, classified as
Good or Poor Premorbid schizophrenics or neurotic
controls.

Parents of the patients were contacted by mail 
and asked to participate in the study on the occasion 
of their first visit to the hospital. Their cooperation 
was strongly encouraged and they were informed 
that their participation would be of clinical value 
in the treatment of the patient. Many of the parents 
actually cooperating were subsequently eliminated 
from the final sample, however, due to inability to 
comply with the task, excessive intellectual or visual 
deficits, language difficulties, or in some cases, the 
failure of both parents to respond. No differences 
were observed which would indicate that more
parents were excluded from specific patient categories,
although it was apparent that more follow-up letters
and telephone contacts were required in securing the
participation of both parents of Good Premorbids.

This difficulty seemed to be attributable to the fact

TABLE 1

DESCRIPTIVE CHARACTERISTICS OF
PATIENTS AND PARENTS

                           Male                       Female
Patient group      Good    Poor   Neurotic    Good    Poor   Neurotic

Patients
  M age a          24.8    24.2    28.2       26.0    30.4    27.0
  Phillips scale
    (range)        8-13   19-22     --        3-9    14-23     --
Mothers
  M age b          5?.?    51.6    61.4       48.?    54.2    54.2
  M IQ c           97.6   129.8   101.8      100.6   104.4   101.2
  M R d            16.8    17.2    13.8       14.0    14.8    14.0
Fathers
  M age b          55.0    55.2    62.?       52.2    55.8    56.2
  M IQ c           97.4   129.2    94.0      110.6   106.4   103.4
  M R d            12.0    16.4    10.?       15.2    12.8    10.8

Note. N = 5 in each cell.

a SD = 5.00 for each cell, computed from the entire table.

b SD = 2.81 for all comparisons between parents, computed
from the entire table.

c SD = 22.89 for all comparisons with parent dimension collapsed,
computed from the entire table.

d SD = 5.29 for all comparisons between parents, computed
from the entire table.



that Goods are more often married and usually live 
some distance away from the parental home. 
Roughly 70% of the parents contacted were included 
in the final sample. 

Groups of five mothers and fathers were obtained 
to fill each of the six cells of the design. Twenty- 
nine of the 30 parental pairs included both bio- 
logical parents of the patient. The remaining pair, 
in the female Good Premorbid group, was composed 
of the mother and father of different patients, since 
it was not possible to obtain a fifth intact family 
for this group. The inclusion of this pair of parents 
was necessary in order to preserve the balanced 
design. 

The Rorschach test and a short form of the 
Wechsler Adult Intelligence Scale were administered 
to all parents when they visited the hospital. The 
Rorschach was administered individually, in accord- 
ance with Beck’s procedure, except that the examiner 
faced the parent in all cases. All responses and other 
verbal productions were recorded verbatim, and an 
effort was made to note as much of the nonverbal 
behavior as possible. The Vocabulary, Comprehension, 
and Picture Completion subscales of the WAIS were 
also obtained in order to estimate the intellectual 
level of each of the parents. Performance on these
subscales was prorated to a Full Scale estimate of
intellectual level. All testing was done by one female
examiner.4

A summary of salient descriptive characteristics 
of both the patients and parents is provided in 
Table 1. The ages of the patients varied between 18 
and 36 years, with a combined mean of 26.8 years 
and cell means for the six groups well within chance 


4 We are indebted to Virginia K. Smith for her
invaluable assistance in administering the test materials
to the families.

TABLE 2

DEFINITIONS AND EXAMPLES OF THE DEFENSIVE STYLE SCALE

Defenses                                          Weights
Type I                                      Denial   Projection

A. Response dynamics
1. Content and associated affect incongruous (affect must be clearly implied or
stated)
One instance
Two or more instances
Examples: Smiling witches, pretty snakes, ugly ballet dancer
2. Repudiation of specific content of the percept or of the entire percept
(unrationalized)
One instance
Two or more instances    2
Examples: “An explosion. No, I guess it’s just a thundercloud.” “A gorilla.
No, it’s an animal hide.” “A ballet dancer. No, it isn’t that. I don’t know
what it is.”
3. Statements of negation
One instance
Two or more instances
Examples: “It couldn’t be a bat.” “It’s not people.”
4. Externalization of responsibility for the percept
One or two instances
Three or more instances
Examples: “I was just doing what you said.” “You said tell you everything
I saw.”
5. Criticism or negative evaluation of the test
One or two instances
Three or more instances
Examples: “These are ugly.” “I don’t like this one.” “This one is not made
well, the sides don’t match.”

B. Characteristic imagery
1. Imagery predominantly benign, “pretty,” or pollyannish in quality (Note:
All responses are given a positive or negative sign, reflecting the emotional
valence, with neutral or indeterminate responses omitted.)
About } to 4 of the scorable responses positive
Over 3 of the scorable responses positive
2. Content signifying surveillance or detection
One or more percepts
Examples: Eyes, fingerprints, police, people observing others (but not merely
looking at something or someone)
3. Content signifying protection or concealment
One or more percepts
Examples: Shield, mask, veil, person behind something, armor (or armored
animal, tank, battleship, etc.)
4. Human or human-like content which is endowed with intentions to act in
either a positive, supportive, creative direction or negative, destructive,
malevolent way toward another person or thing. Percepts implying neutral
or indeterminate valence should not be scored.
One instance
Two or more instances
Examples: Two natives boiling a victim, Caspar the friendly ghost, two
witches growling at each other

C. Test taking attitude
1. Glossy, shallow attitude toward percepts; test depreciation (e.g., this is
silly, interpreting clouds is more interesting, etc.); “flightiness,” or jumping
from percept to percept, thus precluding effective inquiry.
2. Wariness; interest in what the test “is really about”; concern with the
examiner’s recording; guarded, secretive approach.

D. Card rejection
1. One or more rejections, with multiple responses to other cards (Note: If
other cards elicit only one response each, omit scoring.)

Total    19

TABLE 2 (Continued)

Defenses                                          Weights
Type II                          Isolation   Undoing   Displacement

A. Response dynamics
1. Stilted, pedantic, or intellectualized verbal style, stressing nonessential detail
One instance
Two or more instances
Example: “These two gentlemen are having a heated discussion, or argument
-- or whatever you want to call it -- about . . .”
2. Emotionally loaded percepts delivered without affect
One instance
Two or more instances
Examples: “Bowel movement” delivered with same aplomb one would expect
for “lamp.” “A smear of blood” reported without disturbance or subsequent
improvement.
3. Rationalized rejection of a percept, or attempt to minimize it on the basis of
perceptual structure
One instance
Two or more instances
Examples: “A tiger. Actually it looks more like a caricatured tiger because of
the exaggerated nose and the color of the fur.”
4. Benign or pollyannish percept developed immediately following a response
reflecting strong emotional content
One instance
Two or more instances

B. Characteristic imagery
1. Content signifying efficiency or machine-like characteristics
One or two instances    1
Three or more instances    2
Examples: Machines of a relatively complex, efficient nature; machine-like
animate content (e.g., dance team, bookkeeper, etc.). (Note: Exclude modes
of conveyance, e.g., car, boats, planes, etc.)
2. Religious percepts or content relating to religious activities
One instance
Two or more instances
Examples: Church, Bible, two people genuflecting, prayer cap
3. Intellectual or artistic content
One or two instances
Three or more instances
Examples: abstraction, art or literature, symbolic content, mythical or
historical percepts
4. Emotional content symbolized by inanimate object percepts (e.g., milk
bottle, pistol, jet bomber, etc.) or botanical percepts (e.g., orchid, corsage,
etc.) more than through human content (e.g., buxom woman, jeering
clown, head hunter, etc.). (Note: Nonhuman animate content is omitted,
e.g., gorilla, etc.)
Emotion expressed about equally in each area    1
Emotion expressed primarily through inanimate percepts    2
5. Distantiated human content—e.g., statues, men from Mars, African natives, 
etc.—or the attribution of unpleasant or depreciatory characteristics to out- 
group members—e.g., two niggers, two idiots, etc. 
One instance 
Two or more instances 
6. Content containing obvious sexual symbolism (e.g., snake, cave surrounded
by shrubbery, bullet penetrating the earth)
One or two instances    1
Three or more instances
7. Objects used in vocational pursuits (e.g., sewing machine, lathe, saw,
hammer, etc.)
One or more instances

C. Test taking attitude
1. Task viewed as a “test” or challenge which one must rise to meet in order to
conform to an internalized standard    1

Total    21

variation (F = 1.96, df = 2/24). The Phillips scores
for the schizophrenic patients reflect sufficient spread
to insure a meaningful dichotomy along the dimen- 
sion of adequacy of premorbid adjustment, and gaps 
of four and five points between the male and female 
groups suggest that the chance of misclassified group 
members is relatively slight. 

Comparison of the descriptive characteristics of the 
parents of the six patient groups was accomplished 
by analyzing each of the three variables separately, 
according to Lindquist’s (1953) Type III mixed 
analysis of variance design. Two of the variables 
produced significant differences, with the age of the 
fathers being significantly greater than that of the 
mothers (F = 12.38, df = 1/24, p < .005), and the
intellectual level of the parents of the three diagnostic
groups differing reliably (F = 3.53, df = 2/24,
p < .05). The parents of the Poors were found to
be significantly brighter than the other groups, while 
the parents of the Goods and neurotics did not 
differ reliably. The Rorschach response total (R) 
of the mothers exceeded that of the fathers, although 
the difference was not significant at the .05 level 
(F = 2.42, df = 1/24).


Defensive Style Scale 


In accord with the report of Miller and Swanson
(1960) on the development and structure of defensive 
behavior, a continuum was conceived to which many 
specific defenses can be ascribed. This continuum is 
regarded as reflecting a dimension of ego maturity 
and efficiency of adjustment. One pole of the
dimension is defined by defenses characterized as
more reality distorting, situationally nonspecific, re- 
quiring relatively little prior training or skill, and 
being relatively ineffective and unacceptable socially. 
Defenses inclining toward this pole of the continuum 
are regarded as less mature and are classified as 
Type I defenses. At the opposite end of the con- 
tinuum are defenses which are less reality distorting, 
more situationally specific, more dependent upon 
learned skills, and more socially effective. Defenses 
inclining toward this pole are considered more mature 
and are classified as Type II patterns. 

Defenses were chosen which bear a theoretical 
correspondence to the two classes of adjustment 
defined. Defenses chosen were: denial and projection 
as representative of Type I defenses; and isolation, 
undoing, and displacement representing Type II de- 
fensive behavior. Instances of specific defensive 
maneuvers manifested in the Rorschach test were 
obtained primarily from Schafer (1954),5 and weights 
were determined on the basis of clinical judgment. 
Weights were assigned in order to reflect the defensive 
style to which the behavior bears most direct correspondence,
and, in addition, to reflect the a priori
confidence the authors had in being able to make the 


5 Item B3 of the Type I scale was obtained from
Fisher and Cleveland’s (1958) report on Body Image.
The item was chosen because of its apparent relationship
to the mechanism of projection.




ratings reliably. Scores for the defenses were com- 
bined into two separate scales, providing an estimate 
of the relative amounts of Type I and Type II 
behavior present in the Rorschach responses. The 
items of the scales, along with illustrative examples 
and scoring weights, are presented in Table 2.


Scoring Procedure 


The Rorschach protocols of 20 parents excluded 
from the final sample were scored with the Defensive 
Style Scale (DSS) to obtain a preliminary assess- 
ment of the procedure. The protocols of five parents 
were used for practice and discussion of scoring 
problems between two raters, and the remaining 15 
were scored independently, in order to evaluate inter- 
rater reliability. Each protocol was rated for Type I 
and Type II defensive behavior, and difference scores 
were obtained by subtracting Type II scores from 
corresponding Type I values with a constant of 21 
added to remove negative signs. Thus, DSS scores 
could range from 0 to 40, with higher values repre- 
senting greater degrees of Type I behavior. Scores 
actually obtained ranged from 10 to 25 and 10 to 26 
for the two raters, with a product-moment correla- 
tion between raters of .86 (df=13, p<.01). This 
degree of reliability between raters was considered 
quite satisfactory. After completing these preliminary 
ratings, the Rorschachs of the parents included in 
the sample were rated by a single rater according 
to the same procedure. 
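
The difference score and the inter-rater check described in this section can be illustrated with a minimal Python sketch. The function names `dss_score` and `pearson_r` and the sample ratings are hypothetical, and the Type I maximum of 19 is an assumption inferred from the stated 0 to 40 range together with the Type II total of 21 in Table 2; it is not given explicitly in the text.

```python
from math import sqrt

TYPE_I_MAX = 19   # assumed: implied by the 0-40 range, not stated in the text
TYPE_II_MAX = 21  # Type II total reported in Table 2
CONSTANT = 21     # constant added to remove negative signs


def dss_score(type_i, type_ii):
    """Type I minus Type II, shifted by 21 so that no score is negative."""
    if not 0 <= type_i <= TYPE_I_MAX or not 0 <= type_ii <= TYPE_II_MAX:
        raise ValueError("rating outside the assumed scale limits")
    return type_i - type_ii + CONSTANT


def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical (Type I, Type II) ratings of the same protocols by two raters.
rater_1 = [(6, 9), (3, 12), (10, 7), (5, 5), (8, 10)]
rater_2 = [(7, 9), (3, 11), (9, 7), (5, 6), (8, 11)]

scores_1 = [dss_score(t1, t2) for t1, t2 in rater_1]
scores_2 = [dss_score(t1, t2) for t1, t2 in rater_2]
reliability = pearson_r(scores_1, scores_2)
```

Higher `dss_score` values indicate relatively more Type I behavior, and a protocol with equal Type I and Type II ratings falls at the constant, 21.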


RESULTS AND DISCUSSION


Since significant differences were found between 
parent groups on two of the control variables, 
it was necessary to correct all DSS scores for 
these variables before analysis. R was also included
as a third control variable because of
its importance in evaluating most Rorschach 
scales. Accordingly, intercorrelations among the 
variables were computed for all 60 parents. The 
resulting matrix of correlations along with means 
and standard deviations of the variables are 
reported in Table 3. The multiple correlation 
between DSS scores and the three control variables
was found to be .667 (F = 14.99, df = 3/56,


TABLE 3
INTERRELATIONS BETWEEN AGE, IQ, RESPONSE
TOTAL (R), AND DEFENSIVE STYLE

Variable       M       SD       2          3          4
1. Age       54.80    7.54    -.021      -.218      +.286*
2. IQ       106.37   19.89               +.466**    -.525**
3. R         14.03    6.16                          -.566**
4. DSS       19.51    3.26

Note. N = 60.
* p < .05.
** p < .01.

p < .001). A value of this magnitude indicates
that approximately 44% of variance of the raw 
DSS scores is associated with age, IQ, and R. 
Partial correlations of each of the three variables 
with DSS scores were also computed in order 
to evaluate the relationship between each of the 
separate variables and defensive behavior. All 
three variables were found to be significantly 
related to Defensive Style, with partial correlations
of .254 (df = 56, p < .05), -.388
(df = 56, p < .005), and -.377 (df = 56, p <
.005) obtained for age, IQ, and R, respectively.
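
The multiple and partial correlations reported here can be recomputed from the rounded entries of Table 3. The sketch below, in Python, is illustrative only: the helper names (`det`, `minor`, `partial_with_dss`) are hypothetical, and the determinant identities used (R² = 1 − |R_full| / |R_predictors|, and the cofactor form of the partial correlation) are standard results rather than the authors' stated computing method. Because the inputs are rounded to three decimals, the results should land close to, but not exactly on, the published values.

```python
from math import sqrt


def det(m):
    """Determinant by Laplace expansion (adequate for matrices up to 4 x 4)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, a in enumerate(m[0]):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * a * det(sub)
    return total


def minor(m, i, j):
    """Copy of m with row i and column j removed."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]


# Correlations from Table 3; variable order: age, IQ, R, DSS.
corr = [
    [1.000, -0.021, -0.218, 0.286],
    [-0.021, 1.000, 0.466, -0.525],
    [-0.218, 0.466, 1.000, -0.566],
    [0.286, -0.525, -0.566, 1.000],
]
Y = 3   # index of the DSS score
N = 60  # number of parents

# Squared multiple correlation of DSS with the three control variables.
r_squared = 1.0 - det(corr) / det(minor(corr, Y, Y))
multiple_r = sqrt(r_squared)

# F ratio for the multiple correlation with 3 and N - 4 degrees of freedom.
f_ratio = (r_squared / 3) / ((1.0 - r_squared) / (N - 3 - 1))


def partial_with_dss(i):
    """Correlation of predictor i with DSS, holding the other two constant."""
    cof_iy = (-1) ** (i + Y) * det(minor(corr, i, Y))
    cof_ii = det(minor(corr, i, i))
    cof_yy = det(minor(corr, Y, Y))
    return -cof_iy / sqrt(cof_ii * cof_yy)


partials = {name: partial_with_dss(i) for i, name in enumerate(["age", "IQ", "R"])}
```

Each partial correlation has N − 4 = 56 degrees of freedom, since two further variables are held constant in addition to the two being correlated.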
The low positive relationship between in- 
creasing age and the predominant use of Type I 
defenses is congruent with independent evidence 
of declining adaptive resources, regression, and 
disorganization of defensive structure associated 
with aging (Wolff, 1959). It is a frequent ob- 
servation that older people often resort to 
increasing use of projection and denial as a means 
of coping with stresses arising from their dimin- 
ishing capacities. Since the sample employed in 
the present study was an older group of parents, 
the positive correlation between age and Type I 
behavior seems quite consistent. This is not to 
imply, however, that this relationship would be 
expected within a more homogeneous age range 
or in a younger sample. Indeed, one should 
expect a negative relationship in a younger 
sample, since denial is usually cited as one of 
the earliest defenses to appear, being later 
abandoned as the child matures (Freud, 1954; 
Jacobson, 1957). 
The moderate negative relationships of IQ and
R with primitive defensive behavior are also
consistent with observations of other investigators.
Miller and Swanson (1960, p. 221) report
a significant negative relationship between verbal
intelligence and the use of denial by adolescent
boys in a failure stress situation. Moreover,
Schafer (1954) draws upon a number of clinical
reports in support of his contention that subjects
utilizing the defenses of intellectualization, isolation,
and undoing should be expected to produce
a Rorschach record rich in responses and replete
with detail. These relationships are in agreement
with expectations concerning correlates of specific
defensive patterns included in the present scale.
Thus, the correlations obtained provide some
evidence for the effectiveness of the scale in
measuring what it is supposed to reflect, i.e.,
styles of defensive behavior as an index of the
relative maturity of ego function.
The main purpose of the study was to evaluate
styles of defensive behavior in the parents of
the three diagnostic groups. Accordingly, all raw
scores were corrected for differences in age,
IQ, and R by multiple regression. These corrected
scores were then entered in a Type III
analysis of variance table (Lindquist, 1953) with 
the three dimensions defined by sex of the patient, 
diagnostic group of the patient, and sex of the 
parent. An analysis of the variance of the ad- 
justed scores was then conducted. Only the main 
effect for diagnostic groups attained significance 
at the .05 level (F = 3.84, df = 2/24). The re- 
maining main effects (i.e., differences between 
mothers and fathers and differences between 
parents of male and female patients) and all 
interactions failed to reach statistical significance. 
The order of the mean DSS scores for the parents 
of the three diagnostic groups, in terms of in- 
creasing use of the more immature Type I de- 
fenses, was found to be 19.43, 19.49, and 19.62 
for parents of Goods, neurotic controls, and 
Poors, respectively. Comparisons of the indi- 
vidual means indicated that the parents of Poors 
utilize significantly more Type I defenses than 
the parents of Goods (t = 2.70, df = 38,
p < .05). The difference between parents of
Poors and neurotic controls approached significance
(t = 1.84, df = 38, p < .10), while the
difference between neurotics and Goods was
clearly unreliable (t < 1.00). In view of the
inversion in the order of the means found be- 
tween parents of neurotics and Goods, an over- 
all comparison of differences in Defensive Style 
between the parents of the combined group of 
schizophrenics and the parents of controls was 
considered inappropriate and was not done. 
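
The correction of raw scores "by multiple regression" mentioned in this paragraph can be sketched as follows. This Python example is an assumption about the general procedure, not the authors' own computation: it fits a least-squares regression of the raw score on the three control variables and keeps the residual plus the grand mean. The function names `solve` and `adjust_scores` and all data values are hypothetical.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry into place.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def adjust_scores(covariates, scores):
    """Remove the linear effect of the covariates, keeping the grand mean."""
    n = len(scores)
    k = len(covariates[0])
    cov_means = [sum(row[j] for row in covariates) / n for j in range(k)]
    score_mean = sum(scores) / n
    centered = [[row[j] - cov_means[j] for j in range(k)] for row in covariates]
    dev = [s - score_mean for s in scores]
    # Normal equations on centered data: (X'X) b = X'y.
    xtx = [[sum(c[i] * c[j] for c in centered) for j in range(k)] for i in range(k)]
    xty = [sum(c[i] * d for c, d in zip(centered, dev)) for i in range(k)]
    slopes = solve(xtx, xty)
    return [s - sum(b * x for b, x in zip(slopes, c))
            for s, c in zip(scores, centered)]


# Hypothetical parents: (age, IQ, R) and the corresponding raw DSS scores.
covariates = [(48, 112, 18), (61, 95, 11), (55, 101, 14),
              (52, 120, 20), (66, 90, 9), (50, 104, 15)]
raw_dss = [17, 23, 20, 16, 24, 19]
adjusted = adjust_scores(covariates, raw_dss)
```

Scores adjusted in this way are uncorrelated with age, IQ, and R within the sample, so group differences in the subsequent analysis of variance cannot be attributed to linear effects of those three variables.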

The adjusted DSS scores indicate that an 
overall difference in the maturity of defensive 
behavior between parents of the combined group 
of schizophrenics and neurotic controls does not 
exist reliably in the present data. Parents of 
subgroups of schizophrenics, distinguished on 
the basis of the adequacy of the patient’s pre- 
morbid history, do differ significantly, however, 
with the parents of Poor Premorbids employing 
significantly more primitive defenses than the 
parents of Goods. On the basis of these data, 
it can be inferred that the level of ego maturity 
of the parents of Poors is significantly less than 
the maturity and social effectiveness of the 
parents of Goods. The ego maturity of the 
parents of Poors also tends to be less than that 
of parents of neurotic controls. This result needs 
to be corroborated by further evidence, however. 
The present data suggest that distinguishing sub- 
types of schizophrenic patients has considerable 
heuristic advantage over investigating differences 
between schizophrenics as a whole and various
control groups.


EXPERIMENTAL DESENSITIZATION OF A PHOBIA 1


PETER J. LANG AND A. DAVID LAZOVIK
University of Pittsburgh 


24 snake phobic Ss participated in an experimental investigation of systematic 
desensitization therapy. Ss who experienced desensitization showed a greater 
reduction in phobic behavior (as measured by avoidance behavior in the 
presence of the phobic object and self-ratings) than did nonparticipating 
controls. Ss tended to hold or increase therapy gains at a 6-month follow-up
evaluation, and gave no evidence of symptom substitution. 


In recent years there has been increasing
interest in the development of psychotherapeutic
techniques based on learning theory
models. These efforts are not limited to the
translation of accepted psychotherapeutic
practice into a laboratory language, in the
manner of Shoben (1949) and Dollard and
Miller (1950), but are attempts to extrapolate
from laboratory findings to new
methods of treatment. The most promising of
these techniques, with respect to clinical findings,
is Wolpe’s (1958) systematic desensitization
therapy of phobic reactions. In a
recent article Wolpe (1961) reported that
desensitization was effective in the treatment
of 35 of 39 phobic patients. Similar results
have been reported by Lazarus (1961) utilizing
group desensitization.

In a pilot project Lazovik and Lang (1960)
demonstrated that desensitization could be
successfully carried out under controlled laboratory
conditions. This result not only opens the way
to a more precise evaluation of
treatment outcomes, but also makes it possible
to test conflicting theories of the treatment
process.

According to Wolpe (1958), desensitization
is effective to the extent that subjects
learn to make responses to phobic objects
which reciprocally inhibit (are incompatible
with) fear. Specifically, the treatment is
designed to substitute muscular relaxation for
anxiety. It is assumed that this process (not
suggestion, “hello-goodbye” effects, or transference)
is the agent of behavior change. It


1 This research is supported by Grant M-3880
from the National Institute of Mental Health,
United States Public Health Service. The main
content of this paper was presented by A. David
Lazovik at the meeting of the American Psychological
Association in New York, September 1961.
is further assumed that explorations with the
patient of the genesis of the fear are not 
necessary to the elimination of a phobia. 
Wolpe proposes that the unlearning of a 
phobia follows the rules of what is generally 
called association learning theory. He therefore expects that therapy
will be more difficult, the more generalized the anxiety response,
but that “symptom substitution” is not a consequence of successful
behavior therapy.

A very different set of predictions would 
be made by psychoanalytic therapists. This 
frame of reference expects little positive result unless the
background of the phobia, and its symbolic meaning, is elucidated and
worked through with the subject. If this approach is not employed,
only a temporary “transference cure” may be anticipated. It is
further assumed that the difficulty of the case is related to the
importance of the symptom in the individual's “psychic economy,” and
that its temporary removal can only lead to the substitution of some
new symptom.

The current experiment is designed to 
evaluate these two interpretations of desensi- 
tization therapy. The procedure developed 
previously (Lazovik & Lang, 1960), while it 
submits to the rigid control of the laboratory, 
is nevertheless sufficiently flexible that it can 
be employed in the treatment of actual 
phobic behavior. In this experiment snake phobic individuals served
as subjects. This fear was chosen because it is frequent in a college
population (approximately 3 in 100 students are to some degree snake
phobic), and also because of the symbolic, sexual significance
attributed to this fear by psychoanalytic theory (Fenichel, 1949,
p. 49). The fact that snake phobias are held to reflect conflict in
more fundamental systems of the personality suggests that this is
good ground for a stringent test of behavior therapy.

Specifically, the study is designed to: 
evaluate the changes in snake phobic behavior 
that occur over time, particularly the effects 
of repeated exposure to the phobic object; 
compare these changes with those that follow 
systematic desensitization therapy; determine 
the changes in behavior that are a direct 
function of the desensitization process, as 
opposed to the independent effects (when not 
part of desensitization) of hypnosis, training 
in deep muscle relaxation, and the establish- 
ment of a good patient-therapist relationship. 
In addition, an attempt is made to isolate 
factors which determine the success or failure 
of this method with individual subjects. 


METHOD
Systematic Desensitization 


The experimental treatment consists of two 
sequential parts, training and desensitization proper 
(Lazovik & Lang, 1960). The former procedure 
requires five sessions of about 45 minutes each. 
At this time an anxiety hierarchy is constructed. 
This is a series of 20 situations involving the 
phobic object, which each subject grades from most 
to least frightening. The actual items vary from 
subject to subject. However, the following scenes 
are typical: “writing the word snake,” “snakes on
display at the zoo (moving within a glass case),”
“stepping on a dead snake accidentally.”

The subject is then trained in deep muscle re- 
laxation, following the method presented by Jacob- 
son (1938). He is further instructed to practice 
relaxation 10-15 minutes per day at home. In the 
final phase of the training period the subject is 
introduced to hypnosis, and an effort is made to 
teach him to visualize vividly hypnotic scenes.

Following training, there are 11 45-minute sessions
of systematic desensitization. In this, the
subject learns to respond with relaxation to 
stimuli that originally evoked anxiety. At the begin- 
ning of the first session the subject is hypnotized 
and instructed to relax deeply. He is then told to 
imagine the hierarchy item which he previously 
rated as least distressing—the smallest “dose” of 
anxiety. If relaxation is undisturbed by this experi- 
ence, the subsequent item is presented. Items which 
induce small amounts of anxiety are repeated, fol- 
lowed by deep relaxation, until the subject reports 
he is undisturbed by the scene. In this way successive
items are presented from session to session.
The goal of treatment is the presentation of the
item originally ranked as most frightening without
impairing the individual’s calm state. At this point
a new response (relaxation) has been attached to
the imagined representative of the fear-inducing
stimulus, and clinicians working with the method
assume that it will readily transfer to actual life
situations.
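
The stepwise logic of this procedure can be summarized in a short sketch. The Python below is only an illustration under stated assumptions, not the mimeographed program used in the study: the function name `run_desensitization`, the `is_disturbed` callback (standing in for the subject's report after each imagined scene), and the `max_presentations` budget (standing in for the fixed number of sessions) are all hypothetical.

```python
from typing import Callable, List


def run_desensitization(
    hierarchy: List[str],
    is_disturbed: Callable[[str, int], bool],
    max_presentations: int = 50,
) -> int:
    """Walk an anxiety hierarchy ordered from least to most frightening.

    Each item is presented repeatedly (with deep relaxation between
    presentations) until the subject reports no disturbance; only then is
    the next item presented. Returns the number of items completed before
    the presentation budget (a hypothetical stand-in for the session
    limit) runs out.
    """
    completed = 0
    budget = max_presentations
    for item in hierarchy:
        presentation = 0
        while True:
            if budget == 0:
                return completed  # time ran out before this item was passed
            budget -= 1
            presentation += 1
            if not is_disturbed(item, presentation):
                break  # relaxation undisturbed: move to the next item
            # Otherwise: relax deeply, then repeat the same item.
        completed += 1
    return completed


# Hypothetical subject who is disturbed only by the first presentation
# of each scene.
scenes = [
    "writing the word snake",
    "snakes on display at the zoo",
    "stepping on a dead snake accidentally",
]
print(run_desensitization(scenes, lambda item, n: n == 1))
```

The count returned by such a loop corresponds to the "number of hierarchy items successfully completed," which is the progress index used later in the results.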

In the experimental treatment described here, 
just these operations were carried out. No attempt 
was made to induce change through direct hypnotic 
suggestion, nor was an effort made to alter motiva- 
tion. Subjects were informed that the experimenter 
was trying to evaluate a new method of treatment, 
and that he was much more interested in accurate 
findings than therapeutic successes. A majority of 
the therapist’s actual verbalizations, as well as the 
step by step description of the training and 
desensitization procedures, was contained in a 
mimeographed program which guided the treatment 
of all subjects. 


Subjects 


A total of 24 subjects participated in this re- 
search. They were all college student volunteers, 
attending undergraduate psychology courses. The 
experimental groups included a total of four males 
and nine females. The control groups consisted of 
three males and eight females. None of these sub- 
jects presented evidence of a severe emotional dis- 
turbance on the basis of MMPI and interview data. 

Subjects were selected on the basis of a classroom 
questionnaire which asked students to list their 
fears and rate them as mild, moderate, or intense. 
All subjects who participated in this experiment 
were afraid of nonpoisonous snakes, and rated this 
fear as “intense.” Furthermore, the two authors 
interviewed all subjects who met this criterion. If 
despite the high self-rating on the screening question- 
naire the subject’s fear was judged to be weak, he 
was not asked to participate in the project. Sub- 
jects who formed the final experimental sample 
were characterized by most of the following be- 
haviors: They reported somatic disturbance as- 
sociated with the fear—“I feel sick to my stomach 
when I see one.” “My palms get sweaty. I’m tense.” 
They habitually avoided going anywhere near a live
snake. They would not enter the reptile section of
the zoo or walk through an open field. They became
upset at seeing snakes at the motion pictures
or on the television screen, and would leave, close
their eyes, or turn off the set. Even pictures in
magazines or artifacts such as a snake skin belt
were capable of evoking discomfort in many of
these subjects.


Measures of Phobic Behavior 


All subjects filled out a Fear Survey Schedule 
(FSS) at the beginning and end of the experiment, 
and again at a 6-month follow-up evaluation. The
FSS is a list of 50 phobias each of which is rated 
by the subjects on a 7-point scale. An estimate 
was thus obtained not only of the subject’s snake 
phobia, but of other related and unrelated fears. 

A direct estimate of the subject’s avoidance behavior
was obtained by confronting him with the
phobic object. The subject was informed that a
nonpoisonous snake was confined in a glass case
in a nearby laboratory. He was persuaded to enter
the room and describe his reactions. The snake was
confined at a point 15 feet from the entrance to
the room. On entering the room with the subject,
the experimenter walked to the case and removed a
wire grill that covered the top. The subject was
assured that the snake was harmless. The experimenter
then requested that the subject come over
and look down at the snake as he was doing. If
the subject refused, he was asked to come as close 
as he felt he could and the distance was recorded. 
If the subject was able to come all the way to 
the case, he was asked to touch the animal (a 
5-foot black snake) after he had seen the experi- 
menter do this. If the subject succeeded in this, the 
experimenter picked up the snake and invited the 
subject to hold it. After the avoidance test, the 
subject was asked to rate his anxiety on a 10-point 
“fear thermometer” (Walk, 1956). The subject's 
degree of anxiety was also rated on a 3-point scale 
by the experimenter. 

In addition to the subjective scales and the 
avoidance test, all subjects were extensively inter- 
viewed concerning their fear. These interviews were 
tape recorded. The experimenter who conducted the 
interview and administered the avoidance test 
participated in no other phase of the project.²


Procedure 


Following an initial interview and the administration
of Form A of the Stanford Hypnotic Susceptibility
Scale (SHSS; Weitzenhoffer & Hilgard,
1959), subjects were placed in the experimental or
control groups. Assignment was essentially random,
although an effort was made to balance roughly
these groups in terms of intensity of fear and motivation
to participate in the experiment. All subjects
were administered Form B of the SHSS when
the experimental subjects completed the training
period, and before desensitization began.

The basic plan of the study is described in Table 1.
It consisted of two experimental and two control
groups. The subgroups were created so that the
effects of repeating the avoidance test, pretherapy
training, and desensitization itself could be separately
evaluated. Thus, the experimental groups E1 and E2
both experienced the laboratory analogue of desensitization
therapy already described. However, subjects
assigned to E1 were administered the avoidance
test before the training period, prior to desensitization,
and again at the end of the experiment. E2
subjects, on the other hand, were tested before
desensitization and after, but did not participate in
the initial evaluation. The control subjects did not
participate in desensitization, but the C1 and C2
groups were evaluated at the same time as their


2 The authors would like to thank David Reynolds, who acted as
interviewer and conducted the snake avoidance test.


TABLE 1

DESIGN OF THE EXPERIMENT, SHOWING THE TIMES AT
WHICH SUBJECTS WERE EVALUATED (the Snake
Avoidance Test, Experimenter's Rating,
Fear Thermometer, and Taped Interview)

Group | Experimental procedures
E1    | Test 1 | Training | Test 2 | Desensitization | Test 3
E2    |   --   | Training | Test 2 | Desensitization | Test 3
C1    | Test 1 |    --    | Test 2 |       --        | Test 3
C2    |   --   |    --    | Test 2 |       --        | Test 3


opposite numbers in the experimental series. All 
available subjects were seen and evaluated 6 months 
after the termination of therapy. 

Four replications of this experiment are reported 
here. They varied only in the therapists who were 
assigned to the experimental groups. Four experi- 
mental subjects and five controls participated in 
the first replication. The authors each saw two of 
the experimental subjects. In the second, third, and 
fourth replications (which included three, four, and 
two experimental subjects and two, three, and one 
control subjects, respectively) three other therapists 
participated.³ While two of these individuals are
engaged in full-time private practice, they had never 
before attempted desensitization therapy. The third 
therapist was an advanced clinical graduate student, 
who also had his initial experience with the 
desensitization method in this project. 


RESULTS 
Avoidance Test 


The results of this test were evaluated in 
two ways: an absolute criterion in which 
touching or holding the snake constituted a 
test pass, and scale scores based on the sub- 
ject’s distance in feet from the snake. Table 2 
presents the number of subjects from the 
separate experimental and control groups who 
met the former criterion. 

Note that the reliability of this test is 
high. The control subjects show no appreci- 
able change, even with three exposures to 
the snake. Furthermore, the pretherapy train- 
ing period does not affect the performance of 
the experimental subjects: no more E1 subjects
pass at Test 2 than at Test 1. However,
following therapy, the incidence of test
passes goes up significantly in the experi- 
mental group. The percentage of increase 


3 The authors would like to thank Robert Romano,
Richard Miller, and James Geer, who participated
as therapists in this project.


TABLE 2

NUMBER OF SUBJECTS WHO HELD OR TOUCHED THE
SNAKE DURING THE AVOIDANCE TEST

Group     | N  | Test 1 | Test 2 | Test 3
E1        | 8  |
E2        | 5  |
C1        | 5  |
C2        | 6  |
E1 and E2 | 13 |
C1 and C2 | 11 |

[The pass counts for the individual tests are illegible in this copy.]


from Test 2 to Test 3 yielded a t of 2.30,
p < .05. A similar test of the control subjects
was not significant.⁴

The above analysis does not, of course, 
measure subtle changes in behavior. In an 
attempt to increase the sensitivity of the 
avoidance test, subjects were assigned scores 
on a 19-point scale which roughly corre- 
sponded to their closest approach in feet to 
the phobic object. Holding the animal was 
equal to a scale score of 1; touching, 2; the 
1-foot mark, 3; 2 feet, 4; and so on up to a 
score of 19 for subjects who refused to go to 
the testing room. The correlation between 
the first two presentations of the avoidance 
test (N = 19) yielded an r of +.63.⁵ Although
this statistic suggests some degree of
reliability, nothing is known about the relative
distance between values at different
places on the scale. The control sample
employed in the experiment is too small to
make an adequate analysis. Nevertheless, it
is logical that the probability of a positive 
increase in approach lessens the closer the 


4 A live snake varies to some extent in activity,
and this appears to be related to its effectiveness
as a stimulus. In order to determine whether this
factor influenced our results, the experimental
assistant's ratings of the snake's activity during tests
of the control and experimental subjects were
subjected to a t test. No significant difference in snake
activity for the two groups was found.

5 The sample (N = 19) used in estimating the
reliability of the avoidance scale and the other fear
measures includes the members of the control sample
plus the eight subjects of the E1 group. Although
the training period does intervene between the first
and second presentations of the fear measures for
the E1 group, it appears to have no appreciable effect
on the phobia. The E2 subjects could not, of course,
be included in a reliability estimate, as actual
therapy intervenes between their first and second
fear evaluation.


PETER J. LANG AND A. DAVID LAZOVIK


subject is to the phobic object, i.e., movement
from a score of 15 to 12 is more likely or
easier than movement from a scale score of
4 (2 feet away) to a score of 1 (holding a
live snake). Thus, a simple difference score
does not appear to be the best estimate of
change.

The change score used in the following
analysis was the difference between pre- and
posttherapy scale scores divided by the
pretherapy score. For example, a subject who
achieved a scale score of 12 on Test 2 and a
score of 5 on Test 3 was assigned a change
score of .58, the solution to the equation:

$$\text{change score} = \frac{12 - 5}{12}$$

The mean change score for the first two
avoidance tests (N = 19) was only +.03.
This suggests that the score has considerable
stability, and tends to minimize chance
fluctuations. The mean change scores for the
experimental and control subjects from Test
2 to Test 3 may be found in Table 3. Note
that the Mann-Whitney U test of the difference
between groups is significant.
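
The scoring rules for the avoidance test can be written out as a brief sketch. The Python below is illustrative only: the function names and the string labels for outcomes are hypothetical, and only the scale points actually stated in the text (1 for holding, 2 for touching, 3 for the 1-foot mark, 4 for 2 feet, and 19 for refusing to go to the testing room) are encoded. Intermediate distances are assumed to follow the same "feet plus 2" pattern.

```python
def avoidance_scale_score(outcome: str, feet: int = 0) -> int:
    """Map an avoidance-test outcome onto the 19-point scale.

    outcome is one of the hypothetical labels "held", "touched",
    "approached" (with closest distance in whole feet), or "refused_room".
    """
    if outcome == "held":
        return 1
    if outcome == "touched":
        return 2
    if outcome == "refused_room":
        return 19
    if outcome == "approached":
        # Assumption: 1 foot -> 3, 2 feet -> 4, and so on.
        score = feet + 2
        if feet < 1 or score >= 19:
            raise ValueError("distance outside the range described in the text")
        return score
    raise ValueError(f"unknown outcome: {outcome!r}")


def change_score(pre: int, post: int) -> float:
    """Proportional change: (pretherapy - posttherapy) / pretherapy."""
    return (pre - post) / pre


# Worked example from the text: 12 on Test 2 and 5 on Test 3.
print(round(change_score(12, 5), 2))
```

Dividing by the pretherapy score is what lets a move from 4 to 1 count for more than a move from 15 to 12, although both are 3 scale points.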


Fear Thermometer and the FSS Snake Item 


The correlation between the first two tests
for the reliability sample (N = 19) was r =
+.75. The average difference score (obtained
by subtracting the second fear thermometer
score from the first) was only +.63. As in
the case of the avoidance test, no significant
change was associated with the pretherapy
training period. The mean difference score
for the E1 group from Test 1 to Test 2 was
+.38, less than the group mean cited above.

The differences between Test 2 and Test 3
scores for the experimental and control groups
are presented at the top of Table 4. While


TABLE 3

MEAN SNAKE AVOIDANCE SCALE SCORES AT TESTS 2
AND 3, MEAN CHANGE SCORES, AND THE
MANN-WHITNEY U TEST

Group        | Test 2 | Test 3 | Change      | U
Experimental | 5.35   | 4.42   | .34         | 34.5*
Control      | 6.51   | 7.13   | [illegible] |

*p < .05.


DESENSITIZATION OF A PHOBIA 


TABLE 4

MEAN RATING SCALE MEASURES OF PHOBIC BEHAVIOR
BEFORE (TEST 2) AND AFTER (TEST 3)
DESENSITIZATION THERAPY

Fear thermometer

Group        | Test 2 | Test 3 | Difference
Experimental | 7.62   | 5.15   | 2.47
Control      | 6.45   | 5.45   | 1.00

FSS-subject's rating of snake fear

Group        | Test 1a | Test 3 | Difference
Experimental | 6.69    | 5.31   | 1.38
Control      | 6.27    | 5.73   | .54

a The FSS was not administered at Test 2. The difference
score is between a pretherapy interview and Test 3.


the therapy groups show a greater mean 
change than the control subjects, this differ- 
ence did not attain statistical significance 
on the Mann-Whitney U test. The same trend 
and statistical findings were obtained for the 
snake item on the FSS. The experimenter’s 
rating of the subject’s level of anxiety during 
the avoidance test did not differentiate be- 
tween experimental and control groups. In 
this case, the failure to discriminate may be 
attributed to the selection, prior to the experi- 
ment, of a 3-point rating scale. The ex- 
perimenter reported that this measure was 
too gross for the behavior under observation. 


Follow-Up Study 


All subjects who were still available 
(N = 20) were re-evaluated approximately 
6 months after the experiment was com- 
pleted. This included 11 members of the 
original experimental group, 6 of whom 
touched or held the snake at the final avoid- 
ance test. Two of these subjects no longer 
met this criterion 6 months later. However, 
neither subject indicated an increase in
self-rated fear and one actually showed
improvement on this dimension. Furthermore,
because of gains by others, the mean avoidance
test change score for the entire experimental
group indicates a slight reduction in phobic
behavior from Test 3 to the 6-month
follow up.

The therapy group showed even greater
gains on the fear thermometer. The increase
was sufficient that the difference between
experimental and control subjects from Test 2
to the follow up was statistically significant
(U = 16.5, p < .05). Subjects who had
experienced therapy also showed a significant
reduction in their overall estimate of the
intensity of their phobia as measured by the
snake item of the FSS. The change in this
score from pretherapy to the 6-month
follow up was significantly greater for
experimental than control subjects (U = 8.5,
p < .02).


Therapy Terminated and Unterminated 


The design of the current experiment 
arbitrarily limited therapy to 11 sessions. 
This resulted in subjects being tested for 
change at varying points in the therapeutic 
process. Fortunately, in desensitization therapy 
it is possible to define a subject’s degree of 
progress by referring to the number of 
hierarchy items successfully completed. It 
will be recalled that all subjects started with 
a 20-item hierarchy. This represented the 
combined efforts of the therapist and the 
subject to build an equal-interval scale, ex- 
tending from a remote point where the sub- 
ject felt little or no fear to a maximum fear 
involving close contact with the offending 
object. Normally, therapy would be termi- 
nated when the twentieth item had been 
passed. In the present experiment four subjects
achieved this goal. Seven subjects completed
16 or more items and six subjects
completed 14 or fewer items.

All subjects who completed their hierarchies
touched or held the snake at the final avoidance
test. Furthermore, subjects who completed
over 15 items (N = 7) showed significant
improvement on nearly all measures employed
in this experiment; subjects who completed
under 15 items differed little from controls.
Table 5 presents the difference between
the two therapy groups on the snake avoidance
scale and the fear thermometer. Note
that the improvement of the over 15 items
group is significantly greater than that of
subjects completing fewer than 15 items. Similar
results were obtained for the FSS snake
item and they are presented in Table 6. Note

TABLE 5

AVOIDANCE TEST BEHAVIOR CHANGE FROM TEST 2 TO
TEST 3 FOR THERAPY SUBJECTS WHO COMPLETED
MORE THAN 15 HIERARCHY ITEMS, FOR THOSE
WHO COMPLETED LESS THAN 15, AND FOR
THE MANN-WHITNEY U TEST

Snake avoidance scale

Number of hierarchy items
successfully completed    | Test 2 | Test 3 | Change score | U
More than 15a             | 6.71   | 3.93   | .49          |
Less than 15b             | 4.17   | 5.00   | -.07         | 9.5*

Fear thermometer

Number of hierarchy items
successfully completed    | Test 2 | Test 3 | Change score | U
More than 15a             | 7.57   | 4.00   | 3.57         |
Less than 15b             | 7.67   | 6.50   | 1.17         | [illegible]**

Note. All scores are mean values.
a N = 7.
b N = 6.
*p < .08.
**p < .03.


in this same table that the mean rank of the 
FSS also shows a significantly greater reduc- 
tion in the over 15 items group, than in the 
group completing fewer items. This finding 
suggests that the elimination of snake phobic 
behavior does not initiate an increase in other 
fears, but in fact leads to a significant reduc- 
tion in overall anxiety. 


TABLE 6

CHANGES IN THE FEAR SURVEY SCHEDULE (FSS) FOLLOWING
DESENSITIZATION THERAPY FOR SUBJECTS
WHO COMPLETED MORE THAN 15 HIERARCHY
ITEMS, FOR THOSE WHO COMPLETED LESS
THAN 15, AND FOR THE MANN-WHITNEY U TEST

Fear survey schedule

Number of hierarchy items
successfully completed    | Pretherapy | Posttherapy | Difference | U
More than 15a             | 2.34       | 1.85        | .49        |
Less than 15b             | 3.21       | 3.20        | .01        | 4.5*

FSS-subject's rating of snake fear

Number of hierarchy items
successfully completed    | Pretherapy | Posttherapy | Difference | U
More than 15a             | 6.71       | 4.14        | 2.57       |
Less than 15b             | 6.67       | 6.67        | .00        | 3.0**

Note. All scores are mean ranks or mean rank differences.
a N = 7.
b N = 6.
*p < .02.
**p < .01.


DISCUSSION


The results of the present experiment 
demonstrate that the experimental analogue 
of desensitization therapy effectively reduces 
phobic behavior. Both subjective rating of 
fear and overt avoidance behavior were modi- 
fied, and gains were maintained or increased 
at the 6-month follow up. The results of 
objective measures were in turn supported 
by extensive interview material. Close ques- 
tioning could not persuade any of the experi- 
mental subjects that a desire to please the 
experimenter had been a significant factor in 
their change. Furthermore, in none of these 
interviews was there any evidence that other 
symptoms appeared to replace the phobic 
behavior. 

The fact that no significant change was 
associated with the pretherapy training argues 
that hypnosis and general muscle relaxation 
were not in themselves vehicles of change. 
Similarly, the basic suggestibility of the sub- 
jects must be excluded. The difference be- 
tween the SHSS Form A scores of the ex- 
perimental and control groups did not ap- 
proach statistical significance (U = 58). 
Clearly, the responsibility for the reduction in 
phobic behavior must be assigned to the 
desensitization process itself. This is evi- 
denced not only by the differences between 
experimental and control subjects but also 
by the relationship within the experimental 
groups between degree of change and the 
number of hierarchy items successfully 
completed. 

One must still raise the question, how- 
ever, why desensitization therapy could be 
accomplished in 11 sessions with some sub- 
jects and barely gotten underway with others. 
The intensity of the phobia is obviously not 
a relevant factor. The mean avoidance Test 2 
score is actually higher for the experimental 
subjects who completed more than 15 items 
than for those who completed less (see Table 


6 While these findings indicate that hypnotizing
subjects or training them in muscle relaxation are
not effective independent of desensitization, we do
not yet know if they are a necessary part of the
desensitization process itself. Research currently
underway, in which these procedures are included
or omitted in different therapy groups, is designed to
answer this important question.

5). The base FSS snake item rank and
the fear thermometer scores are almost exactly
the same in both groups. On the other
hand, a negative relationship (r = -.58)
exists between the total FSS score at the first
testing and the number of hierarchy items
completed by individual members of the
experimental group. The FSS is in turn
positively related to the Taylor (1953) Manifest
Anxiety (MA) scale (r = +.80 for the
experimental group). Thus, the degree of
progress attained in therapy in a constant period
of time (11 sessions) appears to be a func- 
tion of generalized anxiety, as measured by 
both the MA scale and FSS. These data sug- 
gest that desensitization therapy is more dif- 
ficult, or at least slower, when many stimuli 
in the subject’s environment are capable of 
eliciting anxiety responses. This is of course 
consistent with the clinical findings of Wolpe 
(1958) and the prediction of a learning 
theory model. 
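
The correlations cited here are product-moment coefficients, which can be computed as in the following sketch. This is the standard Pearson formula rather than anything specific to the present study; the function name is an assumption, and the paired values are invented solely to show a negative relationship between an anxiety score and items completed.

```python
import math
from typing import Sequence


def pearson_r(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson product-moment correlation of two equal-length samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squares of deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Invented pairs: total fear score vs. hierarchy items completed.
fear_total = [90, 120, 150, 180, 210]
items_done = [20, 18, 17, 12, 9]
print(round(pearson_r(fear_total, items_done), 2))
```

A negative coefficient of this kind indicates that subjects with higher generalized fear scores tended to complete fewer hierarchy items in the fixed number of sessions.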

The present experiment also reveals an 
interesting connection between changes in 
overt avoidance behavior and the subject's 
verbal report. The relationship between these 
two dimensions is generally positive. However,
even when precisely the same event is
being evaluated, it is sometimes surprisingly
low (Test 3 avoidance scale and fear thermometer
r = +.40). Furthermore, initial
changes in phobic behavior seem to occur in
either one dimension or the other, rather
than in both simultaneously. Most frequently
subjective report lags behind overt behavior.
Thus, avoidance test scores differentiated between
experimental and control subjects immediately
following the experiment, but it
was not until the follow-up interview that
the subjective scales yielded the same finding.
It will be interesting to observe in future
studies if this pattern continues, and to
what extent it is characteristic of any reduction
in phobic behavior, or simply a function
of the desensitization technique.

The question of whether learning theory, 
specifically counterconditioning, best explains 
the desensitization process is not completely 
answerable by the present investigation. 
Certainly the theory is consistent with the 
results, and some of the other possible
explanations have been eliminated. However,
further research, particularly the direct
measurement of changes in muscular tension during
the presentation of hierarchy items, is
necessary to an evaluation of theory.

But of the greatest immediate interest are 
the implications of the present research for 
traditional theories of clinical practice. The 
findings suggest the following important 
conclusions: 


1. It is not necessary to explore with a 
subject the factors contributing to the learn- 
ing of a phobia or its “unconscious mean- 
ing” in order to eliminate the fear behavior. 

2. The form of treatment employed here
does not lead to symptom substitution or
create new disturbances of behavior.

3. In reducing phobic behavior it is not
necessary to change basic attitudes, values,
or attempt to modify the “personality as a
whole.” The unlearning of phobic behavior
appears to be analogous to the elimination
of other responses from a subject’s behavior
repertoire.


EFFECT OF HUNGER ON NONNUTRITIVE SUCKING
IN INFANT RHESUS MONKEYS¹


LORNA S. BENJAMIN² AND WILLIAM A. MASON³


University of Wisconsin 


In Experiment I the thumbsucking of 5 Ss aged 90-180 days was measured at 
0, 2.5, 5, 7.5, and 10 hr. after feeding. In Experiment II these same Ss, now 
between ages 180 and 270 days, were observed at intervals of 0, 37.5, 75, 112.5, 
250 min, and 10 hr. after feeding. In Experiment III 6 additional monkeys 
90-180 days of age were observed at the 1st 5 intervals of Experiment II. 
In all experiments thumbsucking increased with hunger. In Experiment II 
there was no significant increase in sucking during the intervals between 
0 and 250 min. Experiment III results showed a significant increase during 
this period, suggesting that the effectiveness of specific deprivation intervals
decreases with age.


Nonnutritive sucking has been the subject 
of clinical and experimental concern (Benja- 
min, 1961a, 1961b; Davis, Sears, Miller, & 
Brodbeck, 1948; Levy, 1934; McKee & 
Honzik, in press; Ross, Fisher, & King, 1957; 
Sears & Wise, 1950). There are some ex- 
perimental data relating hunger to non- 
nutritive sucking. Kunst (1948) observed 
that infants 1-20 weeks of age, living in an 
institutional ward, began to show increased 
thumbsucking within 1.5-2 hours after they 
were fed. Scott, Ross, and Fisher (1959) 
found that unweaned puppies showed more 
sucking of the experimenter's finger after 
they had been separated from the mother 
for from 4-16 hours than they did just before 
separation. The effect was observed in puppies 
up to 3 weeks of age and did not appear in 
older groups. 


1 The first experiment of the present investigation 
is based on a thesis submitted by the senior author 
in partial fulfillment of the requirements for the 
degree of Master of Arts at the University of Wis- 
consin, January 1958. W. A. Mason supervised the 
thesis. The entire investigation was supported by
H. F. Harlow under grants from the Ford Founda- 
tion, the National Institutes of Health, United States 
Public Health Service (Grant M-722), and the Gradu- 
ate School of the University of Wisconsin. During 
the preparation of this manuscript, the senior author 
was supported by a postdoctoral fellowship in Psy- 
chology from the National Institutes of Health, 
United States Public Health Service (Grant MF-8915). 

2 Now Research Associate in the Wisconsin Psychiatric
Institute, University Hospitals, Madison,
Wisconsin.

3 Now at the Yerkes Laboratories of Primate
Biology, Orange Park, Florida.

In contrasting fashion, James (1957) found
that puppies that had food injected directly 
into their stomachs showed no more sucking 
when returned to the mother than unfed 
puppies which had been away from her for a 
similar length of time. He concluded that 
“. . . the sucking response bears little, if any,
relation to milk intake [p. 377].” 

Several factors, over and above species 
differences, may have contributed to these 
varied findings on the relationship of hunger 
to sucking. The failure of hunger to increase 
sucking in James’ experiment may be related 
to the use of the nutritive rather than the 
nonnutritive sucking response. An alternate 
possibility is that emotional disturbance in- 
duced by a forced feeding procedure produced 
an increase in sucking which obscured the 
differences between fed and hungry subjects 
(Adler, Linn, & Moore, 1957; Smith, 1960; 
Tinklepaugh & Hartman, 1932). Emotional 
disturbance produced by a period of enforced 
separation from the mother may also have 
contributed to the increased level of non- 
nutritive sucking obtained by Scott et al. 
(1959; Benjamin, 1961b; Smith, 1960). 
Kunst’s results may have been similarly af- 
fected by variables not directly associated 
with hunger. For example, midway between
the regular 4-hour feedings the younger infants
of that experiment sometimes received an 
extra feeding. We have observed that at feed- 
ing time there is a general increase in excite- 
ment among young monkeys which is ex- 
pressed in heightened motor activity, vocali-
zations, and increased nonnutritive sucking. A
similar increase in excitement occurring in the 
ward during the extra feeding might have 
contributed to the rise in nonnutritive sucking 
obtained by Kunst. In summary, the relation- 
ship of food deprivation to the frequency of 
nonnutritive sucking remains problematic. 
The purpose of the present experiments was 
to determine the effect of hunger on non- 
nutritive sucking under better controlled 
conditions than earlier studies afforded. The 
subjects were infant rhesus monkeys, removed 
from their mothers at birth. When raised 
under standard laboratory conditions a large 
number of these primates “spontaneously” 
develop digit sucking in a manner resembling
human patterns of nonnutritive sucking.


METHOD 
Subjects 


Five subjects (4 females and 1 male) were 
selected from a group of 10 on the criterion of 
showing nonnutritive sucking within the first 10 
days of observation in Experiment I. They had 
been separated from their mothers within 10 hours 
after birth, and maintained in the laboratory nursery 
in individual cages, 15 × 18 × 24 inches.
The monkeys were bottle-fed from birth until they 
reached approximately 70 days of age, when they 
were changed to cup feeding. Starting at 60 days, 
they were given supplemental solid foods. They were 
fed milk formula at 9:30 A.M. and 8:00 P.M., and
solid foods at 5:00 P.M. Unconsumed solid foods were
removed from the cage just before the evening 
formula feeding. Additional bits of food were offered 
as rewards in other laboratory tests. The subjects 
were approximately 90 days old at the start of
Experiment I and 180 days old when it ended. The 
same subjects participated in Experiment II during 
the period from 180 to 270 days of age. 

Six additional monkeys (4 females and 2 males)
similar to the first group in laboratory and feeding 
history participated in Experiment III during the 
period from 90 to 180 days of age. 


Apparatus 


To insure uniform conditions of observation, the 
subjects were observed in a cylindrical Plexiglas ob-
servation cage, 31 inches in diameter and 23 inches
high, enclosed within a Masonite chamber equipped
with one-way vision windows permitting all parts
of the cage to be seen (Benjamin, 1962).

A recycling cam timer indicated 10-second observa-
tion periods by an audio-visual signal. 


Procedure 


For Experiment I, levels of hunger were defined 
by 0-, 2.5-, 5-, 7.5-, and 10-hour intervals after 
the 8:00 P.M. formula feeding, at which time the
subjects had been allowed to drink to satiation. 
Evening hours were selected since at this time there 
were no other feeding or research activities for at 
least 10 hours. All individuals were observed once 
each day at one of the deprivation intervals. The 
deprivation interval changed from day to day and 
the order in which the five deprivation levels ap- 
peared within each 5-day week was varied according 
to a 5 × 5 Latin square design. In the first week,
for example, the subjects were observed at 8:00 P.M.
on Monday, 1:00 A.M. on Tuesday, 6:00 A.M. on
Wednesday, 3:30 A.M. on Thursday, and 10:30 P.M.
on Friday. The daily orders for the next 4 weeks 
followed the rules of the Latin square. Beginning with
the sixth week, the entire sequence was repeated, 
with the deprivation intervals in the same order 
as the first week. This continued until each subject 
had been observed 50 times, a total of 10 times at 
each of the five deprivation intervals. 
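
The weekly ordering described above can be sketched in a few lines. The first week's order (0, 5, 10, 7.5, and 2.5 hours) is inferred from the clock times given, reading the Friday session as the 2.5-hour interval. The paper does not report the remaining rows of the square, so the cyclic construction below is an assumption, and the names `FIRST_WEEK`, `cyclic_latin_square`, and `is_latin_square` are illustrative only.

```python
# Hours of deprivation for Monday through Friday of the first week
# (inferred from the clock times in the text; the other weeks are assumed).
FIRST_WEEK = [0.0, 5.0, 10.0, 7.5, 2.5]


def cyclic_latin_square(first_row):
    """Build an n x n Latin square by rotating the first row one step per week."""
    n = len(first_row)
    return [[first_row[(day + week) % n] for day in range(n)] for week in range(n)]


def is_latin_square(square):
    """Check that every level appears exactly once in each row and each column."""
    n = len(square)
    symbols = set(square[0])
    rows_ok = all(len(row) == n and set(row) == symbols for row in square)
    cols_ok = all({square[w][d] for w in range(n)} == symbols for d in range(n))
    return len(symbols) == n and rows_ok and cols_ok


schedule = cyclic_latin_square(FIRST_WEEK)

# Two passes through the 5-week square give the 50 sessions per subject,
# 10 at each deprivation interval.
ten_weeks = schedule + schedule
```

Any valid 5 × 5 square would satisfy the balance described in the text; the cyclic form is used here only because it is the simplest to generate.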

To facilitate adaptation to the experimenter, each 
test day at some time between 6:00 A.M. and 4:00
P.M., the infants were handled and offered bits of 
preferred food. 

A test session began when the experimenter entered 
the nursery at the appropriate time and recorded 
the percentage of subjects with their eyes closed, 
indicating they were asleep. This measure was taken 
because it was felt that sleeping might be related to 
thumbsucking. The subject was selected according to 
a predetermined counterbalanced order, and taken to 
the observation chamber. After gently placing the 
monkey in the Plexiglas cage, the tester stood with 
one arm in the cage, allowing the monkey to hold 
his hand or be petted. Following this adaptation 
period, the cage was rolled into the chamber and 
the door closed. Two minutes later observation began. 

The 250-second observation period was divided 
into 25 10-second intervals indicated by the timer. 
Nonnutritive sucking was defined as those instances 
when the animal closed its lips around any part of 
the body for 1 second or more. A check was made 
at the appropriate interval on a time-ruled observa- 
tion sheet if nonnutritive sucking occurred during 
a 10-second period. An individual's score for an
observation period was the number of 10-second 
intervals out of a possible 25 during which non- 
nutritive sucking was observed. To measure other 
oral behavior which might interfere with non- 
nutritive sucking, a similar score was obtained for 
nonsucking oral activity, defined as biting or licking 
any part of the body or environment. 
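
The scoring rule above amounts to counting checked intervals. The sketch below shows that count together with an arc-sine transformation of the kind applied to the scores in the Results. The paper does not say which variant of the transformation was used; the common angular form, arcsin of the square root of the proportion, expressed in degrees, is assumed here, and the function names are illustrative.

```python
import math

N_INTERVALS = 25  # a 250-second session divided into 10-second intervals


def sucking_score(checks):
    """Count the 10-second intervals in which the behavior was checked."""
    if len(checks) != N_INTERVALS:
        raise ValueError("expected one entry per 10-second interval")
    return sum(1 for checked in checks if checked)


def arcsine_transform(score, n=N_INTERVALS):
    """Angular transform of the proportion score / n, in degrees (assumed variant)."""
    return math.degrees(math.asin(math.sqrt(score / n)))


# Hypothetical session: sucking observed in the first 10 intervals only.
example_checks = [True] * 10 + [False] * 15
example_score = sucking_score(example_checks)
example_angle = arcsine_transform(example_score)
```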

After the observation, the experimenter rated the 
overall activity of the animal during the session 
on a 5-point scale ranging from “asleep” to “very
active.” This measure was taken because motor
activity has been said to be inversely related to 
amount of thumbsucking in infants (Spock, 1957). 

The reliability of the observations was checked 
every fifth session by two observers working inde- 
pendently. A product-moment correlation for total 
sucking scores recorded by the two observers yielded 
an r of .97. 
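
For readers who want to reproduce this kind of reliability check, a minimal sketch of the product-moment correlation follows. The observer scores shown are hypothetical and are not the study's data; `pearson_r` is an illustrative name.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical sucking scores (0-25) from two observers watching the same sessions.
observer_1 = [3, 10, 14, 20, 25]
observer_2 = [4, 10, 13, 21, 25]
agreement = pearson_r(observer_1, observer_2)
```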
The procedure for Experiment II was the same
as that for Experiment I, except that deprivation 
intervals were 0, 37.5, 75, 112.5, and 150 minutes. 
These were presented in the counterbalanced orders 
prescribed by the Latin square of the first experi- 
ment. In addition, on five successive days after the 
end of the Latin square sequences, observations were 
made 10 hours after feeding, at 6:00 A.M. 

The procedure for Experiment III was identical to 
that of Experiment II, except that no observations 
were made 10 hours after feeding. 


RESULTS 
Experiment I 


The monkeys showed progressively more 
nonnutritive sucking at the 0-, 2.5-, 5-, and 
7.5-hour deprivation intervals. The maximum 
reached by 7.5 hours was maintained through 
the 10-hour interval. These relations are 
shown in Figure 1 and the differences among 
the deprivation intervals were significant at 
the .01 level according to an analysis of 
variance of the arc-sine transformed scores 
(Table 1). 

Inspection of Figure 1 suggests that the 
greatest increase in nonnutritive sucking oc-
curred by 2.5 hours after feeding. At this
time, and at all subsequent intervals (5, 7.5,
and 10 hours), nonnutritive sucking was
significantly greater than at 0 hour. Proba-
bility values for these differences were less
than .01 according to the Duncan (1955) 
range test. Sucking also increased to a lesser, 
but significant, extent (p = .05) between 2.5 
and 7.5 hours. No other differences between 
intervals were significant. 


[Figure 1: mean sucking score (vertical axis) plotted against hours of deprivation (horizontal axis: 0, 2.5, 5, 7.5, 10).]

FIG. 1. Nonnutritive sucking as a function of
deprivation interval. (From 0 to 7.5 hours after
feeding, there is a progressive increase in the mean
nonnutritive sucking scores of these 90-180 day old
rhesus monkeys.)


TABLE 1
ANALYSIS OF VARIANCE

Source                        df          MS        F
Sequences                    (49)
  Deprivation Level (A)         4    5,710.71   18.59*
  Subsequence (B)               9    1,219.98    3.97*
  A × B                        36      360.37    1.17
Subjects (C)                    4   14,369.09   46.77*
C × A                          16      386.68    1.26
C × B                          36      715.28    2.33*
C × A × B                     144      307.21

Note.—Arc-sine transformed data.
* p < .01.
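
The F ratios in Table 1 can be checked directly from the tabled mean squares. The sketch below assumes that the C × A × B mean square served as the error term for every effect; the paper does not say so explicitly, but the assumption reproduces each tabled F to two decimals. The dictionary and variable names are illustrative.

```python
ERROR_MS = 307.21  # C x A x B mean square, assumed to be the error term
ERROR_DF = 144

# Mean squares and degrees of freedom as printed in Table 1.
effects = {
    "Deprivation Level (A)": (4, 5710.71),
    "Subsequence (B)": (9, 1219.98),
    "A x B": (36, 360.37),
    "Subjects (C)": (4, 14369.09),
    "C x A": (16, 386.68),
    "C x B": (36, 715.28),
}

# F ratio for each effect: its mean square divided by the error mean square.
f_ratios = {name: round(ms / ERROR_MS, 2) for name, (df, ms) in effects.items()}

# The three within-sequence terms should account for the 49 sequence df,
# and the error df should equal 4 (subjects) x 4 (levels) x 9 (subsequences).
sequence_df = sum(effects[name][0] for name in ("Deprivation Level (A)", "Subsequence (B)", "A x B"))
```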


In contrast with the increase in nonnutri-
tive sucking observed over successive depri-
vation intervals, nonsucking oral activity
declined, and the average scores for successive
intervals were 3.0, 2.7, 1.8, 1.2, 1.7. These
differences among intervals were significant
at the .01 level according to an analysis of
variance of arc-sine transformed scores. The
reciprocal relation obtaining between non- 
nutritive sucking and nonsucking oral activity 
has been observed and reported previously 
(Benjamin, 1961a). 

Motor activity in the observation cage and 
the percentage of subjects found asleep in 
the living cage both increased throughout the 
night, and the mean scores for these measures 
are shown in Figure 2. In contrast with non- 
nutritive sucking which showed a substantial 
increase within 2.5 hours after feeding, neither 
of these activities increased appreciably until 
5 hours after feeding. This observation was 
confirmed by the failure to find significant 
FIG. 2. Percentage of monkeys found asleep in their
home cages and their mean motor activity scores
in the observation chamber, as a function of time
elapsed since the 8:00 P.M. feeding. (The first sub-
stantial increase in both these measures was observed
5 hours after evening feeding.)
TABLE 2
COMPARISON OF MOTOR ACTIVITY AT VARIOUS
DEPRIVATION INTERVALS

Intervalᵃ    >0     >2.5    >5     >7.5    >10
<0                  ns      .04    .01     .00
<2.5                        ns     .03     .02
<5                                 ns      ns
<7.5                                       ns

Note.—Cell values indicate the probability that the motor
activity at the deprivation interval indicated by the column
heading was greater than motor activity at the interval indi-
cated on the row. Entries on the right side of the diagonal in-
dicate increases in activity with increased deprivation; entries
on the left side of the diagonal indicate a decrease in activity.
Probabilities are based on a sign test comparing Individual a's
score at Interval i with his score at Interval j during Week k,
and summing over weeks and individuals.

ᵃ In hours.


differences between 0 hour and 2.5 hours for
either motor activity, or preobservation sleep- 
ing behavior, as determined by the sign test 
and the Mann-Whitney U test, respectively. 
For both measures, differences between the 0- 
and 5-hour intervals were significant (p = .04 
for motor activity, and p = .00 for percentage 
of subjects found asleep). The outcomes of 
the statistical comparisons between the various 
intervals are presented in Tables 2 and 3. 
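
The sign test behind Table 2 can be sketched as follows. The paper does not say whether exact binomial probabilities or an approximation were used; the exact one-sided form is assumed here, with tied pairs dropped. The function name and the example differences are hypothetical.

```python
from math import comb


def sign_test_one_sided(differences):
    """Exact probability of at least the observed number of positive signs.

    Each difference is (score at the later interval) minus (score at the
    earlier interval) for one individual in one week. Zero differences
    (ties) are dropped, and the null hypothesis is that positive and
    negative signs are equally likely.
    """
    signs = [d for d in differences if d != 0]
    n = len(signs)
    positives = sum(1 for d in signs if d > 0)
    return sum(comb(n, i) for i in range(positives, n + 1)) / 2 ** n


# Hypothetical activity-rating differences for paired observations.
example_p = sign_test_one_sided([1, 1, 1, -1])
```

A small probability indicates that the later interval's ratings exceeded the earlier interval's more often than chance would predict, which is how the cells to the right of the diagonal in Table 2 are read.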

In spite of the increase observed for both 
nonnutritive sucking and motor activity over 
successive deprivation intervals, these meas- 
ures appear to be reciprocally related, and 
individuals obtaining the largest nonnutritive 
sucking scores tended to be rated low in
motor activity. Kendall's tau between indi-
vidual motor activity and individual non-
nutritive sucking scores was -.40, but this
trend did not reach statistical significance. 
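
A short sketch may help in reading the size of this coefficient. If tau was computed over the five subjects with no tied ranks (both are assumptions, since the paper does not describe the computation), a value of -.40 corresponds to 3 concordant and 7 discordant pairs out of 10. The rankings below are hypothetical and were chosen only to reproduce that count.

```python
def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    concordant = discordant = 0
    for i in range(n):
        for j in range(i + 1, n):
            product = (x[i] - x[j]) * (y[i] - y[j])
            if product > 0:
                concordant += 1
            elif product < 0:
                discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical ranks for five animals: motor activity vs. nonnutritive sucking.
activity_rank = [1, 2, 3, 4, 5]
sucking_rank = [3, 5, 2, 4, 1]
tau = kendall_tau_a(activity_rank, sucking_rank)
```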


TABLE 3
COMPARISON OF PERCENTAGE OF ANIMALS ASLEEP
BEFORE OBSERVATION AT THE VARIOUS
DEPRIVATION INTERVALS

Intervalᵃ    >0     >2.5    >5     >7.5    >10
<0                  ns      .00    .0      .00
<2.5                        .01    .01     .01
<5                                 ns      ns
<7.5                                       ns
<10

Note.—This table is constructed exactly as Table 2, except
probabilities are based on the Mann-Whitney U test. The
ranking procedure of that test was applied to the percentage of
animals found asleep before observation at Interval i for all
weeks compared to the percentage found asleep before Interval
j for all weeks.
ᵃ In hours.
Experiment II


The largest increase in nonnutritive sucking 
during the first 2.5 hours after feeding which 
was observed in animals 90-180 days of age 
was not found when these subjects were be- 
tween 180 and 270 days of age. Mean sucking 
scores were 12.0, 10.8, 14.3, 13.0, and 13.1 
at the 0-, 37.5-, 75-, 112.5-, and 150-minute 
intervals, respectively. The differences were 
not significant according to analysis of vari- 
ance. After 10 hours of deprivation, however, 
a significant increase in nonnutritive sucking 
was obtained (p = .01 by t test between each
subject’s total sucking score for the five 
observations at 10 hours deprivation and its 
total score for the last five observations at 
0 hour deprivation). 
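
The comparison described here pairs each subject's 10-hour total with the same subject's 0-hour total, which suggests a t test on paired differences. That reading is an assumption, since the paper does not name the form of the test. The sketch below computes the statistic for hypothetical totals; `paired_t` is an illustrative name.

```python
import math


def paired_t(later, earlier):
    """t statistic and degrees of freedom for paired differences (later - earlier)."""
    if len(later) != len(earlier) or len(later) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    diffs = [a - b for a, b in zip(later, earlier)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    variance = sum((d - mean_diff) ** 2 for d in diffs) / (n - 1)
    return mean_diff / math.sqrt(variance / n), n - 1


# Hypothetical per-subject totals over five observations at each interval.
totals_10_hour = [70, 85, 90, 66, 102]
totals_0_hour = [55, 62, 81, 49, 80]
t_value, df = paired_t(totals_10_hour, totals_0_hour)
```

With five subjects the test has 4 degrees of freedom, so only a fairly consistent increase across animals would reach the reported significance level.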


Experiment III 


As in Experiment I, the nonnutritive suck- 
ing behavior of the 90-180 day old monkeys 
increased significantly within the first 2.5 
hours after feeding. The mean sucking scores 
were 11.5, 18.7, 22.2, 22.0, and 22.8 at the 0-, 
37.5-, 75-, 112.5-, and 150-minute intervals, 
respectively. Analysis of variance of arc-sine 
transformed scores showed these differences to 
be significant at the .01 level. The results 
of a Duncan range test suggest that sucking 
increased significantly as soon as 37.5 minutes 
after feeding, and that all of the significant 
increases over this 2.5-hour range took place 
within the first 75 minutes. 


DISCUSSION 


Our results indicate that hunger defined in 
terms of deprivation intervals increases non- 
nutritive sucking in young monkeys. This 
effect may become apparent in younger 
monkeys within 37.5 minutes after feeding, 
but it decreases with age. The nonnutritive 
sucking of the 90-180 day old monkeys in 
Experiment I increased progressively from 0 
to 7.5 hours after feeding and the effect was 
observed at a statistically significant level 
within 2.5 hours after feeding. In Experiment 
II, when these same subjects were 90 days
older there was no significant increase in non- 
nutritive sucking within the first 2.5 hours 
after feeding. At this age, however, there was 
a significant increase in nonnutritive sucking 
10 hours after feeding. These results sug-
gested that a larger deprivation interval was
required to produce an increase in nonnutri-
tive sucking in older animals. The results of
Experiment III on a new group of mon-
keys, aged 90-180 days, confirmed the find-
ings of the first experiment. Again, younger
monkeys showed a significant increase in non-
nutritive sucking within the first 2.5 hours
after feeding and significant increases were
observed as soon as 37.5 minutes after feeding.

The results are consistent with Kunst’s 
(1948) finding that human nonnutritive 
sucking increases with hunger, and they are 
in agreement with Scott et al.'s (1959) findings
that hungrier puppies showed increased suck- 
ing of the experimenter’s finger and that this 
effect decreased with age. Thus there is agree- 
ment with both human and puppy data. It 
is not clear whether the apparent disagree- 
ment with James’ (1957) conclusion that 
sucking in puppies is independent of hunger 
is related to differences in procedure, or to 
his use of the nutritive instead of the non- 
nutritive sucking response. 

The obtained relationship between depriva- 
tion and nonnutritive sucking did not appear 
to be a simple artifact of motor activity or 
of sleeping behavior. This conclusion is based 
primarily on the fact that changes in levels 
of nonnutritive sucking did not parallel 
changes in either motor activity, or in per-
centage of animals found asleep. The mon-
keys’ nonnutritive sucking showed a large
increase within 2.5 hours after feeding, 
whereas their motor activity during observa- 
tion, and their sleeping behavior in the living 
cage, did not show significant changes until 
5 hours after feeding. 

Other evidence supports the conclusion that 
the obtained increases in nonnutritive sucking 
were not artifactual. An increase in the per- 
centage of animals found asleep in the living 
cage through the night was expected to 
indicate increased likelihood that the subjects 
would fall asleep during observation with 
digits in their mouths, thereby increasing their 
nonnutritive sucking scores. Actually, how-
ever, the obtained increases in motor activity
scores showed there was less, rather than
more, sleeping behavior exhibited during
observation at successive deprivation intervals.


Further, the finding of a negative, although 
not significant, correlation between individual 
motor activity and nonnutritive sucking scores 
suggests that nonnutritive sucking increases 
with successive deprivation intervals in spite 
of, not because of, increased motor activity. 
This result is in agreement with the clinical 
finding that motor activity and thumbsucking 
are inversely related (Spock, 1957), and is 
consistent with previously reported results for 
rhesus monkeys (Benjamin, 1961a). 

The present investigation is a part of a se-
ries of experiments which indicate that non-
nutritive sucking is influenced by many vari-
ables whose effects change with age (Ben-
jamin, 1961a, 1961b). Examples of such
variables include feeding experience, sex, type
of mother surrogate, and frustration. A pro-
posal has been made (Benjamin, 1961a) that
oral responsiveness is universally present in 
infant monkeys, and is only partly depend- 
ent on feeding experience. Oral responsiveness 
may appear either as nonnutritive sucking or 
as nonsucking oral activity, and it is affected 
by many variables. These variables do not 
have irreversible effects, and they, as well as 
the general level of oral responsiveness, are 
highly subject to change with age. In short, 
nonnutritive sucking in the rhesus monkey 
may begin for one reason, and be sustained 
for another; any theory which attempts to 
explain the behavior at a given time must 
consider the multivariate nature of thumb- 
sucking. 
EMPATHY, SELF-ESTEEM, AND BIRTH ORDER¹


EZRA STOTLAND AND ROBERT E. DUNN
University of Washington 


An experiment to measure empathy following induced identification. Empathy
is defined as sharing the presumed feelings of someone with whom you have
similarity. ½ of 132 lower division college students were arbitrarily assigned
an attribute of nonfunctional similarity (having worked on similar tasks) with
a paid model. The other ½ were treated differently only in this respect.
½ the Ss under each of these conditions were exposed to the model's public
success, the other ½ to the model's public failure. Simultaneously, a 2nd
measure of palmar sweat was taken. Retrospective reports of anxiety were
then obtained. Anticipated differential palmar sweating among the 4 conditions
was not observed. Self-reports indicated empathy when the model was
similar, p < .005.


This study is one of a series of experimental 
studies of identification (Burnstein, Stotland, 
& Zander, 1961; Stotland & Dunn, 1962; 
Stotland & Hillmer, 1962; Stotland & Patchen, 
1961; Stotland, Zander, & Natsoulas, 1961). 
The findings of these studies support the gen- 
eral hypothesis that if an individual perceives 
some attribute in common between himself 
and a model, he will tend to perceive him- 
self as having other of the model’s attributes. 
The process indicated by this hypothesis is 
termed identification. Furthermore, the Stot-
land and Hillmer and Stotland and Dunn
studies showed that identification occurs even 
when the attribute originally perceived in com- 
mon between the individual and the model has 
been arbitrarily imposed on both of them, 
so that there is no “logical” reason for the 
individual’s perceiving himself as having 
another attribute similar to the model's. In 
this series of studies, it is not assumed that 
the person identifies with the model because 
of the latter’s exemplary qualities. Identifica- 
tion is assumed to occur on the basis of 
similarity alone. 

In the previous studies, the attributes the 
individuals changed or acquired through 
identification were: preferences on a variety 
of materials, scores on a personality test, and 
self-evaluations of ability on a task. In the 
Stotland and Dunn (1962) study, the subjects 
also performed a clerical task at levels of
performance consistent with the identification
model's level. In the present study, the
problem was whether an individual's level of
anxiety would also be influenced by the level
of anxiety he perceives in an identification
model. Empathy is defined herein as the
special case of identification in which the
identifier acquires the model’s level of
anxiety (or any other emotion). The first
hypothesis, then, is as follows: If an individual
has perceived some attribute in common be-
tween himself and another person, i.e., the
model, the individual will manifest more
anxiety when the model is placed in an ap-
parently anxiety provoking situation than in
a nonanxiety provoking situation. In the
present study, anxiety was measured by
changes in palmar sweating and by self-
reports of the subjects.

¹The work on the study was financed by a
grant (M-2423) from the National Institutes of
Health, United States Public Health Service.

In the Stotland and Dunn (1962) study, 
later born children were found to identify 
more with other people than first and only 
borns. In the present study, the second hy- 
pothesis is that later borns will empathize 
more than first and only borns. In the Stotland 
and Hillmer (1962) and Stotland and Dunn 
studies, persons low in self-esteem were found
to identify more than persons high in self-
esteem. The third hypothesis, then, is that
persons low in self-esteem will empathize more
than persons high in self-esteem. In the Stotland
and Patchen (1961), Stotland and Hillmer, 
and Stotland and Dunn studies, persons low 
in authoritarianism were found to identify 
more than those high in authoritarianism. 
The fourth hypothesis is that persons low in
authoritarianism will empathize more than 
those high in authoritarianism. 


METHOD 


Subjects. The subjects were 132 freshmen and 
sophomores, male and female, recruited from several 
courses in introductory psychology at the University 
of Washington. The subjects participated in three 
sessions of research, two for personality testing 
and one for the present experiment. Because of 
the requirement of three sessions, they were paid 
$1.25 per session. 

Experimental sessions. A minimum of 9 and a 
maximum of 12 subjects of the same sex were 
present at each session. When they arrived in the 
experimental room, they were seated at four 7-
or 8-foot long tables, 3 to a table. The tables were
arranged, one behind the other, with all the sub- 
jects facing the front of the room. One of the 
subjects at each session was a hired and trained 
subject who was later to serve as the identification 
model. He, or she, usually sat in the second or 
third row. The purpose of the first part of the 
session was to arbitrarily assign to each subject 
an attribute which could be used later as a basis 
for identification. This attribute consisted of having 
worked on the same set of tasks in this part of 
the experiment. These tasks were contained in book- 
lets the experimenter passed out among them. After 
the experimenter passed out the booklets, he told 
the subjects that one of the booklets was stamped 
with the phrase, “Watson-Glazer,” and that the
person who happened to receive it would be called 
on later. 

Actually, no subject received a booklet marked 
Watson-Glazer, but the paid model pretended to 
receive one. The experimenter then continued, 
reading aloud the instructions on the first page of 
the booklets he had passed out. These instructions 
and the tasks in the booklets are identical to those 
used in the Stotland and Hillmer (1962) and 
Stotland and Dunn (1962) studies. The instruc- 
tions indicated that the subjects were to work on 
a series of tasks to help the psychology department 
evaluate the interest value of the tasks for further 
experimentation. The subjects were told that there 
was a wide variety of booklets containing different 
assortments of tasks, and that the particular booklet 
a person received was purely a matter of his 
accidental choice of a particular seat in the room. 

The subjects were then admonished not to look at 
their neighbor’s booklet. Actually, all of the booklets 
contained the same set of tasks, but their appearance
was varied from booklet to booklet by varying the
color of paper, the arrangement on the page, size
of type, etc. The tasks, described more fully in the
previous studies, were finding causes, finding
antonyms, letter-number substitution, and word
building.


When all of the subjects had finished the task 
(around 20 minutes), the experimenter asked the 
person who received the booklet marked Watson-
Glazer to step forward and sit in a chair in the
area in front of the tables. This person could be 
observed by all the other subjects. The person who 
stepped forward was always the trained model. 
The purpose of the next instruction was to eliminate 
the possibility that the subjects would feel anxious 
because they expected to take the same test as the 
model and, if the model did poorly, to do as 
poorly as the model. The experimenter said: 


Now let me make this very clear. The person 
seated here will listen to a tape and take a test. 
No one else will take this test either in front of 
the group or privately. This person is the only 
one who will take it. Nor will the rest of you 
take any test remotely like it. You will all func- 
tion as observers. Later on you will be asked to 
record your observations of this person, so please 
pay close attention. 


At this point in the procedure, the experimenter 
administered the measure of palmar sweating to all 
of the subjects including the model. This was done 
by a procedure originated by Silverman and 
Powell (1949) and discussed by Mowrer (1953). This 
measure was taken twice during the experiment, the 
change in perspiration being taken as the measure 
of induced tension. 

The instructions for the Watson-Glazer test were 
given within hearing of all the subjects and lasted 
for 3 minutes during which the subjects’ perspiration 
was measured for the first time. 

The subjects were then told that the test was a 
very good predictor of ability to think on one's 
feet and of success in a wide variety of professions. 
The experimenter said that the instructions for the 
test would be given orally, but that the questions 
would be given through a set of earphones at- 
tached to a tape recorder. The testee would give 
his responses orally. 

The test consisted of listening to a series of syl- 
logisms and judging whether the conclusions were 
sound or unsound on the basis of the premises. 

The experimenter then said: 


Now, before this test begins, in order to further 
assist those observing in their evaluations of you, 
please describe the nature of the paper-and-
pencil task which you completed prior to coming
to the front of the room. 


In the Similar condition the model said: 


Well, I worked on questions where I had to 
find the cause for something; then words of 
opposite meanings; then made some sentences 
into a code, and then the last thing I did was 
to decide which big words were made of several 
little ones. Oh! I told how interested I was in
each of these by putting a check on the bottom of 
each page. 

These were the same as the tasks the subjects
worked on.

In the Different condition, the model said:


I worked on some arithmetic problems, and,
which of those words that don’t mean anything I 
liked better; then, which animals belonged to 
which groups. I also had to decide which of several 
geometrical shapes contained the largest area. 


The experimenter then instructed the subjects to 
keep a record of the demonstrator’s responses on the 
first sheet of the booklets he had passed out to them. They
were also asked to record any unusual expressions 
by the demonstrator. The experimenter then turned 
on the tape recorder, which played 17 items of the 
test into the demonstrator’s earphone. The experi- 
menter pretended to record the oral responses of the 
demonstrator.

The purpose of giving the test was to give the 
model a personally important experience of success 
or failure. Since the items would be heard only 
through the earphones, the observing subjects could 
not know directly how well or poorly the demon-
strator was doing and had to rely on the experi-
menter's evaluation.

When the test was over the experimenter in-
structed all the subjects to repeat the palmar sweat
measure. During the 3 minutes in which the sub-
jects’ perspiration was measured, the experimenter
went back over the responses of the demonstrator
one at a time and stated whether each of the 17
responses was right or wrong. He made these
statements loud enough so that all the subjects
could hear. In the Good condition, he indicated that 
the demonstrator was right on 13 of the 17 items. 
Then he said: 


Let me see. You got . . . 13 out of 17 right.
That's extremely good. Let's see how you did
with respect to other college students. Let me
see . . . you would be in the upper 25%. Seventy-
five percent of college students would be below
you—not as good as you in their ability to think 
fast under pressure; that is, to think fast and 
logically on their feet in difficult situations. 


In the Poor condition, the experimenter said that 
13 of the 17 were wrong and then he said: 


Let me see. You got . . . 13 out of 17 wrong.
That is not good. Let's see how you did with
respect to other college students. Let me see . . .
you would be in the lower 25%. Seventy-five 
percent of college students would be above you 
—better than you in their ability to think fast 
under pressure; that is, to think fast and logically 
on their feet in difficult situations. 


The subjects were then instructed to answer the 
questions in the inside pages of the booklets they 
had received. The first three items concerned the 
subjects’ perception of the anxiety of the demon-
strator and were answered on eight-point scales.
The first item was a rating of the calmness or
nervousness of the demonstrator while taking the
test; the second, the same rating while he heard
how he did on the test; the third, the same rating
“now.” The responses to these three items were added
to form an index of perceived anxiety of the 
demonstrator. 

The fourth item was a rating on an eight-point 
scale of how interested the subjects were while the 
demonstrator took the test. 

The fifth through eighth items (answered on eight- 
point scales) concerned the subjects' reports of
their own nervousness at the moment. The responses 
to these four items were added to form an index of 
the subjects’ reports of own anxiety. The items were: 

How calm or tense did you feel when you found 
out how the person (i.e., the demonstrator) had
done? 

How hard or how easy are you finding it to 
concentrate on these questions? 

How dry or how wet do you think your mouth 
is now? 

How calm or tense do you feel right now? 

The next two items asked the subjects to rate how 
well they thought they would do on the test, first on 
an evaluation scale, second in comparison to other 
college students. The next item concerned their 
desire to meet the demonstrator. 

At a later session the study was explained and 
all the subjects’ questions answered. 

Personality measure. At another session, the sub- 
jects took the F Scale and a modification of the 
Q sort to measure their self-esteem. The modification
of the Q sort, based on the work of Weinberger
(1951), consisted of a check list of 26 desirable
trait-adjectives, presented first with instructions
to check how important it was to stand high in each
trait, and next (on a separate page) with instructions
to indicate how high they actually did stand
on each trait. Each trait was checked on a five-point
scale with the restriction that only 5 traits
were to be placed in any one category except for
6 in the middle category. The self-esteem score was
the sum of the squared differences in rating for each
adjective between the two administrations; the
greater the sum, the less the self-esteem.
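
The scoring rule just described can be sketched in a few lines. This is only an illustration: the function name and the sample ratings are hypothetical, and the ratings are assumed to be coded 1 to 5 for the five categories.

```python
def self_esteem_discrepancy(importance, actual):
    """Sum of squared differences between the two ratings of each trait.

    A larger sum means a wider gap between how important a trait is
    to the subject and how high he says he stands on it, that is,
    lower self-esteem.
    """
    if len(importance) != len(actual):
        raise ValueError("both administrations must rate the same traits")
    return sum((i - a) ** 2 for i, a in zip(importance, actual))


# Hypothetical ratings for 4 of the 26 traits (not data from the study).
importance = [5, 4, 3, 1]
actual = [3, 4, 2, 2]
print(self_esteem_discrepancy(importance, actual))  # 4 + 0 + 1 + 1 = 6
```

Squaring the differences weights one large discrepancy more heavily than several small ones.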

Analysis. Because of the unequal Ns in the four 
conditions, analysis of variance was not employed in 
the analysis of the data. Instead, the following 
procedure was used. First, for each dependent 
variable, the homogeneity of the within-cell vari- 
ances in the four conditions was tested by a method 
suggested by Walker and Lev (1953). If this test 
was negative, then an estimate of the within-cells
variance was computed by pooling the variances
from all four cells. This estimate was then used as
the basis of t tests between cells. If the within-cells
variances were heterogeneous, the t tests were based
on the variance of the cells compared. Second, a
procedure suggested by Walker and Lev was used
to test the interaction among the four cells. This
procedure is a t test of the significance of the difference
between differences between means. When the
total sample was split in two on one of the
personality variables, the differences between the
two subsamples were tested by means of this Walker- 
Lev procedure. This was done by taking pairs of 
TABLE 1
ANXIETY AND RATINGS IN THE EXPERIMENTAL CONDITIONS

                                            Similar         Different
Evaluation of performance                 Good    Poor    Good    Poor    t
                                           (a)     (b)     (c)     (d)
1. Change in palmar sweat [a]             92.8    92.3    80.2    92.8
2. Perception of model anxiety [b]         8.7    13.7     9.1    13.7    a vs. b: 6.616****
                                                                          c vs. d: 6.626****
3. Interest in model's test [b]            3.9     4.9     4.9     5.5
4. Self-report of anxiety [b]             12.1    15.1    13.9    14.6    a vs. b: 3.151***
                                                                          1.793 [c]*
5. Prediction of own ability on test [d]   3.4     4.2     3.4     4.7    a vs. b: 2.220**
                                                                          c vs. d: 3.512***
6. Comparison of own ability on test [d]   3.5     4.5     3.4     4.6    a vs. b: 2.585**
                                                                          c vs. d: 3.119***
7. Attraction to model [b]                 4.8     4.8     5.3     5.1
N                                           32      29      34      37

[b] The lower the figure, the less the attribute rated.
[d] The lower the figure, the better the evaluation.
* p < .05.
** p < .03.
*** p < .005.
**** p < .001.

two conditions and determining if the differences 
between these cells differed for the two subsamples. 
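
The two-step analysis can be sketched as follows. This is a minimal illustration, not the authors' computation: it assumes the Walker-Lev procedure takes the usual form of a contrast of cell means tested against the pooled within-cells variance, and all function names are hypothetical.

```python
import math


def pooled_variance(variances, ns):
    """Within-cells variance pooled over cells, weighted by degrees of freedom."""
    numerator = sum((n - 1) * v for v, n in zip(variances, ns))
    return numerator / sum(n - 1 for n in ns)


def t_between_cells(mean_a, mean_b, n_a, n_b, s2):
    """t for the difference between two cell means, using variance estimate s2."""
    return (mean_a - mean_b) / math.sqrt(s2 * (1 / n_a + 1 / n_b))


def t_difference_of_differences(means, ns, s2):
    """t for (a - b) - (c - d): the interaction among the four cells."""
    a, b, c, d = means
    contrast = (a - b) - (c - d)
    return contrast / math.sqrt(s2 * sum(1 / n for n in ns))


# Cell means and Ns for self-report of anxiety as given in Table 1;
# the within-cell variances are NOT reported, so these four are made up.
means = [12.1, 15.1, 13.9, 14.6]
ns = [32, 29, 34, 37]
s2 = pooled_variance([14.0, 15.0, 13.0, 16.0], ns)
print(t_between_cells(means[0], means[1], ns[0], ns[1], s2))
print(t_difference_of_differences(means, ns, s2))
```

Because the variances here are invented, the printed values will not reproduce the t values in the table; the sketch only shows where unequal Ns enter each standard error.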


RESULTS 


Before testing the hypotheses, it is neces- 
sary to show that the subjects perceived the 
model as more anxious in the Poor conditions 
than in the Good. Table 1 shows that in both 
the Similar and Different conditions, the model 
was perceived as being more anxious in the
Poor than in the Good conditions.

The first hypothesis leads to the prediction
that subjects in the Similar-Poor condition
would show more anxiety than subjects in the
Similar-Good condition. The palmar sweat
measure of anxiety does not show any significant
difference between the conditions
(see Table 1). Nor is there any significant
difference between the Different-Good and
Different-Poor conditions. However, the prediction
is supported by the subjects’ self-report
of anxiety. Table 1 shows that subjects
in the Similar-Good condition report significantly
less anxiety than subjects in the Similar-Poor condition.

Another prediction stemming from the first
hypothesis is that no difference would occur
between the Different-Good and Different-Poor
conditions. Table 1 shows that this is


[a] A score of 100 means no change in sweating. Scores lower than 100 mean a change toward less palmar sweating. The
score was obtained by first taking the densitometer reading on each of the two prints. If the second print was darker, the difference
in the two readings was added to 100. If lighter, it was subtracted.

[c] Walker-Lev test of difference between differences, one-tailed.


the case for self-report of anxiety. The means 
in the two Different conditions are intermediate
between the means in the Similar-Good and 
Similar-Poor conditions. Furthermore, a 
Walker-Lev test of the significance of the dif- 
ference between these differences between 
means is significant at the .05 level. This test 
shows that the Similar conditions show sig- 
nificantly more differences between the Good 
and Poor conditions than do the Different 
conditions. In sum, the results on self-report 
of anxiety support the hypothesis. 

Previous work (Stotland & Dunn, 1962; 
Stotland & Hillmer, 1962) would lead to the 
expectation that the subjects in the Similar 
conditions would evaluate their own abilities 
on the test taken by the demonstrator consistently
with the latter’s ostensible performance.
This previous work would also lead to
the expectation that this consistency of self-evaluation
would not obtain in the Different
conditions. The results in Table 1 show, however,
that the subjects evaluate their own
abilities consistently with the ostensible per- 
formance of the model in both the Similar 
and Different conditions (see Rows 5 and 6). 
It is interesting to note that the above cited 
difference occurs without any significant difference
in attraction to the model (see Row 7).

The second hypothesis was that later borns 
would support the first hypothesis more than 
first and only borns. Table 2 shows that 
neither group showed any significant differ- 
ence in palmar sweating among the conditions. 
However, the hypothesis is supported by the 
subjects’ self-report of anxiety. The later born 
subjects report significantly less anxiety in 
the Similar-Good condition than in the 
Similar-Poor. They show no difference be- 
tween the two Different conditions. A Walker- 
Lev test shows that the difference between 
these pairs of mean differences is significant.
It should be noted, however, that these results
are generated mostly by the low level of
anxiety reported in the Similar-Good condition.

On the other hand, the first and only borns 
show no significant differences among the four 
conditions. In fact, a Walker-Lev test is sig- 
nificant between the differences between the 
later borns and first and only borns with 
respect to the differences they show between 
the Similar-Good and Similar-Poor conditions.
In short, the results support the second 
hypothesis with regard to self-reports of 
anxiety, but not with regard to palmar 
sweating. 

The support for the second hypothesis 
cannot be accounted for by differences between 
the two types of subjects in their perception 


TABLE 2
ANXIETY AND RATINGS IN EXPERIMENTAL CONDITIONS BY BIRTH ORDER

                                            Similar         Different
Evaluation of performance                 Good    Poor    Good    Poor    t
                                           (a)     (b)     (c)     (d)
Later Borns
1. Changes in palmar sweat [a]            97.3    95.1    94.0    90.1
2. Perception of model's anxiety [b]       8.5    13.5     9.4    13.4    a vs. b: 4.565****
                                                                          [illegible]: [?].038***
3. Interest in model's test                4.0     5.2     5.6     4.7    [illegible]: 2.233**
4. Self-report of anxiety [b]             10.9    15.8    15.4    14.3    a vs. b: 3.526
                                                                          [illegible]
5. Prediction of own ability on test       3.5     4.3     4.2     4.5    [illegible]
6. Comparison of own ability on test       3.6     4.6     4.0     4.5    a vs. b: 1.757**
7. Attraction to model [b]                 4.9     5.1     5.1     5.0
N                                           15      15       9      15

First and Only Borns
1. Changes in palmar sweat                88.6    90.0    87.0    92.6
2. Perception of model's anxiety           9.7    13.5     9.2    13.7    a vs. b: 3.999
                                                                          [illegible]
3. Interest in model's test                3.9     4.5     4.9     6.0    [illegible]
4. Self-report of anxiety                 13.0    14.2    13.3    14.8    1.885**
5. Prediction of own ability on test       3.2     4.2     3.4     4.8    a vs. b: [illegible]
                                                                          c vs. d: 4.025*
6. Comparison of own ability on test       3.4     4.5     3.1     4.4    [illegible]: 1.860*
                                                                          [illegible]**
7. Attraction to model                     4.6     4.7     5.3     4.9
N                                           16      13      23      19

[a] A score of 100 means no change. Scores of less than 100 mean a change toward less sweat (see Table 1).
[b] The lower the figure, the less the attribute rated.
TABLE 3
ANXIETY AND RATINGS IN EXPERIMENTAL CONDITIONS BY SELF-ESTEEM

                                            Similar         Different
Evaluation of performance                 Good    Poor    Good    Poor    t                        p
                                           (a)     (b)     (c)     (d)
High Self-Esteem
1. Changes in palmar sweat [a]            95.4    94.6    84.9    88.8
2. Perception of model's anxiety [b]       7.7    14.6     9.6    12.8    a vs. [illegible]        <.001
                                                                          c vs. [illegible]        <.01
3. Interest in model's test [b]            3.2     5.4     4.9     4.9    a vs. [illegible]        <.05
                                                                          a vs. [illegible]        <.06
4. Self-report of anxiety [b]             10.4    15.6    13.1    13.3    a vs. [illegible]        <.01
                                                                          a vs. [illegible]        <.02
                                                                          b vs. [illegible]        <.10
                                                                          2.878 [c]                <.01
5. Prediction of own ability on test [d]   3.2     3.7     3.1     4.4    c vs. d: 2.847           <.01
6. Comparison of own ability on test [d]   3.4     4.1     3.3     4.0
7. Attraction to model [b]                 5.1     4.5     5.6     4.9
N                                           13      14      16      18

Low Self-Esteem
1. Changes in palmar sweat [a]            92.5    91.1    93.4    93.8
2. Perception of model's anxiety [b]      10.3    12.7     9.4    14.5    a vs. b: 2.237           <.03
                                                                          c vs. d: 4.724           <.001
                                                                          a & b: 2.001 [e]         <.01
3. Interest in model's test                4.9     4.3     5.6     6.2    b vs. d: 2.424           <.05
4. Self-report of anxiety                 13.1    14.5    15.1    16.5    a & b: 1.999 [e]         <.06
5. Prediction of own ability on test       3.6     4.7     3.8     5.2    a vs. b: 2.073           <.05
                                                                          c vs. d: 2.535           <.03
6. Comparison of own ability on test  [illegible]  4.8     3.6     4.9    a vs. b: 1.885           <.08
                                                                          c vs. d: 2.234           <.03
7. Attraction to model                     4.3     5.2     4.8     4.9
N                                           15      15      13      15

[a] A score of 100 means no change. Scores of less than 100 mean change toward less sweat (see Table 1).
[b] The lower the figure, the less the attribute rated.
[c] Walker-Lev test of differences between differences.
[d] The lower the figure, the better the evaluation.
[e] Walker-Lev test of difference between High and Low self-esteem persons.


of the anxiety of the demonstrator, since both 
types perceived the demonstrator as being 
more anxious in the Poor than Good condition 
(see Row 2). 

The work of Stotland and Dunn (1962) 
would lead to the expectation that the later
born children in the Similar condition would
evaluate their own abilities on the test more
consistently with the model’s ostensible performance
than those in the Different condition.
Table 2 (Rows 5 and 6) provides
only minor support for the expectation, with
only one barely significant t test on both
statements of ability and with the direction
of the difference between Good and Poor
conditions being the same in both the Similar
and Different conditions. It was also expected 
that the first and only borns would not 
manifest a difference between the Good and 
Poor conditions. However, Table 2 shows that
in both the Similar and Different conditions 
and for both measures of self-evaluation, these 
subjects evaluate themselves better in the 
Good than Poor conditions. 

None of the above differences between the 
two birth order groups is paralleled by any 
difference in attraction to the model. 

The third hypothesis led to the prediction 
that for persons low in self-esteem the ex- 
pected differences among conditions would be 
larger than for persons high in self-esteem.
Table 3 shows that there were no differences 
among conditions for either group of subjects 
on palmar sweating. However, on self-report 
of anxiety, the results are significantly op- 
posite to the prediction. Persons low in self- 
esteem show no significant difference among 
any of the conditions on reported anxiety. 
On the other hand, persons high in self-esteem 
report less anxiety in the Similar-Good con- 
dition than in the Similar-Poor one. The 
means in the two Different conditions are 
almost identical and are intermediate between 
the Similar-Good and Similar-Poor condition 
means. Furthermore, a Walker-Lev test shows 
a significant difference between the mean dif- 
ferences in the two Similar and two Different 
conditions. Thus persons high in self-esteem 
empathize more in the Similar conditions than 
in the Different. Furthermore, a Walker-Lev 
test shows that the two self-esteem groups are 
significantly different in the reaction to the 
two Similar conditions. 

Part of the process underlying the greater 
empathic reaction of the high self-esteem 
persons is suggested by perceptions of the 
model's anxiety by the two groups. Both
groups perceive the model as significantly
more anxious in the Poor condition than in 
the Good. But the high self-esteem persons 
appear to discriminate more between the 
Similar-Good and Similar-Poor conditions than 
the lows in their judgment of the model's 
anxiety. This difference is shown by the sig- 
nificant Walker-Lev test comparing the two 
types of persons on the difference between the 
Similar-Poor and Similar-Good conditions.

Table 3 also shows that the persons high
in self-esteem show a level of interest in the 
test the model took in the Similar-Good con- 
dition which is significantly lower than in 
the Similar-Poor and Different-Good condi- 
tions. The results also show that persons low 
in self-esteem report more interest in the 
model's test in the Different-Poor than in the 
Similar-Poor conditions. 

The work of Stotland and Hillmer (1962) 
and Stotland and Dunn (1962) would also 
lead to the expectation that in the Similar con- 
ditions, the self-evaluation of ability on the test 
taken by the model would be more consistent 
with the model's ostensible performance for
subjects low in self-esteem than for those
high. This expectation receives support from 
the results in Table 3. On both self-ratings 
of ability, the lows show significantly higher 
self-evaluation in the Similar-Good than in 
the Similar-Poor conditions. The highs show 
no significant difference, although the direction
of difference is the same as for lows. 

On the other hand, on both self-ratings of 
ability the lows also evaluate themselves 
consistently with the model's level in the two
Different conditions. This result was not ex- 
pected on the basis of the previous work. 
Furthermore, on one of the two self-ratings 
of ability in the Different condition, the highs 
evaluate themselves consistently with the 
model's performance. Thus it appears that 
the self-evaluations of low self-esteem persons 
were influenced regardless of the experimental 
conditions of similarity or difference, while 
the highs were in general less influenced, although
influenced somewhat in the Different
conditions. 

The fourth hypothesis was that persons 
low in authoritarianism would empathize more 
than persons high. The results provided no 
support at all for this hypothesis.


DISCUSSION


The support given the overall hypotheses 
by the self-reports of anxiety shows that it is 
possible to deal experimentally with so in- 
tangible a phenomenon as empathy. The 
results also show that persons empathize more 
with someone whom they perceive as similar 
to them than with someone who is not so 
perceived. In line with the work of Stotland 
and Hillmer (1962) and Stotland and Dunn 
(1962), the attribute on which the person 
perceives himself to be similar to the model 
can be one which is arbitrarily and ac- 
cidentally acquired by both the person and 
the model. In the earlier works, the individual 
acquired an attribute of ability level as a 
result of such arbitrarily and accidentally im- 
posed similarity. In the present study, a re- 
ported level of anxiety was acquired. Thus 
empathy may be treated as a special case of 
identification. This conclusion is, however, 
made somewhat tenuous by the finding that 
in both the Similar and Different conditions, 
the subjects evaluated their abilities consistently
with the model’s. This lack of differentiation
between the Similar and Different
conditions may have resulted from the
potency of having a live model, instead of 
the verbally described models of the earlier 
studies. It may be that acquisition of an
expected ability level has a low threshold,
so that either an arbitrarily imposed
similar attribute or the perception of a live 
model may be sufficient. On the other hand, 
empathizing may have a high threshold, so that 
both similarity and a live model are necessary. 

In the earlier work, it was pointed out that 
an individual can acquire a socially undesir- 
able attribute (e.g., poor performance level) 
through identification. In the present work, it 
has also been indicated that an individual can 
acquire an undesirable attribute, anxiety, as 
a result of perceiving the attribute in another 
person under the appropriate conditions. 
These findings raise questions about the moti- 
vational basis for identification and empathy. 

The failure of the physiological measure 
to manifest differences parallel to those in 
the self-report of anxiety may have resulted 
from a variety of factors: individual differ- 
ences in the manifestation of anxiety (Lacey, 
1950); the subjects’ adaptation to the experi- 
ment; etc. 

The greater tendency of later born children 
to empathize was manifest in the self-reports 
of anxiety and is consistent with the findings 
of Stotland and Dunn (1962) that they 
identify more readily than first and only 
borns. The later borns also show some tend- 
ency to evaluate their ability consistently 
with the model’s in the Similar conditions. 
This tendency is also consistent with the 
earlier findings that later borns identify more. 
However, the first and only borns evaluate 
themselves in ability like the model, regardless 
of the degree of similarity. This pattern of 
results suggests that the later borns will both 
empathize and identify with someone who 
is similar to themselves. It is as if they react 
to such similarity as a re-creation of the 
family situation of having peers (i.e., people 
similar to oneself) which was their initial 
experience. The first and only borns, on the 
other hand, react as if they only use the other 
person's performance level as a guide to self- 
evaluation, and do not really “feel with” him. 

The finding that persons high in self-esteem
empathize more than those who are low sug- 
gests that the highs have less need to be con- 
cerned with themselves and can “lose them- 
selves” more in other people, provided they 
do not expect to experience the same potential 
threat as these other people. This qualifica- 
tion is based on the results of the earlier 
studies in this series which show that the 
highs protect themselves against any loss of 
self-esteem. This interpretation about their 
tendency to empathize is supported by the 
finding that the highs are more sensitive to 
the differences in the model’s presumed feel- 
ings between the Good and Poor conditions. 
The lows are equally sensitive only if the 
model is placed at some distance from them- 
selves, i.e., in the Different condition. Those 
high in self-esteem are not as influenced in 
their self-evaluation as the lows are. The lows 
appear to use their observations of the model’s 
performance as a basis for self-evaluation. 
This is indicated by the differences they 
show between both the Good and Poor con- 
ditions in self-evaluation of ability. The find- 
ing that the difference occurs in both the 
Similar and Different conditions may be a
result of the greater potency of a live model, 
suggested above. 

The lows may be so concerned with their 
self-evaluation that they react to the experiences
of others primarily in terms of their implications
for themselves. Consistent with this
interpretation is the fact that in the previous 
work on the difference in reactions of low 
and high self-esteem persons, the dependent 
variable was self-evaluation of ability (Stot- 
land & Dunn, 1962; Stotland & Hillmer, 
1962). In both these studies, the lows were 
more influenced. These results are reminiscent
of the clinicians’ belief that only those who 
really love themselves can love others (Fromm, 
1941). 

The reason for the failure to show any 
relationship between empathy and author- 
itarianism might be related to the fact that 
empathy was shown only in the self-report. 
In the Stotland and Dunn (1962) study, the
greater tendency for low authoritarians to 
identify was not manifest in their self- 
evaluations but was shown in their actual 
levels of performance on a task. Analogously, 
the differences between the two groups might
have been obtained in the present study had 
the palmar sweat measure indicated empathy. 


ROLE OF “MEANINGFULNESS” VERSUS MEANING
DIMENSIONS IN GUESSING THE MEANINGS
OF FOREIGN WORDS


JONATHAN H. WEISS 1


University of Rochester


Ss guess the meanings of foreign words best when asked to match members
of English and foreign antonym pairs. The present study tests the necessity
for the word pairs to be antonyms. 394 Ss, comprising 4 groups, guessed the
meanings of Chinese and Hindi words under 2 conditions. 2 groups received
Chinese or Hindi antonym pairs along with English counterparts. The other
groups received the same words but in nonantonym pairs. Results indicate
significant departure from chance for all groups but no differences between
groups. The interpretation is that a meaningful stimulus rather than a meaning
dimension is of importance in a clear demonstration of the phenomenon.


Brown, Black, and Horowitz (1955) published
the findings of a study in which monolingual
subjects were required to match members
of English antonym word pairs with their
equivalents in Chinese, Czech, and Hindi, also
presented as antonym pairs. The significant
levels of matching achieved by subjects were
interpreted by the authors as supporting a
"universal phonetic symbolism" hypothesis
according to which the different sounds used
by different languages to embody similar
meanings are symbolically equivalent, enabling
subjects responsive to this symbolism to guess
the meanings of otherwise unfamiliar words.
Very soon thereafter, however, this interpretation
was challenged by Maltzman, Morrisett,
and Brooks (1956) and by Brackbill and Little
(1957). The former investigators showed
that, contrary to what would be predicted
from the Brown et al. hypothesis, subjects did
not exceed chance in matching Japanese and
Croatian words, although they could match
words correctly from either of these languages
with their English equivalents. Brackbill and
Little further demonstrated that their subjects
were responding to actual structural similarities
rather than to symbolic equivalences in
making their guesses. Sound symbolism, they
agreed with Maltzman et al., was no symbolism
at all.


1 The author wishes to express his gratitude to the
faculty and students of the University of Buffalo for
their time and cooperation in the data collection
phase of this study.


In response to these findings, Brown and 
Nuttall (1959) pointed out that where the 
results of the other studies were in disagree- 
ment with their own, important procedural 
variations had been present. They reasoned 
that since theoretically a unit of meaning 
could be represented by any one of a num- 
ber of phonemes and since, conversely, a 
phoneme could carry any of a variety of 
meanings depending upon the context in 
which it occurred, it would be necessary to 
provide subjects with a dimension of mean- 
ing, viz., an antonym pair in their native lan- 
guage, on which to locate the sounds of the 
foreign words and thereby judge their mean- 
ings. In neither the Maltzman et al. nor the 
Brackbill and Little study had this require- 
ment been met. Thus, Brackbill and Little 
had subjects judge whether an English and a 
foreign word had the same or different mean- 
ings and Maltzman et al. had subjects choose 
which one of a pair of response words was 
closer in meaning to a single stimulus word. 
Brown and Nuttall therefore went on to dem- 
onstrate that the English-foreign antonyms 
condition, such as they had originally used, 
elicited significantly more correct guessing 
than did either of the other methods, although 
all three were significant beyond chance. On 
the basis of this finding, Brown and Nuttall 
argued once again in favor of the original uni- 
versal phonetic symbolism hypothesis. 

There is, however, one factor which renders 
the Brown and Nuttall argument questionable.
What their studies have shown to this
point is that where you have a pair of stimu- 
lus words and a pair of response words it is 
necessary that one of these pairs be in Eng- 
lish in order for appreciable correct guessing 
to occur. However, they have not shown that 
the word pairs must be antonyms, since in no 
case were subjects asked to match members 
of nonantonymic word pairs. They have not, 
in fact, even shown that one must have word 
pairs in both the stimulus and response categories.
On the contrary, it will be recalled that
Maltzman et al. (1956) showed that a single 
stimulus word presented with a pair of re- 
sponse words is sufficient to elicit significantly 
correct guessing provided that either the 
stimulus word is, or the response words are, 
in English. In this case, however, where the 
response pair was English it was an antonym 
pair and where the single stimulus word was 
English one might contend that the subject 
could provide his own antonym implicitly. The 
present study was undertaken, therefore, to 
test the Brown and Nuttall hypothesis di- 
rectly by comparing the performances of sub- 
jects having to match members of English and 
foreign nonantonym pairs with those of sub- 
jects matching the same words presented as 
antonyms. Since it was possible, of course, 
that the presence of antonyms would be a 
helpful if not a necessary condition for the 
correct guessing of foreign words, three hypotheses
were tested:

1. If an English antonym stimulus word 
pair is a necessary condition for correct guess- 
ing in the English-foreign pairs paradigm, 
then above chance guessing should occur only 
when subjects are given antonym pairs while 
chance guessing should result when subjects 
are given nonantonym pairs. 

2. If it is sufficient that the stimulus word 
pair be English but nonantonymic in this 
paradigm, then significantly correct guessing 
should result in both the conditions where 
antonym and nonantonym pairs are presented 
and there should be no significant differences 
between the levels of correct guessing achieved 
in the two conditions. 

3. If an English word pair is the sufficient
factor in this paradigm but it is further help- 
ful that this word pair be antonymic, then, 
while significant guessing should occur in both 
the antonym and nonantonym conditions, significantly
more correct guessing should be
achieved under the antonyms condition than 
under the nonantonyms condition. 
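
The three hypotheses amount to a small decision table over three test outcomes. The sketch below is only a restatement of that logic; the function name and the boolean arguments (each standing for the result of a significance test) are hypothetical.

```python
def supported_hypothesis(antonym_above_chance, mixed_above_chance,
                         antonym_better_than_mixed):
    """Return 1, 2, or 3 for the hypothesis the outcome pattern supports.

    Returns None when the pattern matches none of the three, for
    example when neither condition departs from chance.
    """
    if antonym_above_chance and not mixed_above_chance:
        return 1  # antonym pairs are necessary
    if antonym_above_chance and mixed_above_chance:
        # English pairs suffice; antonyms either help (3) or add nothing (2).
        return 3 if antonym_better_than_mixed else 2
    return None


print(supported_hypothesis(True, True, False))  # 2
```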


METHOD


Twenty antonym pairs, equivalent in English, Chinese,
and Hindi, were taken from the Brown et al.
(1955) list and employed in the present study. Four
experimental conditions with four groups of subjects
were involved. In two of these (Antonym Pairs conditions)
subjects were presented with a list of 20
English antonym pairs and an equivalent list in either
Chinese or Hindi. In the other two conditions (Mixed
Pairs conditions) subjects were given a list of 20
English nonantonymic word pairs, composed of randomly
paired words from the original list, and an
equivalent list in either Chinese or Hindi. In the
Mixed Pairs conditions none of the word pairs were
antonyms. In all lists a random event determined
the order of words in each pair and another random
event determined the position of the pair in the total
list. Order of presentation was exactly counterbalanced
for half of the subjects in each condition. The
word lists were put into booklets with the English
pairs printed on the left side of the page and the
foreign equivalents on the right. A space was provided
along the right margin for subjects to indicate
their choices.
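
One way to build a Mixed Pairs list of this kind is to shuffle all the words and redraw whenever an antonym pair survives. The sketch below assumes that approach; the text does not say how the random pairing was actually carried out. The function name and the six example pairs are illustrative and are not the Brown et al. list. Counterbalancing of presentation order is not modeled.

```python
import random


def make_mixed_pairs(antonym_pairs, rng):
    """Randomly re-pair the words so that no resulting pair is an antonym pair.

    One shuffle fixes both the order of words within each pair and
    the position of each pair in the list.
    """
    if len(antonym_pairs) < 2:
        raise ValueError("need at least two antonym pairs to re-pair")
    antonym_of = {}
    for first, second in antonym_pairs:
        antonym_of[first] = second
        antonym_of[second] = first
    words = list(antonym_of)
    while True:
        rng.shuffle(words)
        pairs = [(words[i], words[i + 1]) for i in range(0, len(words), 2)]
        if all(antonym_of[x] != y for x, y in pairs):
            return pairs


# Illustrative antonym pairs only.
EXAMPLE_ANTONYMS = [
    ("short", "long"), ("up", "down"), ("weak", "strong"),
    ("many", "few"), ("ugly", "beautiful"), ("good", "bad"),
]
print(make_mixed_pairs(EXAMPLE_ANTONYMS, random.Random(0)))
```

Passing in a seeded `random.Random` keeps a given list reproducible, which matters if the same booklet must be regenerated.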

The subjects were 394 students in undergraduate 
introductory and social psychology classes. There 
were approximately equal numbers of males and 
females and all ranged in age from 19 to 26. Subjects 
were seen and tested in their classrooms. In each class
one-fourth of the students got each of the four treatment
conditions by means of random distribution of equal
numbers of all four booklets. Subjects were asked
to fill in identifying data, including name, age, sex,
and a brief statement of prior experience with foreign
languages, on the cover sheet of the booklet. Subjects
familiar with Chinese, Hindi, or other closely related
languages were later eliminated from the sample.
After filling out the cover sheets, subjects were told
that they had been given a list of 20 English word 
pairs and their equivalents in either Chinese or Hindi 
and that they were to try to match the members of 
each English pair with their equivalents in the 
foreign language by their sounds. Choices were to 
be indicated in the space provided for this purpose 
and then transferred onto IBM sheets for machine 
scoring. No further instructions were given nor were 
words pronounced aloud for the subjects. 

Following completion of the translation procedure 
subjects were asked to turn to the last page of 
their booklets on which a four-point scale had been 
provided for them to indicate the amount of difficulty 
they had experienced in making their choices. 


RESULTS 


The results of the present study may be 
examined in two ways. First we may look for 
TABLE 1

SUMMARY OF PERCENTAGE OF CORRECT SCORES AND PERCEIVED DIFFICULTY RATINGS FOR
MALES AND FEMALES UNDER ANTONYMS AND MIXED PAIRS CONDITIONS

                                          Mean % correct          Mean difficulty ratings [a]
Condition                           N     By sex       Combined   By sex    Combined
I. English-Hindi Antonyms
     Females                        38    54.50                   1.83
     Males                          61    [illegible]  55.35**    2.05      1.97
II. English-Hindi Mixed Pairs
     Females                        49    55.62                   1.72
     Males                          52    55.70        [illegible] 1.91     [illegible]
III. English-Chinese Antonyms
     Females                        45    52.77                   1.70
     Males                          52    56.82        54.90*     2.02      1.87
IV. English-Chinese Mixed Pairs
     Females                        49    54.55                   1.83
     Males                          48    56.65        55.60**    2.04      1.93

[a] 1 = extremely difficult, 2 = moderately difficult, 3 = moderately easy, 4 = extremely easy.
* p < .003.
** p < .0001.


gross differences between groups on levels of 
correct guessing achieved, without examining 
the pattern of guesses contributing to the 
overall scores. When this is done (see Table 
1) we find that all four groups achieve scores 
departing significantly from chance in the 
direction of correct guessing while no note- 
worthy differences between groups are in evi- 
dence. This would seem to support our second 
hypothesis indicating that the presence of an 
English nonantonym word pair is sufficient to 
elicit a level of correct guessing comparable 
to that achieved when the English and foreign 
word pairs are antonymic. Thus the presence 
of the specific meaning dimension described 
by Brown and Nuttall (1959) apparently 
adds little to the subject's performance. How- 
ever, before this conclusion can be stated with 
any certainty we must undertake a second 
level of analysis to determine if there are 
noteworthy differences in the patterns of responses
of the four groups. Two things might
be looked for here. First, one would wonder 
if in the antonyms groups a more general 
pattern of correct guessing occurs than in the 
nonantonym groups, with the apparent simi- 
larity in gross scores being accounted for by 
the latter guessing fewer word pairs but with 
high scores for these pairs. Tables 2 and 3 
show the word pairs in each list and the individual
percentage of correct scores. Examination
of these scores reveals a strikingly
similar pattern of responses for all groups.
Thus the number of correctly translated pairs
is identical for the antonyms groups and the
mixed pairs groups. Furthermore the number 
of scores significant at the .05, .01, and .001 
levels is almost the same for the two sets 
of groups, indicating a similar dispersion of 
scores. 
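
The source does not say which test produced the per-pair significance levels. As one hedged illustration, an exact two-sided binomial test against the chance rate of .50 could be sketched as below. The function name is hypothetical, and the counts in the usage note are back-calculated on the assumption that each percentage in Table 2 is a count out of the group size in Table 1 (for example, 80 of 97 subjects is 82.47%).

```python
from math import comb


def binomial_two_sided_p(successes: int, n: int) -> float:
    """Exact two-sided binomial p-value against a chance rate of 0.5.

    With a chance rate of 0.5 the distribution is symmetric, so the
    two-sided value is twice the smaller tail, capped at 1.0.
    """
    if not 0 <= successes <= n:
        raise ValueError("successes must lie between 0 and n")
    denominator = 2 ** n
    upper = sum(comb(n, k) for k in range(successes, n + 1)) / denominator
    lower = sum(comb(n, k) for k in range(0, successes + 1)) / denominator
    return min(1.0, 2 * min(upper, lower))


# Illustrative counts only (assumed, not reported in the source):
p_dry_wet = binomial_two_sided_p(80, 97)    # 82.47% correct
p_beautiful = binomial_two_sided_p(48, 97)  # 49.48% correct
```

Under these assumptions the first pair departs from chance well beyond the .001 level, while the second does not depart from chance at all.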

The second thing to be looked for in this 
more detailed analysis of the data is the 
fortuitous occurrence in the nonantonym word 
lists of pairs which, while not literal anto- 
nyms, could conceivably be perceived as 
“metaphoric antonyms." Should correct guess- 
ing occur strongly only for such word pairs 
this would seriously weaken our finding and 
would lend evidence to the Brown and 
Nuttall (1959) hypothesis instead. An exami- 
nation of our nonantonym word lists reveals 
three possible cases of such “metaphoric 
antonyms” which require attention. The first 
of these is SHORT (versus LONG)-UP and its
Chinese and Hindi equivalents TUÁN-SHÀNG
and CHHOTA-UPAR, respectively. In this case
a significantly incorrect level of guessing is 
found for the English-Chinese combination 
(possibly because of short-shang) and a 
highly significant level of correct guessing is 
achieved for the English-Hindi combination 
(very probably because of up and upar). In 
the second and third cases WEAK-MANY and 
UGLY-GOOD the same thing happens. Thus the 
possible presence of a dimension serves at 
one time to elevate percentage of correct 
scores and at another time to reduce them. 
The findings stated above, then, seem to hold 
up as a general performance similarity for all
groups and we can state with some confidence 
that the presence of antonyms has not been 
demonstrated to elicit more correct transla- 
tions in the present paradigm. 

Interestingly, further confirmation for this 
position derives from the "perceived diffi- 


TABLE 2 


PERCENTAGE OF CORRECT SCORES FOR EACH 
ANTONYM PAIR IN CHINESE AND HINDI


% % 
Pair Chinese Correct Hindi Correct 
1. BEAUTIFUL. MEL 49.48 KHUBSURAT [64.64 «^k. 
UGLY cn'ot BADSURAT 
2. BLUNT TUN 57.73 GOTHIL 64,64 ase 
SHARP K'UAL TEZ 
3, BRIGHT LIANG |51.54 |CHAMAKDAR| 80,80 wir 
DARK AN |DHUNDHALA| 
4. COARSE TS'U — |48.45 MOTA 41.41 
FINE nsi ACHHA 
5. DOWN HSIÀ |52.57 NICHE 79.79 wie 
UP SHANG UPAR 
6. DRUNK TSU] [44.32 NASHE MEN |59.59. 
SOBER HSING ISANJIDA 
7. DRY KAN 82.47 oe [SUKHA 33.330" 
WET SHIH BHIGA 
8. FAST K'UAÌ [73.19 9 | rez 63.63 v 
SLow MÀN sust 
9. FAT FEÍ 43.29 |MOTA 55.55 
THIN suoü PATALA 
10. corp CHIN  |41.23 ISONA 59.59 
IRON Tite LOHA 
11, Goop HAG 41.23 ACHHA 53.53 
BAD HUA KHARAB 
12. HAPPY HUAN [41.23 KHUSH 22.22% 
SAD PET RANJIDA, 
13, HARD KANG |39.67>* |saxut 55.55 
SOFT Joo INARM 
14, LIGHT CH'ING |67.019** |HALKA 57.57 
HEAVY CHUNG |WAZANI 
18. LONG. CH'ÁNG |46.39 LAMBA 18,1801 
SHORT TUÁN |CHHOTA 
16. MANY TO 36.08b** |BAnuT 83.83 wee 
ONE vr EK 
17. STRONG CH'IÁNG|54.63 |MAZBUT 136.360" 
WEAK Jò KAMZOR 
18, SWEET TIÉN (64.94% [MITHA 66.66 v 
SOUR SUAN KHATTA 
19. WARM NÜAN [76.289 |GARAM 69.69 se 
COOL LIÁNG THANDA 
20, WIDE K'UAN |62.88** |cmaura — |43.439* 
NARROW CHAL TANG 
Number 
correct 6 
Number
incorrect 5 5 


a Correct direction. 
b Incorrect direction.

TABLE 3


PERCENTAGE OF CORRECT SCORES FOR EACH MIXED 
PAIR IN CHINESE AND HINDI


4 o P A 
Pair Chinese] co ee. | Hindi | coet 
1. WET sHin [41.23 BHIGA 56.43 
BRIGHT LIANG CHAMAKDAR 
2. DRY KĀN 51.54 SUKHA 35.64 bk 
SAD PET RANJIDA 
3. IRON TiÉH = |34.02>** |LOHA 44.55 
DARK AR IDHUNDHALA| 
4. BEAUTIFUL MEL 56.70 KHUBSURAT|40.59 
SOBER HSING ISANJIDA 
5. SHORT (versus |TUAN  |36.08b** |cumorA 82.17 aik 
LONG) 
UP SHÀNG UPAR 
6. SLOW MÀN [59.79 SUST 70,29 viii 
SHARP K'UÀI TEZ 
7. BAD HUAI  |46.39 KHARAB 54.45 
NARROW CHAL TANG 
8. LIGHT (versus |cm'iNG |56.70 HALKA 54.45 
HEAVY) 
THIN sHoÙ PATALA 
9. WEAK Jò 67.01 9** | KAMZOR 38.61 b 
MANY 16 BAHUT 
10. FINE ust 73.19 0 |ACHHA 77.22 we 
ONE yi EK 
11. COARSE TS'U 64.94% [MOTA 51.48 
WARM NUAN GARAM 
12. FAT FEÍ 65.97 9** |worA 75,24 wt 
DRUNK TSUl NASHE MEN 
13. ucLv cH'ot |43.29 BADSURAT |70.29 wk 
GOOD HAS |ACHHA 
14. SoFT Joo 63.919 |NARM 52.47 
LONG CH'ÁNG LAMBA 
15. HARD KANG |63.919** |saxuT (67,32 e*t 
HEAVY CHURG WAZANI 
16. BLUNT TUN — |77.319** [corm [49.50 
SWEET TIEN MITHA 
17. WIDE K'UĀN |51.54 CHAURA 59.40 
STRONG CH'IÁNG| MAZBUT 
18. HAPPY HUAN |61.859* |KHUSH 40.59 
GOLD CHIN SONA 
19. cooL LUANG |35,05b** |rHANDA |48.51 
DOWN HSIA NICHE 
20. sour SUAN  |59.79 KHATTA 40.59 
FAST K'UAl TEZ 
Number 
correct 8 6 
Number 
incorrect 3 2 


culty” ratings which are strikingly uniform 
(see Table 1), while it might be inferred 
from the Brown and Nuttall (1959) statement 
that subjects given antonym pairs ought to 
find the task easier, since the presence of a 
dimension should serve to eliminate some of 
the perplexity from the attempt to find a
suitable context in which to make the 
judgment. 

One final aspect of our data should be 
noted. While the differences are not signifi- 
cant, there appears to be a consistent tend- 
ency for males to achieve slightly higher 
correctness scores than females and to find 
the task easier to accomplish (see Table 1). 
While this is only suggestive, it is contrary 
to other reports that no sex differences are 
discernible.


Discussion 


We have seen that with adult subjects it 
is advantageous that the stimulus word pair in 
the “translation paradigm” be in English. We
have not found, however, that the word pair 
must be antonymic, nor that antonyms lend 
any discernible advantage where they are 
employed. Thus it does not seem to be the 
case that, as Brown and Nuttall (1959) have 
argued, a dimension of meaning embodied in 
an antonym word pair is a crucial factor in 
achieving significantly correct matching. What 
does seem to be true is that it is distinctly 
helpful for the subject to be given a meaning- 
ful context, represented by the English words, 
in which to make his judgment in order for 
a clearly consistent and correct performance 
to result. We have shown more extensively 
in another paper (Weiss, in press) that in a 
slightly different paradigm, where subjects 
were asked to attribute “meanings” to 
phonetically contrasted nonsense words by 
matching them with physically contrasted 
nonsense pictures, only slightly better than
chance agreement occurred. When, however, 
they were asked to match these word pairs 
with physically contrasted meaningful refer- 
ents, there was a significant increase in the 
extent of agreement and also an increase in 
the degree to which judgments were made in 
a predictable and “appropriate” direction.
Since in both cases a dimension was clearly 
present, it would seem to be the meaning- 
fulness of the referents that served to elevate 
agreement and directionality scores by some- 
how orienting the subject in an appropriate 
direction and, as it were, drawing his atten- 
tion to the appropriate phonetic elements 
upon which his judgment was to be based.
Thus in relation to the guessing of foreign
words we would say that to the extent that 
appropriate phonetic elements are incorpo- 
rated into equivalent words in different lan- 
guages, and to the extent that the English 
words orient the subject to respond to those 
elements, to that extent will there be correct 
guessing. 

Before going on to consider what the nature 
of this “orienting” might be we must wonder 
about the distinctly poorer scores achieved 
in the Brackbill and Little (1957) "Same-Different"
procedure despite the fact that one
of the words was in English. The answer 
here lies, we feel, in a consideration of the 
strategy of the subject in a matching task. 
Where a single stimulus word and a single 
response word are involved the subject is 
given a single opportunity to discover some 
appropriate similarity upon which to base his 
judgment. In the absence of such a discovery 
the subject must then either make the judg- 
ment “Different” or resort to guessing or some 
other potentially arbitrary procedure. Where 
pairs of words are employed in both the 
stimulus and response categories, on the other 
hand, the subject has at least four times as 
great an opportunity for discovering some 
similarity, assuming that one exists, upon 
which to base a correct judgment of similar 
meanings. Thus we find the advantage of 
word pairs to reside primarily in the amount 
of information with which the subject can 
work. 
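
The "four times" figure can be read as a simple count of cross-language comparisons. The sketch below only illustrates that counting argument; the function name is hypothetical, and the example words are taken from Table 2.

```python
from itertools import product


def cross_language_comparisons(stimulus_words, response_words):
    """List every stimulus-response comparison open to the subject."""
    return list(product(stimulus_words, response_words))


# One English word against one foreign word ("Same-Different" procedure)
single = cross_language_comparisons(["DRY"], ["KAN"])

# An English pair against a foreign pair (translation paradigm)
paired = cross_language_comparisons(["DRY", "WET"], ["KAN", "SHIH"])

print(len(single), len(paired))  # 1 4
```

Each of the four comparisons in the paired case is a separate chance to notice a sound-sense similarity, which is the sense in which the paired procedure gives the subject more information to work with.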

To return to the “orientation” which 
meaningful stimuli are hypothesized to 
produce, the exact nature of this process is 
difficult to elucidate at this point. It could 
be argued, of course, that despite our not 
having explicitly presented subjects in the 
nonantonym groups with antonyms in the 
word pairs, subjects in those groups could 
have supplied their own antonyms implicitly 
and would therefore be working with dimen- 
sions as Brown has described. This is, we 
feel, a tenable position and one which could 
be followed up by studying the performance 
of subjects on a translation task where English 
and foreign words are employed which have 
no clear antonyms. However, there is an 
alternative hypothesis which, while not radi- 
cally different, has important implications. We 
have suggested elsewhere (Weiss, in press) that
the orienting effect derives from a narrowing 
down of the range of associations which the 
subjects may use in trying to execute a match- 
ing task such as that involved in these studies. 
Brown and Nuttall (1959) have suggested 
that the sound-sense link which is the basis 
for the phonetic symbolism phenomenon is 
one which is learned through exposure to con- 
sistent co-occurrences of sound types with 
physical types in the real world. Thus the 
English word serves to suggest a category of 
experiences which the subject may use in 
determining the meaning of a sound or sound 
sequence. Where no such categories are sug- 
gested either because both stimulus and re- 
sponse words are foreign or because a non- 
sense word is to be paired with a nonsense 
picture (e.g., Usnadze, 1924) subjects are 
free or even forced to resort to highly diverse 
and arbitrary associations. Where the cate- 
gories are given, however, the associations 
are in a sense controlled and intersubject 
agreement, therefore, is increased. Thus, as 
indicated above, to the extent that sound-sense
relations congruent with the subject’s
experiences have found their way into all 
natural languages, correct guessing is likely 
to occur.

What implications do our findings have for 
the problem of how the sound-sense relations 
have found their way into natural languages? 
As it is presently conceived, the universal 
phonetic symbolism hypothesis seems to be 
stated in terms of the symbolic equivalence of
sounds in noncognate languages, i.e., “different
languages-different sounds-same meanings.”
However, it may be that the hypothesis
needs to be broadened to include instances of 
"different languages-same sounds-same mean- 
ings." This is a matter which only further re- 
search can settle. At this point, however, we 
find evidence for more than a single factor in 
the introspections of our subjects. Thus there 
is reliance upon associations in some instances 
and structural similarities in others. It does 
not seem unlikely when one considers Brown's 
learning hypothesis mentioned above that 
both would result since there are undoubtedly 
some sound-sense co-occurrences which are
universal. 


CONFORMITY UNDER VARYING CONDITIONS OF
PERSONAL COMMITMENT 1


DOUGLAS P. CROWNE AND SHEPHARD LIVERANT 2
Ohio State University


This study tested a central proposition, stemming from Rotter's Social Learning
Theory, that conformity is related to low expectations of success in socially
evaluative situations and is consequently accompanied by defensive processes.
2 variants of an Asch situation representing increasing degrees of personal
commitment were compared to a control condition. Situational measures of
confidence included betting and statements of expectancy. Additional
personality indices included level of aspiration, need for approval, and internal
vs. external control of reinforcement. The Ss were 110 introductory psychology
students. Results depict the conformer as one who has a low expectation of
success in evaluative situations. His lesser confidence leads to avoidant
behavior to resolve the ensuing conflict. Defensive processes of the conformer
tend to become more marked as personal commitment increases.


1 This research was supported in whole by the
United States Air Force under Contract No. AF
(638)-741, monitored by the Air Force Office of
Scientific Research of the Air Research and Development
Command.

2 We would like to thank our colleagues Julian
Rotter and Milton J. Rosenberg for their
helpful suggestions and advice in the preparation
of this report.

3 Subsequent to the completion of this experiment,
it was discovered that Siegel (1956) briefly described
a procedure similar to this in his Nonparametric
Statistics in connection with the illustration of a
statistical technique.


The experimental conformity situation, in
which a conflict is induced between an individual's
belief, opinion, or judgment and
an oppositional group norm, rests on the
supposition of personal commitment to, or
investment in, the contravened position.
While considerable experimental analysis has
been devoted to variations in group pressure
(Asch, 1952; Crutchfield, 1955; Mouton,
Blake, & Olmstead, 1956), the present study
is addressed to the question of the effects of
manipulating personal commitment by increasing
the penalty for conforming behavior.

Punishing conformity, and correspondingly
increasing the value of independence, was
accomplished by providing subjects with an
amount of money and requiring them to bet
on the correctness of their judgments. The
underlying rationale was that the introduction
of negative consequences for yielding
should increase the subject's commitment to
his judgment. A lesser degree of investment
was envisioned in a second condition in
which subjects had to make public estimates
of confidence in their stated judgments.

These procedures were intended to make 
the choice of conformity or independence a 
more meaningful and crucial one and thereby 
to extend analysis of the spectrum of con- 
forming behavior. On a more tentative level, 
however, conformity under these conditions 
might suggest an answer to the persistent 
question of the degree of awareness, or of 
defensive distortion, involved in conforming 
behavior. The defensive nature of the con- 
formity of some subjects has been inferred 
from the post-experimental interviews em- 
ployed in such prior studies as those of Asch 
(1956). The present study, by exacting a 
price for conformity, makes it more difficult 
for the subject to explain his yielding to 
himself; there is, thus, the suggestive implica- 
tion that to the degree that conformity oc- 
curs under the conditions outlined above, it 
is accomplished with relatively little aware- 
ness and with defensive justification. 

Besides delineating certain aspects of con- 
forming behavior under conditions of in- 
creased conflict, the present design lends 
itself to the study of personality differences 
between conformers and independents. Most 
relevant is the question of the relation be- 
tween conformity and betting or expectancy- 
stating. The major rationale for postulating 
differential expectations (betting is considered
a measure of expectancy) regarding the 
probability of being correct is provided by 
Rotter’s (1954) Social Learning Theory 
(SLT). Within SLT, the conformer can 
be characterized as one who has a low ex- 
pectation of success in socially evaluative 
situations. In consequence, he is inclined to 
use avoidant or defensive behavior in antic- 
ipation of punishment and threats to self- 
esteem. In other words, conformity in such 
situations represents protective behavior de- 
signed to avert expected failure. It follows 
that conformers, as compared to independ- 
ents, should bet or state expectancies in a 
manner indicative of less confidence in their 
judgments, particularly on nonconforming 
trials. 

A further test of the adequacy of this 
theoretical formulation was made by admin- 
istering the Rotter (1942) Level of Aspira- 
tion (LOA) Board. Rotter has described 
this goal setting technique as a problem 
solving situation which calls for self-evalua- 
tion on the part of the subject. Odell (1959) 
found a high relationship between extreme 
scores on the Independence of Judgment scale
(Barron, 1953), a questionnaire index of 
tendencies to conform, and overly cautious 
and defensive aspiration patterns on the LOA 
Board. 

Two personality inventories were also 
utilized to extend the network of proposi- 
tions concerning individual differences in the 
self-evaluative behaviors of conformers and 
independents. The first of these was a measure 
of the need for approval (Marlowe & Crowne, 
1961), previously shown to be related to 
social conformity (Strickland & Crowne, 
1962). A high score on this test demands 
that the subject depict himself in a culturally 
sanctioned but highly improbable light. 
Recent research, notably a study by Conn 
(1963), suggests that high scores manifest 
a need for approval based, at least in part, 
on defensive components. Dependence on the 
favorable evaluation of others may thus be 
taken to reflect low self-esteem, which would 
tend to result in irreal or self-protective be- 
havior in important evaluative situations. 
On this basis, approval motivated subjects 
should be less able to resist group pressure 
and should display less confidence in their
judgments. 

The second personality inventory rep- 
resents an attempt to measure an expectancy 
construct recently derived from SLT entitled 
Internal versus External Control of Reinforcement
(Rotter, Seeman, & Liverant, in press).
Internal versus External Control (I-E) refers 
to the degree to which an individual tends to 
perceive the consequences of his actions as 
being within (internal) or beyond (external) 
his control. The externally controlled indi- 
vidual sees relatively little instrumentality in 
his own behavior and regards himself as 
the passive recipient of reinforcements dis- 
pensed by external forces (chance, fate, im- 
personal social forces, or powerful others). 
In the previously cited study by Odell, a 
significant relationship was found between 
the index of I-E and the Independence of 
Judgment scale, with externals showing 
greater tendencies to conform. 


METHOD 
Subjects 


The subjects were 110 introductory psychology 
students, 40 males and 70 females, apportioned to 
the three experimental conditions as follows: The 
Control group contained 32 subjects (14 males, 18 
females); the Expectancy group consisted of 35 
subjects (14 males, 21 females); and in the Betting 
group were 43 subjects (12 males, 31 females). 


Conformity Situations 


The basic conformity situation utilized in the 
three conditions was an adaptation of the Asch 
perceptual discrimination procedure. The task set 
for the subjects required them to distinguish, on 
each of 20 trials, the larger of two groups of dots 
which were presented tachistoscopically for a 
1-second interval by means of a 2 by 2 slide projector.
Each slide differed in the geometric arrangement
of the groups of dots. The identification of
the larger cluster on each slide was quite unambigu- 
ous, as established by accuracy levels ranging be- 
tween 90 and 100% for the 20 slides as rated by 
12 judges. 

In the Control condition, the confederates and 
the naive subject were informed that the experiment 
was concerned with perceptual discrimination and 
speed. On each trial, they were required to publicly 
announce, in a given order, the letter (A or B) 
designating the larger cluster of dots. 

Exactly the same procedure was followed, and 
the identical task instructions were given, in each 
of the other conditions. The Expectancy group, 
however, was told that, in addition to perceptual 
discrimination and speed, their confidence in the
accuracy of their perceptual judgments was being 
investigated. Subjects in the Expectancy group 
indicated their certainty of being right on a scale 
from 0 (little or no confidence) to 10 (very high
confidence). In this group, then, the members were 
required to announce their confidence statements 
following each judgment. The confederates made 
confidence statements of a prearranged magnitude. 
They were assigned consistent roles in an attempt 
to give them credibility, although in actuality the 
total distribution of the confederates’ expectancy 
statements was approximately normal, with the 
modal expectancy at the midpoint of the scale. 

As in the case of the Expectancy group, the 
Betting group was informed that confidence was 
being evaluated. In this group each member was 
given $10.00 in quarters with which he could bet 
nothing, $.25, or $.50 on each of his judgments. As 
in the Expectancy group, the bets of the con- 
federates were so prearranged as to prevent the 
naive subject from forming his own betting pattern 
on the basis of consistent group behavior. The 
instructions indicated that at the end of the session 
the accuracy of each judgment would be deter- 
mined and that subjects’ bets would be matched for 
those trials on which they were correct. Subjects 
declared their bets by dropping $.25 or $.50 (or 
nothing) into small boxes in front of them. The 
subjects could keep a proportion of their winnings, 
and this was explained to them in the initial 
instructions. Each subject’s take was determined 
by adding the amount he won to any remaining 
portion of the original $10.00. If this total was 
$10.00 or less, the subject kept 10%; if he ended 
up with more than $10.00, he kept $1.00 plus 20% 
of everything over $10.00. Thus, if a subject bet 
$.50 on every trial and was correct each time, he 
won $3.00. 
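
The payoff rule can be checked with a few lines of arithmetic. The sketch below is one reading of the rule as stated: a bet on a correct trial is matched, a bet on an incorrect trial is forfeited, and the subject's share is then computed from the resulting total. The function and constant names are hypothetical, and amounts are held in cents.

```python
ALLOWED_BETS = (0, 25, 50)  # cents: nothing, $.25, or $.50 per judgment


def subject_take_cents(bets, correct, stake=1000):
    """Return the amount (in cents) a subject keeps under the stated rule.

    bets    -- amount bet on each trial, in cents
    correct -- True where the judgment was scored as correct
    stake   -- the original $10.00, in cents
    """
    if len(bets) != len(correct):
        raise ValueError("one outcome is needed per bet")
    total = stake
    for bet, was_correct in zip(bets, correct):
        if bet not in ALLOWED_BETS:
            raise ValueError(f"a bet of {bet} cents is not allowed")
        # Assumption: a matched bet adds its value, a lost bet is forfeited.
        total += bet if was_correct else -bet
    if total <= stake:
        return total * 10 / 100              # keeps 10% of the total
    return 100 + (total - stake) * 20 / 100  # $1.00 plus 20% of the excess


print(subject_take_cents([50] * 20, [True] * 20))  # 300.0
```

The printed value reproduces the $3.00 example in the text. Under the same reading, a subject who never bet would keep $1.00.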


Personality Measures 


The need for approval was assessed by means of 

the Marlowe-Crowne Social Desirability (M-C SD) 
scale (Crowne & Marlowe, 1960), a 33-item, true- 
false questionnaire. An illustrative item is, “I would 
never think of letting someone else be punished for 
my wrongdoings." 
The index of I-E was a 60-item, forced-choice
inventory, the I-E scale (Liverant & Scodel, 1960), 
which pairs internal with external choices as in the 
following example: 


I more strongly believe that 

(a) I have usually found that what is going 
to happen will happen, regardless of my actions. 

(b) trusting to fate has never turned out as 
well for me as making a decision to take a 
definite course of action. 


The scores obtained on the LOA Board are the 
D Score, an algebraic summation of the differences 
between past performance and estimates of subsequent
success, and the number and appropriateness
of shifts or changes in estimates. In addition,
these indices may be combined to yield a series 
of nine patterns which describe the individual’s 
overall approach to the setting of goals. Rotter 
(1954) has described the criteria for the assignment 
of these patterns and reported evidence on their 
reliability and validity. 
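
As a rough illustration of the two simplest LOA indices, the sketch below computes a D score and a count of shifts. It rests on assumptions not stated in the source: that each difference is taken as the estimate for a trial minus the performance on the preceding trial, and that a shift is any change between successive estimates. The function names are hypothetical, and the pattern assignment described by Rotter (1954) is not attempted.

```python
def d_score(performances, estimates):
    """Algebraic sum of (estimate for trial t) - (performance on trial t-1).

    performances[i] and estimates[i] both refer to trial i, so the first
    estimate has no preceding performance and is left out of the sum.
    The sign convention is an assumption made for this sketch.
    """
    if len(performances) != len(estimates):
        raise ValueError("one estimate is needed per performance")
    return sum(est - prev for prev, est in zip(performances[:-1], estimates[1:]))


def count_shifts(estimates):
    """Number of times the estimate changes from one trial to the next."""
    return sum(1 for a, b in zip(estimates, estimates[1:]) if a != b)
```

With performances of 10, 12, 11 and estimates of 11, 12, 13, the D score is (12 - 10) + (13 - 12) = 3 and there are two shifts.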

The patterns constituted our measure of self-eval- 
uation and were divided into two major groups. The 
first of these groups, consisting of Patterns 2, 4, 
and 7, indicates overcautiousness, self-protection, 
and defensive goal setting and reflects lack of self- 
confidence and avoidance of expected failure. The 
second group of patterns, 1 and 3, represents a 
greater degree of confidence and more appropriate 
expectations of goal attainment. 


Procedure 


The two personality inventories were administered 
to all introductory psychology students on the first 
day of classes. For each experimental appointment, 
the naive subject and the confederates, two males 
and two females, were met in the waiting room 
and escorted to the room in which the experiment 
was conducted. The confederates were instructed to 
pre-empt the first four chairs, leaving the seat on 
the far right for the naive subject. To forestall 
suspicion, the order of responding was rotated 
from trial to trial, beginning with the confederate 
at the far left. Thus, the naive subject came first 
in the response order on every fifth trial. These were 
the noncritical trials. At the conclusion of the 
conformity procedure, each naive subject was 
questioned in detail to determine his awareness of 
the nature of the experimental procedures, the pur- 
pose of the experiment, and his account of his own 
behavior. 
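
The rotation of the response order can be sketched as follows, assuming that the starting position moved one seat to the right on each trial and that seats are numbered 0 (far left) to 4 (far right, the naive subject). The names are hypothetical.

```python
NAIVE_SEAT = 4   # far-right chair, numbering seats 0-4 from the left
N_SEATS = 5
N_TRIALS = 20


def first_responder(trial: int, n_seats: int = N_SEATS) -> int:
    """Seat that answers first on a 1-based trial number."""
    return (trial - 1) % n_seats


noncritical_trials = [
    t for t in range(1, N_TRIALS + 1) if first_responder(t) == NAIVE_SEAT
]
print(noncritical_trials)  # [5, 10, 15, 20]
```

Under this assumption the naive subject answers before hearing any confederate on four of the 20 trials, which matches the four noncritical trials analyzed in the Results.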

Finally, the LOA Board was administered to all 
the subjects in the Control group. Twenty-eight of 
the 35 subjects in the Expectancy group were 
recalled for a second appointment and given this 
measure as were 17 of the 43 Betting group subjects. 


RESULTS 


As a preliminary step, the post-experi- 
mental interviews were carefully examined 
for suspicion, and 13 subjects (10 from the 
Control group and 3 from the Expectancy 
group) who correctly surmised the contrived 
nature of the experimental procedures were 
eliminated. 

For all of the major analyses to be reported 
below, except the basic comparison of the 
amount of conformity in the three conditions, 
standard scores were utilized in order to 
achieve comparability between conditions and 
sexes. A range of values with a mean of 50 
and a standard deviation of 10 was chosen. 
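
A conversion of this kind could be sketched as below. The source does not say whether scores were standardized within each condition-by-sex group or which form of the standard deviation was used; the sketch assumes standardization within whatever group is passed in and uses the population standard deviation. The function name is hypothetical.

```python
from statistics import mean, pstdev


def to_standard_scores(raw, target_mean=50.0, target_sd=10.0):
    """Rescale raw scores to a chosen mean and standard deviation."""
    m = mean(raw)
    s = pstdev(raw)  # assumption: population SD of the group being rescaled
    if s == 0:
        raise ValueError("scores with no variation cannot be standardized")
    return [target_mean + target_sd * (x - m) / s for x in raw]


scores = to_standard_scores([2, 4, 4, 4, 5, 5, 7, 9])
```

Applied separately to each group, the transformation puts every group on the same scale, so that a score of 60 always means one standard deviation above that group's own mean.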

TABLE 1


MEAN NUMBER OF CONFORMING TRIALS IN THE
THREE CONDITIONS


Condition          N       M      SD
Control
   Males           9    4.78    4.05
   Females        13    6.31    3.64
   Combined       22    5.68    3.80
Expectancy
   Males          11    6.18    3.68
   Females        21    5.90    4.35
   Combined       32    6.00    4.07
Betting
   Males          12    4.92    4.54
   Females        31    9.77    3.59
   Combined       43    8.42    4.41


Conformity in the Three Conditions 


The amount of conformity in the three 
conditions was separately compared for males 
and females as shown in Table 1. A simple 
analysis of variance of the number of con- 
forming trials by males in the three conditions 
yielded a nonsignificant F. A similar analysis 
for females was highly significant (F = 7.56, 
p < .005, df = 2/62). For females, the
Betting condition produced a significantly
greater amount of conformity than either
the Control (t = 2.72, p < .01) or Expectancy
(t = 3.55, p < .001) conditions. The
between-conditions effects for females were 
sufficiently powerful to yield significant dif- 
ferences between the Betting and Control 
and Expectancy groups (t’s, respectively, of 
2.57 and 2.51, p< .02) with males and 
females combined. Thus, the effect of introducing
negative sanctions for conformity
did not decrease yielding among males and 
produced a striking increase in conforming 
behavior among females. Public declarations 
of confidence in the Expectancy group did 
not influence the amount of conformity. 
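
The reported statistics for females can be approximately recovered from the summary values in Table 1. The sketch below is a check under stated assumptions, not the authors' computation: it treats the tabled SDs as sample standard deviations (reading the Expectancy females' SD as 4.35) and assumes the pairwise t's used the pooled within-groups mean square from the analysis of variance as the error term. The names are hypothetical.

```python
from math import sqrt

# (n, mean, SD) for females in each condition, read from Table 1
FEMALES = {
    "Control": (13, 6.31, 3.64),
    "Expectancy": (21, 5.90, 4.35),
    "Betting": (31, 9.77, 3.59),
}


def one_way_anova(groups):
    """Return (F, df_between, df_within, MS_within) from summary statistics."""
    n_total = sum(n for n, _, _ in groups.values())
    grand_mean = sum(n * m for n, m, _ in groups.values()) / n_total
    ss_between = sum(n * (m - grand_mean) ** 2 for n, m, _ in groups.values())
    ss_within = sum((n - 1) * sd ** 2 for n, _, sd in groups.values())
    df_between = len(groups) - 1
    df_within = n_total - len(groups)
    ms_within = ss_within / df_within
    return (ss_between / df_between) / ms_within, df_between, df_within, ms_within


def t_with_pooled_error(groups, a, b, ms_within):
    """t for the difference between two group means, using the ANOVA error term."""
    (n_a, m_a, _), (n_b, m_b, _) = groups[a], groups[b]
    return (m_a - m_b) / sqrt(ms_within * (1 / n_a + 1 / n_b))


f_value, df_b, df_w, ms_w = one_way_anova(FEMALES)
t_control = t_with_pooled_error(FEMALES, "Betting", "Control", ms_w)
t_expectancy = t_with_pooled_error(FEMALES, "Betting", "Expectancy", ms_w)
```

Under these assumptions the degrees of freedom come out as 2 and 62, F is about 7.5, and the two t's are about 2.7 and 3.5, close to the reported values of 7.56, 2.72, and 3.55. The small discrepancies are consistent with rounding in the tabled means and SDs.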


Confidence of Conformers and Independents 


For the purposes of this and subsequent 
analyses, conformers were separately defined 
for each condition as subjects who yielded 
on more than the mean number of conforming 
trials. Subjects denoted as independents conformed
on fewer than the mean number of conforming
trials.
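
The classification rule amounts to a mean split within each condition. A minimal sketch follows, with hypothetical names. The source does not say how a subject exactly at the mean would be treated, so that case is left unclassified here.

```python
from statistics import mean


def classify_by_mean_split(conforming_trials):
    """Label each subject in one condition by comparison with the condition mean."""
    cutoff = mean(conforming_trials)
    labels = []
    for count in conforming_trials:
        if count > cutoff:
            labels.append("conformer")
        elif count < cutoff:
            labels.append("independent")
        else:
            labels.append("unclassified")  # assumption: the source is silent here
    return labels
```

Because the split is made separately for each condition, a subject who yielded on seven trials would count as a conformer in the Control group (mean 5.68) but as an independent in the Betting group (mean 8.42).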

The confidence of conformers and in- 
dependents was examined by comparing their 
expectancy statements and bets on three 
categories of trials: those on which subjects 
yielded, trials on which they remained in- 
dependent, and the four noncritical trials on 
which subjects' confidence statements and
judgments preceded those of the confederates. 
Tables 2 and 3 present, respectively, the ex- 
pectancy statement and betting patterns on 
these three types of trials for conformers and 
independents.* 


*Certain differences were found between males 
and females in expectancies and bets, particularly 
in the overall amount bet (M > F, t = 2.64, p < .02). 
However, the patterns of expectancy statements 
and bets for males and females were very similar, 
and the analyses of confidence in each group were 
accordingly based on the combined results of 
males and females. 


TABLE 2
MEAN EXPECTANCY STATEMENTS OF CONFORMERS AND INDEPENDENTS ON THE THREE CATEGORIES OF TRIALS 
Trial category 
Average expectancy 
Group N Conforming Independent Noncritical 
M SD M SD M SD M SD 
rus 
les 6 | 420 221 4.50 141 547 242 0.86 
Females 8 | 490 | 126 | 529 | 145 | zar | 148 | Sm | 125 
Combined 14 4.62 1.10 4.95 1.30 6.55 247 5.1 147 
Tos pendents 
ales 5 6.20 0.84 6.98 0.72 7.15 1.76 7.07 0.99 
Pemses | 11 | 383 | 209 | 629 | 144 | 7 | 149 | 620 | 110 
‘combine 16 4.57 2.29 6.49 1.29 7.23 1.55 6.46 1.12 

TABLE 3
MEAN AMOUNT BET BY CONFORMERS AND INDEPENDENTS ON THE THREE CATEGORIES OF TRIALS 
Trial category 
Average bet 
Group N Conforming Independent Noncritical 
M SD M SD M SD M SD 
Conformers 
Males 5 230 A1 .30 13 32 44 33 11 
Females 17 27 06 17 09 24 -13 23 07 
Combined 22 29 .08 .20 11 26 Hg 25 09 
Independents 
Males 5 31 18 37 40 .39 13 36 42 
Females 13 27 .09 24 B .36 A1 29 .09 
Combined 18 28 42 28 A2 37 42 31 10 


The analysis of variance of conformers’ 
and independents’ expectancy statements is 
presented in Table 4. Conformers tended to 
express a lesser degree of confidence in their 
judgments than independents, and this dif- 
ference was significant on trials on which 
subjects remained independent (t = 2.69,
p<.02). Conformers and independents did 
not differ in their declarations of confidence on 
conforming or noncritical trials. There was a 
highly significant trial category effect: Least 
confidence was expressed by both conformers 
and independents on conforming trials, and 
independent and noncritical trials followed 
in an increasing order of expressed confidence. 

Although failure of the interaction term 
to attain significance renders internal com- 
parisons somewhat tenuous, we compared the 
expectancies on the three trial categories for 
conformers and independents separately. The 


TABLE 4
ANALYSIS OF VARIANCE OF THE EXPECTANCY
STATEMENTS OF CONFORMERS AND
INDEPENDENTS

Source                          df         MS          F
Between groups                  29
   Conformers versus
     Independents (A)            1     330.70       3.20*
   Error (b)                    28     103.33
Within groups                   60
   Trial category (B)            2   1,015.48      17.26**
   A × B                         2      92.24       ns
   Error (w)                    56      58.84


confidence estimates of conformers on con- 
forming and independent trials did not differ 
significantly, while independents stated sig- 
nificantly higher expectancies on independent 
trials relative to conforming trials (t = 3.27,
p < .01). Conformers were more confident
on the noncritical trials than on either independent
(t = 2.66, p < .02) or conforming
(t = 3.30, p < .01) trials. Independents were
more confident on noncritical as compared
to conforming trials (t = 4.96, p < .001),
but their expectancies on noncritical trials
were only slightly higher than on independent
trials (t = 1.69, p < .10).

The summary of the analysis of variance
of betting patterns is presented in Table 5.
An analysis of the initial main effect indicates
that conformers tended to bet less on independent
trials than did independents (t = 1.95,
p < .07) and were also less confident
on noncritical trials than independents (t =
2.48, p < .02). Additionally, there were significant
differences between trial categories.
Of major importance in this analysis is the
significant interaction, indicating differences
in the betting patterns of conformers and
independents.


TABLE 5
ANALYSIS OF VARIANCE OF THE BETS OF
CONFORMERS AND INDEPENDENTS

Source                        df        MS        F
Between groups                39
  Conformers versus
    Independents (A)           1    638.43     3.76*
  Error (b)                   38    169.95
Within groups                 80
  Trial category (B)           2    507.17    10.95***
  A × B                        2    268.98     5.81**
  Error (w)                   76     46.30

* p < .07.
** p < .005.
*** p < .001.
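
The F ratios printed in Table 5 can be checked by dividing each mean square by its error term. The short sketch below does this under the assumption, usual for this kind of mixed design but not stated in the text, that the group effect (A) is tested against Error (b) and the trial category (B) and A × B terms against Error (w). The variable and function names are illustrative only.

```python
# Mean squares as printed in Table 5.
MS_GROUPS = 638.43       # Conformers versus Independents (A)
MS_ERROR_B = 169.95      # Error (b): between-subjects error term
MS_TRIALS = 507.17       # Trial category (B)
MS_INTERACTION = 268.98  # A x B
MS_ERROR_W = 46.30       # Error (w): within-subjects error term


def f_ratio(ms_effect, ms_error):
    """Return the F ratio for one effect given its error mean square."""
    return ms_effect / ms_error


# Assumed pairing of effects with error terms (see the paragraph above).
f_values = {
    "A": f_ratio(MS_GROUPS, MS_ERROR_B),
    "B": f_ratio(MS_TRIALS, MS_ERROR_W),
    "A x B": f_ratio(MS_INTERACTION, MS_ERROR_W),
}

for effect, value in f_values.items():
    print(f"{effect}: F = {value:.2f}")
```

Under this pairing the computed ratios agree with the tabled values of 3.76, 10.95, and 5.81 to two decimals.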

Conformers were much less confident when 
resisting the influence of the group than on 
trials on which they reported wrong judg- 
ments in alignment with the confederates 
(t = 3.73, p < .01). In the case of independ- 
ents, divergence from the group judgment 
did not result in reduced confidence. On the 
noncritical trials, independents bet consider- 
ably more than on either their conforming 
or independent trials (t’s, respectively, of 
3.55 and 3.87, p < .001). Conformers, on the 
other hand, bet less on independent than 
on noncritical trials (t = 2.73, p < .01), but
did not differ in the amount bet on conform- 
ing and noncritical trials. 


Sequential Changes in Conformity and Confidence


To clarify the underlying processes mediat- 
ing the unexpected effects of the Betting con- 
dition, the amount of conformity and the 
measures of confidence for the first half of
the experimental situation were compared
with conformity and confidence in the second
half. Of interest in these comparisons were
possible changes in yielding and any increments
or decrements in confidence on conforming
trials as the task progressed. If
increased yielding and greater confidence on 
conforming trials in the last half were found 
among Betting group conformers as opposed 
to independents and as compared to con- 
formers in the Expectancy group, suggestive 
evidence for a process of inconsistency re- 
duction would be afforded. The rationale for 
this hypothesis is that conforming under con- 
ditions of increased investment in the out- 
come of one's judgment is incongruous when 
the correct choice is unambiguous. Having 
made a commitment to alignment with the 
group, it is necessary to justify this behavior. 
Accordingly, the Betting group conformer 
would be expected to continue or even to 
increase his influenced judgments and to 
show greater confidence as the situation progressed
in an attempt to rationalize a tenuous
and objectively unsupportable decision. It is
not as likely that this process of inconsistency 
reduction would hold under conditions of 
lesser personal involvement. 

For the comparison of the number of 
conforming trials in the first versus the last 
half, subjects were divided into those con- 
forming more on the first 10 trials and sub- 
jects conforming equally on both halves or 
more in the last half. Contrasting conformers 
in the Betting group with their counterparts 
in the Expectancy and Control groups, 19 of 
22 Betting conformers yielded at least as 
often or more in the last half (χ² = 11.64,
p < .001), while the corresponding proportion
for Expectancy and Control conformers
was 13 of 23 (χ² = .39, ns). The same
comparison of independents revealed no dif- 
ference in yielding initially versus later. 

A similar analysis was carried out for con- 
fidence statements on conforming trials, con- 
trasting Betting group conformers with 
yielders in the Expectancy group in the rela- 
tive magnitude of expressed confidence in- 
itially versus later. Here, the number of 
subjects who were initially more confident 
was compared to the number whose confi- 
dence was greater in the last half. Of 22 con- 
formers in the Betting group, 16 increased 
their confidence as the task progressed (χ² = 4.54,
p < .05), while 7 of 13 Expectancy
group conformers showed increments in 
confidence, a proportion not beyond chance 
expectation. Independents in the Betting 
group were about as likely to express greater 
confidence initially as later, and this was 
similarly the case among independents in 
the Expectancy group. These trends were 
not in evidence on either the independent or 
noncritical trials. 
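
The chi-square values reported in this section are consistent with a one-degree-of-freedom goodness-of-fit test of each observed split against an even (50/50) division, without a continuity correction. The source does not name the exact procedure, so the sketch below is an assumption that happens to reproduce the printed figures; the function name is illustrative.

```python
def chi_square_even_split(count_a, count_b):
    """Chi-square statistic (1 df) for two observed counts tested
    against the expectation that both categories are equally likely."""
    expected = (count_a + count_b) / 2
    return ((count_a - expected) ** 2 + (count_b - expected) ** 2) / expected


# Observed splits taken from the text (count in the category, remainder).
comparisons = {
    "Betting conformers, yielding (19 of 22)": (19, 3),
    "Expectancy/Control conformers, yielding (13 of 23)": (13, 10),
    "Betting conformers, confidence (16 of 22)": (16, 6),
    "Expectancy conformers, confidence (7 of 13)": (7, 6),
}

for label, (a, b) in comparisons.items():
    print(f"{label}: chi-square = {chi_square_even_split(a, b):.3f}")
```

The first three results fall within rounding of the reported 11.64, .39, and 4.54; the fourth is far below any conventional critical value, in line with the statement that the proportion was not beyond chance expectation.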


Personality Measures 


LOA. As a first step, the yielding of the 
27 subjects with Patterns 1 or 3 (high con- 
fidence) was compared to the conformity of 
the 31 subjects with Patterns 2, 4, and 7. 
Contrary to prediction, the 2, 4, 7 group did 
not show significantly greater conformity. 
Upon examination, it became clear that the 
Pattern 2 subjects were more similar to the 
1 and 3 group than to the 4s and 7s. 

Repeating the analysis after the elimina- 
tion of the 10 subjects with Pattern 2 resulted 
in a significant difference (t = 2.25, p < .05),
with the Pattern 4 and 7 group conforming 
more (standard score M = 55.19) than subjects
with Patterns 1 or 3 (M = 48.11).⁵ An
even clearer differentiation was obtained when 
subjects with Patterns 3 and 4 were con- 
trasted. These patterns represent the greatest 
contrast in terms of our personality analysis 
of independents and conformers. Pattern 3 is 
indicative of the most confident, achievement 
oriented LOA, while Pattern 4, denoted as 
the achievement follower pattern by Rotter, 
represents the kind of goal setting behavior 
most analogous to social conformity. The 
mean for the Pattern 3 group was 44.11
(N = 9) as compared to a mean of 56.18
(N = 17) for the Pattern 4 subjects, yielding
a t of 2.54 significant beyond the .02 level.

The confidence of Pattern 1 and 3 subjects 
was compared to that of the 4 and 7 group 
by pooling the mean confidence estimates 
expressed in standard scores of the Ex- 
pectancy and Betting conditions. There was 
a trend for the 4 and 7 group to be less 
confident than the 1 and 3 group (46.83 
versus 51.31), but this difference was not 
statistically significant. 

Need for approval. Scores on the M-C 
SD scale were separately dichotomized at 
the mean of each condition for males and 
females in order to yield the high and low 
need for approval groups. Only in the case 
of females in the Control and Expectancy 
groups was the predicted relationship found 
between amount of conformity and scores on 
the scale. Expressed in standard scores, the
mean amount of conformity by high need for 
approval females was 55.21 as compared to 
46.35 for low need for approval subjects 
(t = 2.61, p < .02). No significant differences 
in the confidence of the high and low need 
for approval subjects were found in either 
condition. 

I-E. Scores on the I-E scale were dichoto- 
mized at 15— and 16+, a split previously 
found to be effective in discriminating in- 
ternals and externals. Preliminary analyses 
indicated comparable findings for males and 
females, and the following comparisons were 
carried out without regard to sex. In the
Betting group, externals conformed more than
internals (Ms, respectively, of 53.04 versus
45.83), yielding a t of 2.35 significant beyond
the .05 level. A slight but not significant
trend in the predicted direction was found in
the combined Control-Expectancy group.


5 Pattern 2 is a more commonly found pattern
than either 4 or 7 and signifies a minimum of
cautious, protective goal setting behavior.

While no differences in the confidence of 
internals and externals were found in the 
Expectancy group, patterns suggestive of 
differential confidence were obtained in the 
Betting group. Here, a tendency was found for 
externals to be less confident overall than 
internals (F = 3.58, p<.10), and there 
were, additionally, differences in the relative 
amounts bet on conforming and independent 
trials. Internals bet approximately the same 
on these two categories of trials; externals 
bet significantly less on independent trials 
than they did on trials on which they yielded 
(t= 2.68, p < .02), a result similar to that 
obtained for conformers in this condition. 
Also, the greatest differentiation between in- 
ternals and externals in amount bet occurred 
on independent trials. 

Combinations of personality measures. The 
results to be presented in this section are 
based on the SLT analysis of conflict and 
maladjustment and are predicated on the 
general proposition underlying this study 
that social conformity represents an attempt 
to cope with conflicting demands by avoidant 
means. In SLT, conflict occurs and mal- 
adjusted behavior is probable when a goal 
is positively valued but the individual has a 
low expectancy of goal attainment. The M-C 
SD scale may be considered as an index of 
the relative value of, or preference for, ap- 
proval and affectional reinforcements, and 
LOA may be regarded as a measure of ex- 
pectancy. In terms of the foregoing analysis, 
high need for approval subjects with LOA 
Patterns 4 and 7 should constitute the con- 
flict group and should display a much greater 
amount of conformity than low need for 
approval subjects with LOA Patterns 1 or 3. 
The mean amount of conformity in standard
scores of the high need for approval-Pattern
4 and 7 group was 57.44 (N = 9), in contrast
to a mean of 43.69 (N = 13) for the
low need for approval-Pattern 1 and 3 group.
The resulting t was 3.71 significant beyond
the .01 level.

A similar procedure, predicated on the 
same rationale, was followed by combining
the need for approval and I-E scales. High
need for approval externals (N = 19) were
compared with low need for approval internals
(N = 16) in the amount of conformity.
The resulting means were, respectively,
54.32 and 46.00, yielding a t of 2.57
significant beyond the .02 level.


Discussion 
Situational Effects on Conformity 


In accounting for differential effects of the 
Betting condition on conformity, the common 
sense prediction would hold that rewarding 
independence, and concomitantly punishing 
conformity, would tend to reduce yielding to 
a minimum. Whatever the determinants of 
conformity in situations involving less per- 
sonal commitment, directly rewarding reliance 
on the individual’s own ability should tend 
to obviate the effects of social influence. 
Contrary to such a view, the Betting condi- 
tion produced significantly more conformity 
for females and not appreciably less yielding 
for males. 

To explain the effects of the Betting con- 
dition, it might be argued that the introduc- 
tion of betting serves to increase the credi- 
bility of the group. That is, it is inconceivable 
to the naive subject that the other group 
members would bet to lose money. 

It would appear that a primary effect of 
greater credibility in this context is to render 
the individual less certain of his own abilities 
and the validity of his own judgments. As a 
consequence of decreased certainty, the con- 
former in the Betting group apparently finds 
himself yielding more and having a greater 
need to justify this behavior to himself and
others. It was demonstrated that Betting 
group conformers yielded as much or more in 
the later stages of the task and displayed 
increasing confidence on their yielding trials 
as compared to independents and Expectancy 
group conformers. These findings suggest 
the operation of a process of inconsistency 
reduction.⁶ This interpretation is necessarily
tentative, but it suggests, for Betting group
conformers, a process of defensive rationalization
by which the individual masks his
inability to resist the erroneous influence of
the group. In support of this interpretation,
the explanations offered by these subjects in
the post-experimental interviews revealed a
striking lack of correspondence with their
actual behavior. To be noted is the fact
that the tendency to increase conformity and
confidence as the task progressed operated
alike for males and females.


Conformity and Confidence


In both the Expectancy and Betting conditions,
conformers tended to be less confident
than independents. The differences between
conformers and independents were more pronounced
in the Betting group, a finding consonant
with the effects of this condition as
reviewed above. The low expectancies of
personal adequacy revealed by conformers
in this situation were especially evident on
those trials on which they resisted the influence
of the group. These findings are generally
supportive of an expectancy theory
which predicates conformity on lack of self-confidence
or low expectancies of success in
situations of social evaluation and in which
conformity is regarded as failure-avoidant,
self-protective behavior.


Personality Measures


The results with each of the three personality
measures indicate the degree to which
self-evaluative behaviors are complicated by
differences in sex and by situational influences.
Nevertheless, our findings support
the view that confidence in one's abilities is
to a degree generalized and trans-situational.
The disposition to subordinate one's abilities
to the group is clearly related to failure-avoidant
LOA patterns on a motor skills
task. Further, those who conform under conditions
of high personal involvement tend to
believe that they have less personal control
over their lives and regard the occurrence
of reinforcements as outcomes of luck, fate,
or powerful others.


6 There are apparent similarities between our
analysis of inconsistency reduction and Festinger's
(1957) theory of cognitive dissonance. However,
our procedure cannot be considered as a formal
test of the propositions of dissonance theory.
CONFORMITY AND PERSONALITY COMMITMENT


Conformity and Conflict 


The applicability of the SLT analysis of 
conflict to the problem of conformity is sug- 
gested by the marked differences in yielding 
which were obtained when the measure of the 
need for approval was, respectively, paired 
with the two expectancy measures, the LOA 
Board and the I-E scale. These results imply 
that the conformer can be conceptualized as 
an individual who has a high need for ap- 
proval and affection from others, but rela- 
tively low expectancies that these much de- 
sired reinforcements can be obtained as a 
result of his own abilities and efforts. His low 
evaluation of himself and his fear of social 
rejection result in a strong disposition to
conform. Although there is no direct evidence
bearing on the point, it is by no means clear 
that these subjects can explain the nature 
of the conflict to themselves and account 
for their own behavior. Indeed, approval 
motivated individuals tend to represent them- 
selves in a highly positive and stereotypically 
acceptable manner on personality inventories 
(Crowne & Marlowe, 1960). It is suggested, 
thus, that they engage in defensive personal 
enhancement. Indirectly supporting this line 
of reasoning are the findings of Barron 
(1953) and Tuddenham (1959) that con- 
formers tend to describe themselves in favor- 
able terms on self-report devices. It must be 
said, however, that these results are more 
heuristic than definitive and will require 
cross-validation. 

The same cautious attitude must be ex- 
tended to additional findings as well. The 
effects of the Betting condition are not well 
delineated for males as a result of the small 
sample. A second case is afforded by the 
two personality inventories, which imperfectly 
conformed to prediction. Third, there is the 
issue of the generality or situational relevance 
of self-confidence and self-evaluative behavior. 

Beyond these specific reservations, the 
general convergence of a set of methodolog- 
ically independent indices of self-evaluation 
and expectancies of success on conforming 
behavior suggests the utility of the theoretical 
formulations underlying this study. The 
present findings imply a rather complex 
interaction of enduring dispositions to yield 
and influences specific to the situation in
the determination of conforming behavior. 
It would appear that as personal involvement 
in the conformity situation increases, de- 
fensive processes are set in motion by which 
to maintain self-esteem and to protect the 
individual from expected failure. 
SOCIAL AGREEMENT AND THE LEARNING OF
MATCHING BEHAVIOR ¹


DOUGLAS K. CHALMERS, WILLIAM C. HORNE, AND MILTON E. ROSENBAUM


State University of Iowa 


College Ss were exposed to 3 variations in amount of agreement by a partner
in responding to a judgmental task. Training to imitate the partner was
then carried out. The competence of the partner was varied during the
imitation trials. During the 1st block of imitative trials, reciprocation of
agreement was indicated. Over the remaining trials, amount of prior agreement
was inversely related to imitation. High Model’s Competence led to
greatest facilitation of imitative learning following disagreement. Variations
in social agreement were interpreted as affecting imitative learning in a manner
similar to direct indications of task success or failure.


Within the context of social interaction, 
responses are made by individuals which then 
frequently receive the support of others or 
are rejected by them. It is likely that this type 
of differential treatment has consequences for 
succeeding interactional events. In the present 
experiment, the effect of prior experience of 
agreement or disagreement on the learning of 
an imitative response is examined. 

The experimental procedure requires that 
in an initial series of trials, the subject present 
judgments of a series of events with no 
information as to accuracy of his judgments. 
During this period, a partner (simulated) 
follows the subject on each trial in present- 
ing judgments to the same stimulus. Within 
three experimental conditions the frequency 
with which the partner agrees with the subject 
is varied. The next task provides that the
subject follow the partner in presenting judg- 
ments. Reports of accuracy are made and 
the subject is rewarded for matching his 
partner’s responses. During this task, informa- 
tion is provided to the subject as to the 
competence of his partner. 

In a task situation in which no reliable (i.e., 
physical) source of information concerning 
the accuracy of responses is made available, 
it is likely that other, possibly less reliable 
cues, will be employed to assess accuracy. 


1 This study was supported by a grant (G-16011) 
from the National Science Foundation to Milton E. 
Rosenbaum. The senior authors were, at the time 
of the study, participants in the National Science 
Foundation Undergraduate Research Participation 
Program. 


Responses of others, even in the absence of 
knowledge of the authority of these responses, 
may serve this function. Festinger (1950) has 
proposed that social agreement serves as an 
important source for maintenance of responses 
for which other means of assessment are 
lacking. If social agreement is similar in its 
effects to other cues designating accuracy, 
then frequent agreement by another in a 
specific task can be expected to connote 
success for the subject, while frequent dis- 
agreement may be expected to connote failure 
(ceteris paribus). 

Previous studies (Kanareff & Lanzetta, 
1960a, 1960b; Rosenbaum, Chalmers, & 
Horne, 1962) found that a prior experi- 
ence of failure leads to more rapid learning 
of imitative behavior than does an earlier 
success experience. To the degree that social 
agreement functions as success and disagree- 
ment as failure, a negative relationship be- 
tween degree of prior agreement and the rate 
of learning to match should appear in the 
present experiment. 

It is possible that social agreement has a 
different consequence than to reflect success 
or failure of the subject’s responses. Weiss 
(1957) and McGuire (1957) have found that 
earlier agreement by another increases the 
influence capability of that source on later 
issues. From these findings and other related 
formulations (Festinger, Back, Schachter, 
Kelley, & Thibaut, 1950; Newcomb, 1959) 
the following statement may be derived: one 
agrees with a person who agrees with him, 
i.e., agreement is reciprocated. This analysis 
leads to the hypothesis that a positive relationship
will appear between degree of prior
agreement and learning of the matching re- 
sponse—a prediction contrary to the previous 
hypothesis. 

The character of the other may influence 
the manner in which the consequences of 
variations in agreement are expressed. Rosen- 
baum and Tucker (1962) have demonstrated 
that the greater the apparent competence of 
the model, the greater is the facilitation of 
imitative learning. In the present context, it 
may be that only when the model is competent 
will the effects of prior agreement be mani- 
fest. An incompetent model may be regarded 
as a relatively irrelevant source by all subjects,
regardless of their prior experience.
That this may not be the case is suggested 
by Weiss’ (1957) finding that even a nega- 
tively valued, untrustworthy source gains in 
positive influence after having on a prior 
occasion been in agreement with the recipient 
of influence. The present experiment includes 
variations in the model’s competence during 
the period of training to imitate in order to 
evaluate potential interactive effects with the 
social agreement variable. 


METHOD 


The apparatus was identical and the procedure 
similar to that employed in previous experiments 
(Rosenbaum et al., 1962; Rosenbaum & Tucker, 
1962). Two subjects participated in each experi- 
mental session and were assigned to separate rooms. 
Each subject was seated facing a panel which 
was divided into two vertical sections. The upper 
third of the left section was labeled “A”; the upper 
third of the right section was labeled “B.” The lower 
two-thirds of each vertical section was divided into 
two compartments, one of which was labeled WIN,
and the other LOSE. Between the upper one-third and
the lower two-thirds of each vertical section there
was a green and a red jeweled lamp.

The task required that the subject predict the
outcomes of a series of simulated horse races. The
subject was informed by a sign on the wall of his
individual room that he was “Subject B.” Each
subject was thereby led to believe that the other
subject was “Subject A.” The subject was instructed 
to think of the race as involving his horse running 
against a field of horses and his task was to guess 
whether or not his horse would win or lose the race 
by appropriately moving a response lever, situated 
on the arm of the subject’s chair. The subject’s 
guesses were illuminated on the panel in his particular 
room. 
The experimenter, in a third room, by means of a
central control panel and proceeding from programed 
instructions, provided the signal to respond and 
simulated “A’s” responses. This was done by il- 
lumination of appropriate portions of the subject's 
panel. Reports of accuracy were given by the jeweled 
lights. 

For an initial series of 50 trials, the instructions 
required that on each trial the subject (B) and his 
“partner” (A), in that order, predict the outcome
of the horse races. No information as to the accuracy 
of responses was presented during this period. The 
experimenter presented the responses of A according 
to three prearranged schedules which produced the 
variations in degree of social agreement. In a High 
Agreement (HA) condition, A's response was identical
with the subject's on 9 of every 10 trials. A
Moderate Agreement (MA) condition involved iden- 
tity of A's and the subject's responses on 5 in each 
block of 10 trials. A Low Agreement (LA) condition 
was created by presenting A's response as identical 
to the subject's on 1 of 10 trials. 
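
As a rough illustration of how such prearranged agreement schedules could be constructed, the sketch below generates the simulated partner's responses block by block. Only the number of agreeing trials per block of 10 (9, 5, or 1) comes from the text. The function names, the "win"/"lose" string codes, and the use of a seeded random draw to decide which trials within a block agree are assumptions made for illustration.

```python
import random

BLOCK_SIZE = 10
# Agreeing trials per block of 10, as described in the text.
AGREEMENTS_PER_BLOCK = {"HA": 9, "MA": 5, "LA": 1}


def opposite(response):
    """Return the other of the two possible predictions."""
    return "lose" if response == "win" else "win"


def simulated_partner_responses(subject_responses, condition, seed=0):
    """Build A's responses so that, within each block of 10 trials,
    A agrees with the subject on the scheduled number of trials.

    Which trials agree is drawn at random here (an assumption); the
    original schedules were prearranged by the experimenter.
    """
    rng = random.Random(seed)
    n_agree = AGREEMENTS_PER_BLOCK[condition]
    partner = []
    for start in range(0, len(subject_responses), BLOCK_SIZE):
        block = subject_responses[start:start + BLOCK_SIZE]
        agree_at = set(rng.sample(range(len(block)), min(n_agree, len(block))))
        for position, response in enumerate(block):
            partner.append(response if position in agree_at else opposite(response))
    return partner


# Example: 50 first-series trials from a hypothetical subject.
subject = ["win", "lose"] * 25
for condition in ("HA", "MA", "LA"):
    partner = simulated_partner_responses(subject, condition)
    agreements = sum(s == p for s, p in zip(subject, partner))
    print(condition, agreements, "agreements in 50 trials")
```

Over the 50 initial trials this yields 45, 25, and 5 agreements for the HA, MA, and LA conditions respectively, whatever the subject's own pattern of guesses.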

Following these initial trials, the instructions indi- 
cated that A was to respond first and the subject 
second on each trial and that reports of accuracy 
would be provided on each trial. A’s win or lose
response and one of the jeweled lamps (indicating
accuracy) remained illuminated during the subject’s
portion of the trial. The responses of A, as before,
were presented by the experimenter. In every block
of 10 trials, half of A’s responses were win and
half were lose in a random pattern. In order to
permit variation of the accuracy of A independent
of the relevance of A’s response for the subject’s
performance, the subject was told to think of his
horse as running at all times on a different race track
from that of A’s horse, and that the two horses
were never competing with each other. The designation
of the accuracy of the responses of A was made
according to two programed schedules which defined 
the experimental conditions for the differential com- 
petence of the model. For one half of the subjects, 
A was designated correct on 80% of the trials in 
each block of 10; this constituted the High Model’s 
Competence (HMC) condition. The other half of the 
subjects were given the Low Model Competence 
(LMC) condition, in which A was designated cor- 
rect on 20% of the trials in each block of 10. 

The designation of the correctness of B's (the subject's)
win or lose response was dependent on A's response.
It was arranged that B would be correct on 80%
of the trials if he consistently matched A’s responses
whether A was correct or incorrect. Thus, on two
previously chosen trials of each block of 10, B was 
arbitrarily designated as incorrect if he matched A’s 
response. One hundred trials were presented in the 
second series. 
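
The bookkeeping for the second series can be sketched as follows. The per-block counts (A designated correct on 8 or 2 of 10 trials; a matching response designated incorrect on 2 of 10 trials; 100 trials in all) are taken from the text. Everything else is an assumption for illustration: the names, the random choice of which trials carry each designation, and the independence of the two draws. The sketch scores only a subject who always matches, because the text does not say how nonmatching responses were designated.

```python
import random

BLOCK_SIZE = 10
N_BLOCKS = 10  # 100 trials in the second series
A_CORRECT_PER_BLOCK = {"HMC": 8, "LMC": 2}
MATCH_MARKED_INCORRECT_PER_BLOCK = 2


def build_block(condition, rng):
    """Return one block of trial designations.

    Each trial records whether A is shown as correct and whether a
    matching response by B would be designated correct.
    """
    a_correct = set(rng.sample(range(BLOCK_SIZE), A_CORRECT_PER_BLOCK[condition]))
    penalized = set(rng.sample(range(BLOCK_SIZE), MATCH_MARKED_INCORRECT_PER_BLOCK))
    return [
        {"a_correct": i in a_correct, "match_is_correct": i not in penalized}
        for i in range(BLOCK_SIZE)
    ]


def build_series(condition, seed=0):
    """Return all blocks of the second series for one competence condition."""
    rng = random.Random(seed)
    return [build_block(condition, rng) for _ in range(N_BLOCKS)]


def consistent_matcher_score(series):
    """Number of trials on which a subject who always matches A is
    designated correct."""
    return sum(trial["match_is_correct"] for block in series for trial in block)


for condition in ("HMC", "LMC"):
    series = build_series(condition)
    a_total = sum(trial["a_correct"] for block in series for trial in block)
    print(condition, "A correct:", a_total, "matcher correct:", consistent_matcher_score(series))
```

The point of the arrangement shows up in the output: a consistent matcher is reinforced on 80 of 100 trials in both conditions, while A's own designated accuracy is 80 or 20 of 100.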

Seventy-two subjects from the introductory psy- 
chology course were given credit for experimental 
participation. Six males and six females were assigned 
in pairs of the same sex to each of six conditions 
created by the three variations in prior agreement 
and two variations in the “partner’s” competence.
RESULTS


In Figure 1 are presented the mean fre- 
quencies of matched responses over blocks of 
trials as a function of Agreement treatments. 
In the first block of trials, the HA group 
presented a greater mean number of matched 
responses than either the MA or LA groups. 
Over the course of the remaining blocks of 
trials, the mean frequency of matched re- 
sponses presented by the LA group is greater 
than obtained from the other two groups. 

The analysis of the first block scores pre- 
sented in Table 1 indicates a significant 
Agreement effect. The mean for the HA group 
is significantly greater than the means for the 
MA and LA groups (Table 2), which do not 
differ from each other (by critical difference
= .82, Lindquist, 1953). The significant
Agreement × Sex interaction in Table 1 indicates
that the primary contribution to the
initial block differences is made by the female 
subjects in the HA condition. 
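
The pairwise comparison described above amounts to checking each difference between first-block means against the critical difference of .82. A minimal sketch, using the Block 1 means from Table 2 and illustrative names, is:

```python
from itertools import combinations

# Block 1 means from Table 2 and the critical difference cited in the text.
BLOCK1_MEANS = {"High": 5.75, "Moderate": 4.54, "Low": 4.67}
CRITICAL_DIFFERENCE = 0.82


def compare_pairs(means, critical_difference):
    """Return, for each pair of conditions, the absolute difference between
    means and whether it exceeds the critical difference."""
    results = {}
    for (name_a, mean_a), (name_b, mean_b) in combinations(means.items(), 2):
        difference = abs(mean_a - mean_b)
        results[(name_a, name_b)] = (round(difference, 2), difference > critical_difference)
    return results


for pair, (difference, exceeds) in compare_pairs(BLOCK1_MEANS, CRITICAL_DIFFERENCE).items():
    print(pair, difference, "exceeds" if exceeds else "does not exceed")
```

The High mean differs from the Moderate and Low means by 1.21 and 1.08, both above .82, while the Moderate and Low means differ by only .13.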

Analysis of the matched responses over all
blocks of trials (Table 3) indicates that the
main effect for Agreement conditions is just
short of significance (.05 < p < .10). If only
the responses in the blocks of trials following
the initial block are considered, this
comparison achieves significance (F = 3.76,
df = 2/60) with the Low Agreement group
significantly different from the other two
groups which do not differ from each other.
Table 2 presents the relevant means.


TABLE 1
ANALYSIS OF VARIANCE OF FREQUENCIES OF MATCHED
RESPONSES IN THE FIRST BLOCK OF TRIALS

Source                        df      MS       F
Agreement (A)                  2    10.60    5.30**
Model’s Competence (MC)        1      .34    0.17
Sex (S)                        1      .34    0.17
A × MC                         2     1.26    0.63
A × S                          2     6.43    3.21*
MC × S                         1      .02    0.01
A × MC × S                     2     3.93    1.96
Error                         60     2.00

* p = .05.
** p = .01.

The significant Model’s Competence × Sex
interaction in Table 3 indicates that male 
subjects match significantly more frequently 
in the HMC condition than in the LMC 
condition while no difference appears for 
females between Competence conditions; in 
fact, for females a nonsignificant trend in 
the reverse direction appears. 

The significant main effect for Trials re- 
flects the general tendency of all groups to 
increase in mean frequency of matched responses.
The Trials × Agreement interaction
seems primarily to be a product of the relationships
in the first block of trials which
are modified in succeeding trials. In analysis
of the blocks of trials beyond the first block,
the Trials × Agreement interaction does not
approach significance (p > .10).

The Trials × Agreement × Sex interaction
seems to reflect two trends. First, the differ- 
ence between LA and the other Agreement 
groups becomes increasingly greater for males 
over trials than for females. Second, while for 
males the curves reflecting increasing matched 
responses are relatively smooth, for females 
the curves are quite erratic, especially in the 
HA group. 

TABLE 2
MEAN FREQUENCIES OF MATCHED RESPONSES BY
AGREEMENT CONDITIONS

Agreement     Block 1    Blocks 1-10    Blocks 2-10
High            5.75        6.01           6.04
Moderate        4.54        5.83           5.98
Low             4.67        6.54           6.79
TABLE 3
ANALYSIS OF VARIANCE OF FREQUENCIES OF MATCHED
RESPONSES IN TEN BLOCKS OF TRIALS

Source                        df       MS            F
Agreement (A)                  2     32.44         2.88
Model's Competence (MC)        1      5.87          —
Sex (S)                        1     35.11         3.12
A × MC                         2     16.67         1.48
A × S                          2      8.45          —
MC × S                         1     49.61         4.41*
A × MC × S                     2     12.20         1.08
Error (b)                     60     11.25

Trials (T)                     9     30.76        21.36**
T × A                         18      2.98      [illegible]
T × MC                         9   [illegible]  [illegible]
T × S                          9      1.44         1.00
T × A × MC                    18      2.08         1.44
T × A × S                     18      2.57         1.77*
T × MC × S                     9      3.50         2.41*
T × A × MC × S                18      2.40         1.66*
Error (w)                    540      1.44

Note.—Error (w) was employed to evaluate all terms involving
Trials while all other terms were evaluated by means
of Error (b).
The Trials × Model’s Competence × Sex
interaction is derived from the events described
above in relation to the Model’s Competence
× Sex interaction but on some trials
fluctuations occur in which male subjects 
respond with as much matching in the LMC 
condition as in the HMC condition. 

The quadruple interaction appears to in- 
corporate a variety of relationships. One ap- 
pears to be the greater tendency, on the first 
block only, for females in the HA condition to
make a greater number of matched responses 
than males especially in the HMC situation. 
Separate analyses were conducted for each 
of the Model's Competence conditions (see 
Figure 2). The LMC analysis yielded as the 
only significant terms a main effect for Trials 
and a Trials × Sex interaction. This latter
effect is based on an essentially smooth 
acquisition trend for females while males 
manifest considerable fluctuation in level of 
matching responses over the blocks of trials. 

The HMC analysis indicates significant 
main effects for Agreement and for Sex. The 
former result indicates greater matching in 
the LA condition than the other Agreement 
conditions while the sex difference is produced 
by greater matching by males than by 
females. A significant Trials X Agreement 
interaction reflects the steeper slopes of the 
LA and MA conditions in contrast to the 
flatter slope of the HA condition. Finally, 
a significant triple interaction of Trials X 
Agreement X Sex indicates that the differences 
among Agreement conditions are greater for 
males than for females. 


Discussion 


It appears that following the first judgment 
period wherein variations in social agreement 
were employed, when the opportunity arises 
to reflect the effect of this event, subjects in 
the HA condition tend to reciprocate the 
treatment by matching their partner in the 
initial block of training trials. In accord with 
other observations of the affiliation process, 
it appears that if one receives social support, 
one gives social support, or, as phrased 
earlier, one agrees with the person who agrees 
with him. 

It does appear that almost complete agree- 
ment as presented in the HA condition is 
necessary to produce this reaction. Although 
surprisingly in some groups the LA condition 
produced greater matching than the MA 
condition, the overall difference was negligible. 
Somewhat more than 50% agreement seems 
necessary to produce the reciprocal response. 
A parametric study of varying percentages 
of agreement would aid in clarifying these 
relationships. 

After the initial block of trials, what is 
essentially a reversal took place. Prior LA 
resulted in rapid acquisition of the matching 
response while HA and MA, although in 
general associated with increases above initial 
frequencies, did not attain levels approxi- 
mating the percentage of reinforcement. It 
now appears that prior agreement variations 
affected performance much as designation of 
success or failure (Kanareff & Lanzetta, 
1960a, 1960b; Rosenbaum et al., 1962). 
A subject, after experiencing consistent lack of 
agreement of another in task related judg- 
ments, may react with lack of confidence in 
his ability to cope with the assigned problem 
and look to the other person for cues as to 
correct behavior. 

As an alternative interpretation, it is pos- 
sible that disagreement on the part of the 
partner may suggest to the subject that his 
partner was quite confident of his capability 
to respond accurately. On the other hand, 
consistent agreement may reflect to the sub- 
ject a dependency reaction on the part of the 
partner which, although initially responded 
to favorably, may preclude utilization of 
the partner’s behavior as a reliable cue for 
problem solution. The hypothesis may be 
offered that had a new partner been intro- 
duced for the second series of judgments the 
effects of the agreement variations would have 
been much reduced, if apparent at all. 

The results presented by the male subjects 
in response to the Model's Competence varia- 
tion are similar to those obtained by Rosen- 
baum and Tucker (1962) who also found 
facilitation of imitative learning in the HMC 
condition. 

If, as has been suggested, amount of 
agreement on task related responses can be 
considered to affect the subjects in the same 
manner as success and failure experiences, 
the results for male subjects in the present 
experiment are similar to previous findings. 
Mausner and Bloch (1957) studied matching 
behavior in a situation in which accuracy 
was not reported, and found an interaction 
between prior success and failure and the 
Model's Competence variables in which 
Failure subjects matched more frequently 
than Success subjects when the model was 
presented as competent but no difference ap- 
peared when the model was incompetent. This 
is in contrast to the results of a study 
(Rosenbaum et al., 1962) in which train- 
ing to imitate was carried out. In this latter 
study, no interaction between the variables 
appeared but each variable was found to be 
independently related to matching behavior. 

The differences in results may be due to 
differential potency in the procedures for pre- 
senting success and failure experiences. Rosen- 
baum et al. (1962) arranged that subjects 
in their Success condition be correct on 80% 
of the pretrials and in the Failure condition 
correct on 20% of the pretrials. Mausner and 
Bloch employed 86% and 14%, respectively. 
In the present experiment, the extremes of the 
Agreement conditions were 90% and 10%. It 
may be that the prior experience must have 
dramatic qualities before attention is given 
to the competence characteristics of a model. 
A deliberate parametric study would be 
necessary for examination of this suggestion. 

The sex differences in the present experi- 
ment are similar in some respects to other 
findings on imitative learning (Kanareff & 
Lanzetta, 1960a, 1960b) in the suggestive 
evidence that males to a greater extent than 
females appear to differentiate among varia- 
tions in relevant stimuli. Kanareff and Lan- 
zetta report that males more than females 
respond to differences in reinforcement proba- 
bilities while in the present experiment males 
appear to respond differentially to Model’s 
Competence variations and no significant dif- 
ference is indicated for females in relation 
to this variable. In addition, the males are 
somewhat more responsive to the agreement 
manipulation. 

As Kanareff and Lanzetta have suggested, 
based on questionnaire data, the males may 
be more disposed toward employing available 
cues in order to achieve success while females 
are somewhat inhibited in employing what 
may be regarded as illegitimate utilization 
of another person's behavior. 


This study dealt with the effect on Negro efficiency of variations in degree 
of stress and in the race of other persons in the task situation. 115 Negro 
students at Fisk University performed individually a digit-letter substitution 
task in the presence of an administrator and a confederate posing as another 
S, both of whom were either white or Negro. In addition, Ss were told to 
expect either mild or strong nonavoidance electric shocks while working. The 
main findings, that (a) performance was better in White-Mild Threat than 
in Negro-Mild Threat, and (b) Strong Threat, as compared with Mild 
Threat, was more detrimental to performance in the White condition than in 
the Negro condition, are consistent with the hypothesis of an inverted 
U shaped relationship between arousal and performance. In addition, there 
was evidence of an inverted U shaped function between Manifest Anxiety 
Scale scores and 1st trial performance in the White condition. The results 
have implications for interpreting various types of Negro performance in- 
cluding scores on intellectual tests in racially mixed environments. 


An extensive literature on the intelligence 
test performance of Negroes, which has re- 
cently been reviewed by Dreger and Miller 
(1960), presents a general picture of lower 
scores by Negroes than by whites. Although 
the interpretation of this fact has been hotly 
debated, the controversy has not produced any 
adequately controlled studies of motivational 
and situational factors that may influence 
Negro test taking behavior. The present 
experiment was designed to examine the ef- 
fects of variations in level of general anxiety, 
racial environment, and degree of personal 
threat on the performance of Negroes. 

Two recent experiments (Katz & Benjamin, 
1960; Katz & Cohen, 1962) indicate that 
northern Negro college students tend to be- 
come anxious and underproductive when re- 
quired to solve problems cooperatively with 
white partners. The relatively poor perform- 
ance of Negro subjects can be interpreted 
as an effect of personal threat experienced in 
the biracial situations. But it is not clear in 


¹ This research was carried out under Contract 
Nonr 285 (24) between the Office of Naval Research 
and New York University. 

The authors are indebted to S. Oliver Roberts for 
generously providing experimental facilities at Fisk 
University. Edgar Epps, Charles Esser, Joseph 
Dunkelberger, and Ted Smith helped conduct the 
experiment. 


which way the experience of threat adversely 
influenced performance. Given the task inter- 
dependence of Negro and white teammates, 
Negroes may have suppressed correct re- 
sponses to prevent instigations to hostility in 
whites whose self-esteem they thought would 
be threatened by displays of Negro ability. 
Or they may have been motivated by feelings 
of incompetence to suppress interest in the 
task in order to escape having to compare 
themselves unfavorably with whites. However, 
a process quite different from suppression 
may have been operative, in which threatening 
features of the biracial situation aroused 
anxiety in Negroes to a high level, where 
their ability to respond appropriately to task 
relevant cues was impaired. Many studies 
have shown that stress tends to lower per- 
formance on certain tasks (cf. reviews by 
Lazarus, Deese, & Osler, 1952; Sarason, 
1960). This suggests the possibility that the 
stress experienced by Negro subjects had a 
detrimental effect upon their problem solving 
efficiency that was different from, and perhaps 
independent of, any intentional suppression 
on their part. 

The present experiment deals with the ef- 
fect of a white environment on Negro verbal 
performance when motives for suppression are 
presumably not strongly activated, i.e., when 
withholding of correct responses or disinterest 
in the task appears to have no defensive func- 
tion for the subject. Negroes worked in either 
a white situation or an all-Negro situation. 
To minimize the amount of social threat 
implicit in the white situation all subjects 
worked individually on a relatively nonthreat- 
ening task (digit-letter) that was described 
as unimportant, and had no interaction with 
a co-worker and no information about scores. 
The racial environment was varied by using 
an administrator and a co-worker who were 
both either Negro or white. (The reason we 
used two individuals, a co-worker as well as 
an administrator, as a racial environment, was 
to increase the likelihood that this variable 
would be effective.) In addition, subjects 
were subjected to a stressful stimulus of either 
high or low intensity, in order to compare the 
effect on Negro efficiency of an anxiety 
arousing condition in each of the two racial 
settings. Since our purpose in introducing 
stress was to determine whether Negro per- 
formance deteriorated more readily in the 
presence of whites than of other Negroes, it 
was necessary to use a stress stimulus that 
would not lead to suppression by increasing 
the apparent significance of the task. We used 
an announcement that severe electric shock 
(High Stress condition) or mild electric shock 
(Low Stress condition) would be adminis- 
tered to the subject and the co-worker at 
random times during the task. 

Duffy (1957), Malmo (1957), and others 
have hypothesized an inverted U shaped rela- 
tion between intensity of motivation, as indi- 
cated by degree of physiological arousal, and 
efficiency. In support of this, Stennett (1957) 
found that subjects scored higher on an audi- 
tory tracking task when instructions were 
moderately motivating than when they were 
strongly or weakly motivating. The optimal 
incentive condition was marked by intermedi- 
ate values on two physiological measures of 
arousal. In applying this hypothesis to the 
present study it should be noted again that 
Negro subjects were to be given low incentive 
instructions on a fairly dull task, in a situa- 
tion of limited social contact. Hence, in the 
Low Stress condition Negro subjects in the 
Negro environment presumably would be at a 
level of arousal that was too low for efficient 


563 


performance. On the other hand, Negro sub- 
jects in the white environment in the Low 
Stress condition might become moderately 
aroused, so that their total arousal would be 
closer to an optimal level for performance 
than that of subjects in the Negro condition. 
It also follows from the arousal performance 
hypothesis that the effect of the High Stress 
condition on performance should be different 
in the two racial settings. Since Negroes in 
the Negro condition would initially be in a 
state of low arousal, their total arousal after 
stress was introduced might be closer to an 
optimal level for performance than that of 
Negro subjects in the White condition, whose 
arousal would be increased by stress beyond 
its previously moderate level. Therefore, the 
arousal performance hypothesis leads to the 
prediction of a Stress X Racial Environment 
interaction effect on Negro subjects’ per- 
formance. 
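
The reasoning above can be illustrated with a toy calculation. The sketch below assumes, purely for illustration, a quadratic inverted-U function and additive arousal increments for racial environment and stress. The numerical values, the function `performance`, and the additivity assumption are hypothetical and are not taken from the study.

```python
def performance(arousal, optimum=2.0, peak=10.0):
    """Hypothetical inverted-U: efficiency drops with squared distance from the optimum."""
    return peak - (arousal - optimum) ** 2


# Hypothetical arousal contributions in arbitrary units (illustration only).
RACE_AROUSAL = {"Negro": 0.0, "White": 1.5}
STRESS_AROUSAL = {"Low": 0.5, "High": 2.0}


def predicted(race, stress):
    """Predicted efficiency when the two arousal sources simply add."""
    return performance(RACE_AROUSAL[race] + STRESS_AROUSAL[stress])


# Effect of raising stress within each racial environment.
stress_effect_negro = predicted("Negro", "High") - predicted("Negro", "Low")
stress_effect_white = predicted("White", "High") - predicted("White", "Low")

print("Negro environment, High minus Low:", stress_effect_negro)
print("White environment, High minus Low:", stress_effect_white)
```

Under these assumed values, added stress helps in the Negro environment and hurts in the White environment. Opposite-signed stress effects of this kind are what a Stress X Racial Environment interaction amounts to.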

With respect to individual differences in 
anxiety, one study has compared the relation 
between scores on Taylor’s Manifest Anxiety 
(MA) scale and intellectual performance in 
Negroes and whites. Matching Negro and 
white enlisted men on several background 
characteristics, Roen (1960) found a higher 
negative correlation for the former between 
scores on the Army Classification Battery 
(ACB) and the MA scale. A possible explana- 
tion is that the experience of taking the ACB 
was more stressful for Negroes than for 
whites, so that their test behavior was more 
closely linked to general anxiety. Several 
studies, reviewed by Sarason (1960), have 
reported interaction effects of stress and MA 
scale scores on intellectual performance. 

In view of Roen’s provocative finding, we 
decided to include this personality variable 
in the present experiment. We wished to deter- 
mine whether it would interact with our situa- 
tional variables in a manner consistent with 
the U shaped hypothesis. 


METHOD 
Subjects 


One hundred and fifteen male Negro students at 
Fisk University, out of a total male student body 
of about 200, were recruited on a voluntary basis 
and paid $2.00 for two sessions that totaled about 
80 minutes in length. Thirty-two were from the 
North and the rest were from the South. No one 
was previously acquainted with the administrator 
and the confederate whom he encountered in the 
second session. 


Procedure 


Subjects in groups of 30 or more were adminis- 
tered, by a Negro faculty member and a Negro 
student assistant, 60 of Heineman’s (1953) forced- 
choice items for measuring manifest anxiety. Items 
were scored on Key 1. The sample was divided 
into high anxious and low anxious subgroups by 
median split. Then highs and lows were randomly 
assigned to each of the four combinations of shock 
threat and racial environment conditions. One to 
two weeks after they had taken the MA scale, sub- 
jects were called back individually for the second 
session. Because it was necessary to begin the experi- 
mental sessions before all subjects had been recruited, 
the median MA scale score was not known exactly 
until the experiment was almost completed. There- 
fore, the Ns for the eight treatments were found to 
vary from 12 to 18. The number of northern sub- 
jects in these cells ranged from 2 to 5. The subject 
entered a room in which the administrator (the 
experimenter) and a confederate were present. The 
experimenter introduced himself as a psychologist, 
and the confederate as a student from another college 
in Nashville. For the White condition, both the 
experimenter and the confederate were white, while 
for the Negro condition both men were Negro. Two 
pairs of white experimenters were used, with ap- 
proximately half of the subjects in each combination 
of stress and anxiety conditions being assigned to 
each pair. Only one team of Negro experimenters 
was used, due to the difficulty of finding experi- 
menters who were not known to the subjects. The 
experimenter described the experiment as “an investi- 
gation in psycho-physiology” and stated that he 
was using the room they were in as a convenient 
place for seeing students from colleges throughout 
the city, but could work with only two students 
at a time due to limitations of the apparatus. 


You fellows are going to work on a task under 
different conditions. The task is not a test of any 
kind. Your scores will not be shown to anyone 
at your college, and you will not be compared 
with one another. 


During these preliminary remarks the subject and 
the confederate sat in view of one another. After- 
wards, the experimenter directed both men to chairs 
which were separated by a screen so that they 
could no longer see one another. They were given 
sheets of paper containing a digit-letter substitution 
task and told how to do it. (Sarason and Palola, 
1960, found this task to be easier than the digit 
symbol subtest of the Wechsler-Bellevue scale.) The 
subject worked on the task for 3 minutes. Following 
this, the shock threat treatment was given. For both 
mild and strong levels of shock threat, the experi- 
menter placed before each man finger electrodes 
which were wired to a metal box with several dials 
and switches. He showed them how to put the 
electrodes on the fingers of their nonwriting hand. 
Then he explained that there would be another trial 
on the digit-letter task, during which electric shock 
would be administered. 


The shocks may occur at any time that you are 
working on the task. You won’t know when to 
expect them. They won’t have any connection with 
how well you are doing on the task. 


For the Strong Threat condition the experimenter 
continued: 


These shocks will not harm you in any way, but 
they will be strong. They will give you a painful 
jolt. 


For the Mild Threat condition he said: 


These shocks will be very mild, and they won't 
be at all painful. They will just tingle. You won't 
mind them. 


The subject then performed the digit-letter task on 
a fresh sheet for 3 minutes. No shocks were actually 
administered at any time. After this trial the elec- 
trodes were put away and the subject filled out a 
questionnaire which contained scales for rating how 
much stress he experienced and how much he thought 
his companion experienced. 

Performance on each 3-minute trial of the digit- 
letter task was scored on the basis of the number 
of boxes correctly filled in. In order to reduce the 
influence of individual differences in ability on the 
task, predictions were to be tested on scores repre- 
senting the difference between achievement on the 
first and second trials. 
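
The assignment and scoring steps described in this section can be sketched in code. This is a minimal Python illustration under stated assumptions, not the authors' procedure: it assumes all MA scale scores are known before assignment (the text notes that in practice they were not, hence the unequal cell sizes), places scores equal to the median in the low group, and balances cells by dealing shuffled subjects round-robin. The subject labels and scores are invented for the example.

```python
import random
from statistics import median

# The four Race X Threat treatment combinations.
CELLS = [(race, threat) for race in ("White", "Negro") for threat in ("Strong", "Mild")]


def assign_conditions(ma_scores, seed=0):
    """Median-split subjects on MA score, then randomly assign each half to the four cells."""
    rng = random.Random(seed)
    cut = median(ma_scores.values())
    assignment = {}
    for level in ("high", "low"):
        # Scores above the median are "high"; ties with the median count as "low" (assumption).
        group = [s for s, score in ma_scores.items() if (score > cut) == (level == "high")]
        rng.shuffle(group)
        for i, subject in enumerate(group):
            race, threat = CELLS[i % len(CELLS)]
            assignment[subject] = (level, race, threat)
    return assignment


def difference_score(first_trial, second_trial):
    """Intertrial difference score: second trial minus first trial."""
    return second_trial - first_trial


# Hypothetical MA scale scores for eight subjects (illustration only).
ma_scores = {"S1": 22, "S2": 31, "S3": 27, "S4": 35, "S5": 29, "S6": 24, "S7": 33, "S8": 26}
assignment = assign_conditions(ma_scores)
for subject, cell in sorted(assignment.items()):
    print(subject, cell)
```

Scoring each subject by the difference between trials, rather than by raw trial scores, is what lets the analysis set aside stable individual differences in ability on the task.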


RESULTS 
Original Sample 


The data for testing the predictions were 
the intertrial difference scores (second trial 
minus first trial), and these were evaluated 
by a 2 X 2 X 2 analysis of variance. There 
was a significant Race X Threat interaction 
effect (F = 4.32, p < .05), as had been pre- 
dicted. Next, a 2 X 2 X 2 analysis of variance 
was done on first trial scores, in which level 
of shock threat was treated as a “variable,” 
even though shock threat was not introduced 
until later, as a check on the comparability 
of the Strong Threat and Mild Threat sam- 
ples. There was one significant effect, a triple 
interaction (F = 6.48, p < .05), indicating 
that the subsamples in the eight treatment 
cells were not comparable. Therefore, the pos- 
sibility existed that an unbalanced distribu- 
tion of subjects had exercised a concealed in- 
fluence on the predicted outcome. 

Two possible sources of such imbalance 
were investigated, geographic origin and MA 
scale scores. When first trial scores of north- 
ern and southern subjects within the eight 
cells were compared, there were no apparent 
differences. Next, within-cell distributions of 
MA scale scores were examined. It was found 
that the distributions in the four high MA 
scale cells, as well as in the four low MA scale 
cells, were markedly heterogeneous. We then 
compared MA scale scores with first trial per- 
formance.² Variance analyses were done on 
first trial scores in the Negro condition and 
in the White condition. These tested the linear 
and quadratic components of the MA scale 
variable, using three levels of MA scale 
scores: low, middle, and high. A significant 
quadratic effect, in the form of an inverted U 
shaped relation between MA scale scores and 
performance, was present in the White condi- 
tion but not in the Negro condition. 
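
A test of linear and quadratic components over three ordered levels is commonly carried out with orthogonal polynomial contrasts. The sketch below shows that standard approach in Python; it is not necessarily the exact computation the authors used. It assumes equally spaced levels and equal group sizes, and the group means are invented for illustration because the original means are not reported here.

```python
# Orthogonal polynomial weights for three equally spaced levels (low, middle, high).
LINEAR = (-1, 0, 1)
QUADRATIC = (1, -2, 1)


def contrast_value(means, weights):
    """Weighted sum of group means; zero means no trend of that shape."""
    return sum(w * m for w, m in zip(weights, means))


def contrast_ss(means, weights, n_per_group):
    """Sum of squares for a contrast, assuming the same n in every group."""
    value = contrast_value(means, weights)
    return n_per_group * value ** 2 / sum(w * w for w in weights)


# Hypothetical first trial means for low, middle, and high MA groups (illustration only).
means = (40, 46, 41)

linear = contrast_value(means, LINEAR)
quadratic = contrast_value(means, QUADRATIC)
print("linear:", linear, "quadratic:", quadratic)
```

With these weights a negative quadratic value means the middle group scores above the average of the two extreme groups, which is the inverted U pattern. Dividing each contrast sum of squares by the error mean square gives the F ratio for that component.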


Revised Sample 


Since there was ground for attributing the 
triple interaction effect on first trial perform- 
ance to the unbalanced distribution of MA 
scale scores in the various treatment cells, it 
was decided to correct the imbalance by sub- 
selecting, from the original sample, subjects 
who were matched on MA scale scores across 
the four Race X Threat treatment combina- 
tions. This entailed dropping the MA scale as 
an independent variable. The revised sample 
had 19 subjects in each Race X Threat cell, 
with a somewhat smaller standard deviation 
for MA scale scores than that of the original 
sample.³ 

First trial scores. A 2 X 2 analysis of vari- 
ance was done on first trial scores (about 
which there was no prediction) in the revised 
sample. A trend toward a main effect for Race 
was obtained (F = 3.74, p = .06), in which 
performance was better in the White condi- 
tion than in the Negro condition. 

Intertrial improvement. Table 1 presents 
the results of a 2 X 2 analysis of variance of 
the intertrial difference scores. A significant 
Race X Threat interaction effect was obtained 
(p < .05) in support of our prediction from 
the U shaped hypothesis. Inspection of the cell 
means reveals that, as was anticipated, per- 
formance was more detrimentally affected by 
Strong Threat in the White condition than in 
the Negro condition. It will be recalled that 
the interaction prediction was also supported 
by results based on the original sample. Also 
consistent with the U shaped hypothesis were 
significantly higher scores in Negro-Strong 
Threat than in Negro-Mild Threat, and a 
trend in which Mild Threat subjects scored 
higher in the White condition than in the Ne- 
gro condition (p < .10). 


² The authors are indebted to Joseph Weitz for 
suggesting this comparison. 
³ For the original sample N = 115, M = 29.08, and 
σ = 6.11; for the revised sample N = 76, M = 28.32, 
and σ = 4.85. 


TABLE 1 


ANALYSIS OF VARIANCE AND CELL MEANS FOR INTER- 
TRIAL DIFFERENCE SCORES IN REVISED SAMPLE 


Source           df       F        p 
Race (R)          1      .48 
Threat (T)        1      .57 
R X T             1     5.11     <.05 
Error (w) 

Condition        White    Negro      t       p 
Strong Threat      .79     5.11    1.30    <.20 
Mild Threat       4.37    -1.89    1.91    <.10 
t                 1.08     2.14 
p                          <.05 


Note.—In all cells, N = 19. Values of p for t tests are based 
on two-tailed tests. 
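
The direction of the Race X Threat interaction can be read directly from the four cell means in Table 1. The following Python sketch, offered as an illustration and not as part of the original analysis, computes the Strong minus Mild difference within each racial environment and then the difference between those two differences. The variable and function names are assumptions.

```python
# Mean intertrial difference scores as read from Table 1 (revised sample, N = 19 per cell).
cell_means = {
    ("White", "Strong"): 0.79,
    ("Negro", "Strong"): 5.11,
    ("White", "Mild"): 4.37,
    ("Negro", "Mild"): -1.89,
}


def threat_effect(race):
    """Strong Threat mean minus Mild Threat mean within one racial environment."""
    return cell_means[(race, "Strong")] - cell_means[(race, "Mild")]


# Interaction contrast: how much the threat effect differs between environments.
interaction = threat_effect("White") - threat_effect("Negro")

print(f"White: {threat_effect('White'):.2f}")
print(f"Negro: {threat_effect('Negro'):.2f}")
print(f"Interaction contrast: {interaction:.2f}")
```

The threat effect is negative in the White condition and positive in the Negro condition, so the two simple effects run in opposite directions.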

Because of the finding of a trend toward a 
Race main effect on first trial scores, we also 
performed an analysis of covariance on the 
intertrial difference scores, holding first trial 
scores constant. It revealed a significant Race 
X Threat interaction (F = 6.53, p< .05), 
which corroborated the results of the analysis 
of variance, even when subjects were equated 
on first trial scores. The adjusted mean inter- 
trial difference scores are presented in Table 2, 
where it can be seen that the Negro-Strong 
Threat treatment still has a significantly 
higher mean than Negro-Mild Threat (p < 
.05). Also, the trend toward higher unad- 
justed scores in White-Mild Threat than in 
Negro-Mild Threat has now become a sig- 
nificant difference (p < .05). 


TABLE 2 


ADJUSTED MEANS FOR INTERTRIAL DIFFERENCE SCORES 
FROM ANALYSIS OF COVARIANCE OF 
REVISED SAMPLE 


Condition        White    Negro      t       p 
Strong Threat     1.33     4.12   <1.00 
Mild Threat       5.63    -2.04    2.54    <.01 
t                 1.33     2.05 
p                 <.20     <.05 


Note.—In all cells, N = 19. Values of p are based on two- 
tailed tests. 


TABLE 3 


ANALYSIS OF VARIANCE AND CELL MEANS FOR DIFFER- 
ENCES BETWEEN SELF-RATINGS AND RATINGS OF 
OTHER ON “NERVOUSNESS” IN REVISED SAMPLE 


Source           df       F        p 

Race (R)          1     2.05 
Threat (T)        1      .68 
R X T             1     5.46     <.02 
Error (w)        72 

Condition        White    Negro      t       p 
Strong Threat      .74     -.19    3.06    <.01 
Mild Threat       -.53     -.16     .74 
t                 2.54     1.26 
p                 <.02 


Note.—In all cells, N = 19. Values of p for t tests are based 
on two-tailed tests. On the seven-point rating scale, high scores 
indicated nervousness. Rating of other was subtracted from 
self-rating, so that a negative score indicates the other was 
rated more nervous than the self was rated. 

Self-ratings. Analysis of variance of total 
self-ratings on the three stress scales, total 
ratings of the other person's stress, and the 
difference between these scores, had no sig- 
nificant results. When analyses were done for 
the three scales separately, only nervous-calm 
revealed effects of the experimental conditions. 
As shown in Table 3, a significant interaction 
occurred on the self-other difference scores 
(p < .02). The cell means show that White- 
Strong Threat was the only condition in which 
subjects viewed the companion as less nerv- 
ous than themselves, and that scores in this 
cell were significantly higher than those in 
Negro-Strong Threat (p < .01) and White- 
Mild Threat (p < .02). 

Self-other differences in ratings on nervous- 
ness were tested for correlation with intertrial 
differences in performance. Only one condi- 
tion, White-High Threat, showed a significant 
correlation (r = .47, p = .05). Thus, in the 
most stressful condition (as evidenced by rat- 
ings of nervousness and by performance) the 
tendency to see oneself as more nervous than 
the other person was inversely related to digit- 
letter achievement. 


DISCUSSION 


In a situation that was otherwise presumed 
to be low in motivational significance (Mild 
Threat condition), Negro subjects worked bet- 
ter in the presence of a white administrator 
and a white confederate than did subjects 
who were in a Negro setting. But in a highly 
stressful condition (Strong Threat) this re- 
lationship was reversed. The results clearly 
show that on a relatively nonthreatening task 
a white environment can have either favorable 
or detrimental effects on Negro performance, 
depending upon other conditions. On the as- 
sumption that the white environment was 
moderately stressful for Negro subjects, our 
data are consistent with the U shaped hy- 
pothesis of Malmo (1957), Duffy (1957), 
and others. 

An alternative to the notion of an invariant 
U shaped relationship between arousal and 
performance is Easterbrook's (1959) hypothe- 
sis that the range of cue utilization by an or- 
ganism is a decreasing function of drive in- 
tensity. Accordingly, the inverted U will oc- 
cur for tasks on which the optimal range of 
cue utilization is an intermediate one. These 
will be fairly complex tasks that require many 
cues for optimal performance, e.g., nonrepeti- 
tive serial learning. But tasks that require 
recognition of very few cues, e.g., simple re- 
action, will tend not to reveal deterioration of 
performance at high drive levels. Thus, East- 
erbrook's hypothesis goes beyond the arousal 
formulation by specifying the kind of task on 
which a U shaped relation between drive and 
performance will occur, and by providing a 
measurable response variable (range of cue 
utilization) to account for it. 

A different statement of the relationship be- 
tween stress and performance is provided by 
Hullian theory, as applied by Farber and 
Spence (1951). According to their formula- 
tion, stress should impair performance on 
complex tasks, but facilitate performance on 
simple tasks. Also, different stress stimuli 
should have a summative effect. If one re- 
gards the digit-letter task of the present ex- 
periment as complex, Hullian theory would 
suggest the following main effects in the pres- 
ent experiment: poorer performance in the 
White condition than in the Negro condition; 
poorer performance in the Strong Threat con- 
dition than in the Mild Threat condition; 
hence, best performance in the Negro-Mild 
Threat treatment (lowest total stress); and 
worst performance in the White-Strong Threat 
treatment (highest total stress). But if the 
digit-letter task is defined as simple, opposite 
predictions would be made. Our finding of 
an interaction effect in which the two inter- 
mediate stress treatments produced better per- 
formance than either the lowest stress treat- 
ment or the highest stress treatment clearly 
does not support either set of Hullian pre- 
dictions. 
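
The comparison of predictions with outcome can be made explicit. The sketch below is an illustrative Python check and not part of the original analysis. It takes the unadjusted cell means from Table 1, orders the treatments by assumed total stress as described above, and tests each predicted pattern. All names are assumptions introduced for the example.

```python
# Mean intertrial difference scores from Table 1 (revised sample).
observed = {
    "Negro-Mild": -1.89,
    "White-Mild": 4.37,
    "Negro-Strong": 5.11,
    "White-Strong": 0.79,
}

# Ordering by total stress, following the summative assumption in the text.
lowest, highest = "Negro-Mild", "White-Strong"
intermediate = ("White-Mild", "Negro-Strong")

best = max(observed, key=observed.get)
worst = min(observed, key=observed.get)

# Hullian prediction for a complex task: best at lowest stress, worst at highest.
hullian_complex_ok = best == lowest and worst == highest
# Hullian prediction for a simple task: the opposite ordering.
hullian_simple_ok = best == highest and worst == lowest
# Inverted-U pattern: both intermediate cells exceed both extreme cells.
inverted_u_ok = min(observed[c] for c in intermediate) > max(observed[lowest], observed[highest])

print("Hullian (complex task):", hullian_complex_ok)
print("Hullian (simple task):", hullian_simple_ok)
print("Inverted U:", inverted_u_ok)
```

This check concerns only the ordering of the means. It says nothing about whether individual differences between cells are statistically reliable, which is addressed by the t tests in Table 1.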

As an alternative to the interpretation of 
results in terms of increments of drive, or 
increments of arousal, one must consider 
whether the experimental manipulations could 
have affected the significance of the situation 
for the subject. For example, although the 
task administrator implicitly defined the task 
as unimportant and nonevaluative in all con- 
ditions, subjects in the White-Strong Threat 
treatment may have perceived the situation 
as a test of their ability to withstand stress 
(by working efficiently), and felt in competi- 
tion with the white co-worker for favorable 
evaluation by the white administrator. That 
is, the combination of white environment and 
Strong Threat may have had stronger evalua- 
tive significance than any of the other ex- 
perimental treatments, causing subjects to 
avoid trying to do well, out of fear of failure 
or fear of hostile reprisals if successful. Re- 
search is now under way to determine the ef- 
fects of variations in task importance and task 
difficulty on Negro performance. Another pos- 
sibility is that when the threat of strong shock 
was made by a white administrator it aroused 
more fear and hostility than when it was made 
by a Negro administrator. If so, the work re- 
ported by Horwitz (1958) would suggest that 
the relatively poor performance of Negroes in 
the White-Strong Threat treatment may have 
been caused by the inhibition of hostility 
against the administrator. 

The present experiment did not differentiate 
between the effect on subjects of a white ad- 
ministrator and the effect of a white co- 
worker, and further research is needed to de- 
termine their separate effects. The finding that 
the co-worker was perceived as less nervous 
than the self in the White-Strong Threat 
treatment, and the correlation in this treat- 
ment between performance and the tendency 
to see the co-worker as less nervous, suggest 
that awareness of the white co-worker may 
have generated feelings of inadequacy under 
stress. 

Motivational factors have largely been ig- 
nored in previous research on the test per- 
formance of Negroes and other minority 
groups. In this regard, the findings noted by 
Dreger and Miller (1960) that Negroes tend 
to perform less well than “comparable” groups 
of whites call for close re-examination. The 
present study, as well as those of other in- 
vestigators who used white subjects, indicates 
that anxiety and other motivational factors 
may strongly influence test taking behavior. 


EMOTIONAL UNCERTAINTY AND SOCIAL COMPARISON 

An experiment was run to test an assumption, basic to social comparison 
theory, that evaluational uncertainty regarding some aspect of the self, such 
as an emotional experience, produces a desire to compare one’s self with 
others. Under threat of a strong shock, 2 levels of uncertainty were induced. 
Cross-cutting this induction were treatments in which information about 
others due to undergo the same experience was manipulated. Following 
Schachter, the desire for comparison information was measured by the 
strength of S's affiliation tendency. The findings suggest that: (a) uncertainty 
does produce a desire to compare one’s self with others, (b) the individual’s 
uncertainty relative to available comparison persons will determine his
comparison tendency, and (c) a discrepancy in position (in this case,
emotionality) reduces the desire to compare.


The discomfort associated with an uncertain 
evaluation of some aspect of the self (e.g., 
an opinion or an ability) is a function both 
of its indeterminateness and its importance 
for the person. This discomfort, being drive-
like, will lead to behavior enabling the indi-
vidual to achieve greater certainty in his
evaluation. When information relevant to an
uncertain evaluation is available the indi-
vidual will therefore attempt to acquire it.
When this information is in the form of 
the opinions or performances of others, a 
process is generated whereby the individual’s 
uncertainty is reduced through comparison 
of himself with others. Some aspects of this 
process have been elaborated by Festinger 
(1954). Nowhere, however, has the funda- 
mental relationship between uncertainty and 
social comparison been examined directly. 
This is the purpose of the present experiment. 

Schachter’s (1959) recent work attempts to 
bring emotional experience within the context 
of social comparison theory. He found that 
an individual, when emotionally aroused, will 
seek the company of others who are con- 
fronted by the same emotion provoking 
situation. Schachter infers, from indirect evi- 
dence, that a major motivation accounting for 
this affiliation is the individual’s desire to 
evaluate his experience by checking his own 
emotional reactions to the situation against 
the reaction of others. 

If Schachter's finding is indeed due to
the individual’s desire to compare himself with
others, then providing him with information
about the others would have a consummatory
effect and thus reduce affiliation. Using
Schachter's basic experimental situation,
Gerard and Rabbie (1961) found such evi-
dence in a study designed to test this deriva-
tion. Their results, however, were not entirely
explicable in comparison theory terms, leaving
the matter still open.

1 Now at the University of California, Riverside.

A social comparison interpretation of 
Schachter's data rests entirely upon the as-
sumption that less uncertainty is aroused in
the low than in the high fear treatment. The 
argument is as follows. The situation con- 
fronting the subject evokes an emotional re-
sponse which is far greater in the high as
compared with the low fear treatment. Since 
the threat of being shocked is novel, cognitive 
uncertainty exists regarding the bodily state 
produced by the anticipation; the greater the 
threat, the greater both the bodily response 
and the cognitive uncertainty. This assump-
tion of a positive relationship between
emotional level and uncertainty may not be 
tenable. It is just as likely that greater 
uncertainty is associated with weak, vague 
feelings and that a situation posing an obvious 
danger like a strong shock is very clearly 
cognized. Another motive or set of motives 
which was not ruled out by Schachter’s pro- 
cedures may account for the positive relation 
between level of fear and affiliation. The fact 
that uncertainty may not have an obvious 
relationship to emotional level is suggested 
by the parallel effects of information under 
both fear levels in the Gerard-Rabbie
experiment.

Since, in comparison theory, it is not the
level of fear arousal but uncertainty which is
assumed to mediate the relationship between
arousal and affiliation, it is important to bring
this relationship into experimental focus by
a more direct check.

Rabbie (1961) attempted such a direct
examination, again using Schachter’s basic
experimental situation, and varying the like-
lihood that the individual would receive a
strong shock. Under one treatment, the sub-
ject was informed that each of the four
people present would receive a strong shock
whereas under another treatment he was told
that only one of the four would receive the
shock. Although the treatments produced no
difference in reported emotional arousal, there
was greater affiliation under the “one-in-four”
likelihood treatment. Rabbie's results support
the uncertainty-comparison relationship, if we
assume that varying the likelihood of getting
shocked varies cognitive uncertainty regarding
the internal emotional state. An obvious
alternative interpretation of his data is that
the greater affiliation in the one-in-four treat-
ment may have been due to aroused curiosity
as to who specifically would be the victim.

The present experiment approaches more
directly an examination of the assumption
that uncertainty produces information seeking
behavior. This was done by attempting to do
the obvious, namely, to vary uncertainty
regarding the aroused bodily state.

Since affiliation in these experiments is
assumed to be a measure of the motivation
to compare one's self with others, providing
the individual with information about the
others should reduce affiliation, as in the
Gerard-Rabbie experiment. This reduction in
affiliation should be more marked the greater
his uncertainty since the information would
presumably have a greater consummatory
effect the greater the drive. Evidence exists
indicating that an individual will cease com-
paring himself with others to the extent that
they differ in the aspect being compared
(Festinger, Gerard, Hymovitch, Kelley, &
Raven, 1952; Hoffman, Festinger, & Law-
rence, 1954; Schachter, 1951). Therefore
further evidence for social comparison would
be a reduced desire to be with others to the
extent that they differ in emotionality. 


METHOD 
Subjects 


Each four-person group consisted either of all men 
or all women seated in adjacent cubicles. All subjects 
were students at Drew University. When only three 
subjects showed up for the experiment, as happened 
occasionally, it was made to appear, by appropriate 
deception, that all four cubicles were occupied. 


Procedure 


After the subject was seated in his cubicle, a pair 
of nonpolarizing electrodes, designed for electrodermal 
measurements by O’Connell and Tursky (1960), were 
attached to the subject’s nonpreferred hand. One of 
these electrodes was attached to the palm and the 
other fixed in position just above the wrist on the 
dorsal side of the forearm. A dummy EKG electrode 
was attached to the dorsal side of the forearm just 
above the reference skin resistance electrode. The 
arm was then immobilized by strapping it into a 
specially made armrest. The skin resistance electrodes 
were connected across one of the inputs of a four- 
channel constant current (40 microamperes) galva- 
nometer circuit. 

All of the instructions for each experimental condi- 
tion were prerecorded and heard by the subject over 
earphones which had a rubber cup over each earpiece. 

Each subject was exposed to the same high fear 
threat which was induced by a procedure similar to 
that used in the Schachter, Gerard-Rabbie, and 
Rabbie experiments. The two experimenters wore 
white coats and each had a stethoscope bulging from 
one pocket. A sphygmomanometer, some gauze, 
cotton, and medicaments were conspicuously on 
display as was a sign on some formidable looking 
electronic gear which read "Caution, High Voltage." 
After the electrodes were in place, the subject was 
informed that he was taking part in an investigation 
of the effects of electrical shock and that these shocks 
would be painful. For the exact wording of these 
instructions, the reader is referred either to Schachter 
(1959, p. 13) or Gerard and Rabbie (1961, p. 587). 

The certainty and information treatments were 
then administered after which the subject was in- 
formed that there would be a 10- or a 15-minute 
wait before we were ready to administer the shocks. 
He was given a choice of waiting together with the 
others in a room nearby or alone in a separate room 
and was told that on the basis of the choices made, 
we would arrange the most appropriate waiting 
situation. He depressed one of two buttons to indicate
his choice and was then asked to indicate the strength 
of his desire for the waiting situation he had chosen. 
He did this by setting a pointer on a dial in the 
center of his console at a position between “1” and 
“11.” His dial setting was read on a meter in the 
control room which transformed it to a score from 
0 to 100. This was the basic information required
to test the comparison hypothesis. When informed
that they were not actually going to be shocked, 
the subjects were generally quite relieved. They were 
apprised of the deceptions and asked not to reveal 
the nature of the experiment to fellow students who 
had not as yet participated. 
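
The report gives only the endpoints of this conversion (dial positions 1 to 11, meter scores 0 to 100). A minimal sketch, assuming the transformation was linear; the function name `dial_to_score` and the linearity itself are illustrative assumptions, not details taken from the procedure:

```python
def dial_to_score(position: float) -> float:
    """Map a dial setting between 1 and 11 onto a 0-100 score.

    Assumes a linear transformation; the report states only the endpoints.
    """
    if not 1 <= position <= 11:
        raise ValueError("dial position must lie between 1 and 11")
    return (position - 1) * 10.0


# Show the score for the lowest, middle, and highest dial settings.
for setting in (1, 6, 11):
    print(setting, dial_to_score(setting))
```

Under this assumption each step on the dial is worth 10 points on the meter, so the midpoint setting of 6 corresponds to a score of 50.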


Manipulation of Uncertainty 


The subject had a meter on his table which was a 
disguised VU (volume unit) meter. It was labeled 
"Emotionality Index" and its dial was painted to 
read from 0 to 100. Since a VU meter registers the 
amplitude of an audio signal, it was possible to 
control the magnitude and variability of the bogus 
emotionality index reading with a tone which was 
recorded on the instruction tape. While this tone 
was on, the subject’s earphones were disconnected 
from the output of the tape recorder. 

He was told: 


We are interested not only in how you react to the
shocks but also in your reactions while anticipating
these shocks. With the aid of the electrodes at-
tached to you now, we can determine your present
emotional state. With a formula that has been
carefully worked out, we take your skin con-
ductance as measured by the little polystyrene cup
electrodes attached to your hand and above your
wrist and combine it with your muscle tremor,
which is detected by the large electrode attached 
to your forearm, and arrive at an emotionality 
index. We have found the measure to be very 
GENTE in reflecting the worked-up state of the 
body.

In this experiment we have decided to let you
see your own emotionality index which will appear
on the meter in front of you. The needle may
waver somewhat indicating momentary changes in
your emotionality. We are going to record your
index on our equipment for a three minute period.
After that, we will proceed with the shock admin-
istration. Zero is complete calm, whereas 100 indi-
cates extreme emotionality. Now watch your meter.


In the “uncertain” treatment, the needle made
rather erratic and wide excursions for the 3-minute
period with an average reading of 75. In the “certain”
treatment, the needle remained at or very close to 
75 during the entire 3-minute period. 
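
The two meter displays can be illustrated with a small simulation. This is only a sketch: the one-reading-per-second sampling, the swing of 20 points, and the name `meter_trace` are hypothetical. The report states only that the unsteady needle made erratic, wide excursions averaging 75 and that the steady needle stayed at or very close to 75.

```python
import random


def meter_trace(treatment: str, n_readings: int = 180, mean: float = 75.0,
                swing: float = 20.0, seed: int = 0) -> list[float]:
    """Return simulated needle readings for the 3-minute display.

    'certain' holds the needle at the mean; 'uncertain' swings it
    about the mean. Sampling rate and swing width are assumed values.
    """
    if treatment == "certain":
        return [mean] * n_readings
    if treatment != "uncertain":
        raise ValueError("treatment must be 'certain' or 'uncertain'")
    rng = random.Random(seed)
    half = n_readings // 2
    # Draw offsets in mirrored pairs so the trace averages to the mean
    # (up to floating-point rounding).
    offsets = [rng.uniform(-swing, swing) for _ in range(half)]
    offsets += [-x for x in offsets]
    if n_readings % 2:
        offsets.append(0.0)
    rng.shuffle(offsets)
    return [mean + x for x in offsets]


steady = meter_trace("certain")
erratic = meter_trace("uncertain")
print(sum(steady) / len(steady), max(erratic) - min(erratic))
```

The point of the sketch is that both traces share the same average level, so the two treatments differ only in how variable the displayed index is.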


Information Treatments 


In order to examine the effects of information upon
tendencies to engage in social comparison two in-
formation treatments were superimposed on the two
levels of uncertainty. A procedure similar to the
Gerard-Rabbie one was used. In one treatment the 
subject received no information about the others. In 
the other treatment the instructions continued: 


We thought that you would like to know how 
emotional the other subjects were during that 
three minute period. Rather than give you a
running record of each of their emotionality in-
dexes, we will indicate, on the meter, the average
for the other subjects for the three minute interval.
The average for each of you for the interval is
computed automatically. It will take a few more 
seconds to combine these averages. Separate 
computations have to be made since each of you 
will see the average of the three others. 


After a reasonable pause, the meter indicated the
others’ average in the Others-same treatment as 75.

Two additional information conditions which, 
due to a subject shortage, were run only under 
the unsteady meter display, purported to show 
the others as either much higher (an average of 95) 
or much lower (an average of 50) in emotionality
than the subject. These treatments were applied in
order to further detect social comparison. Positive
evidence would be lower affiliation in both of these
treatments as compared with the Others-same treat-
ment. Of course, any conclusions that we may draw
are necessarily limited by the lack of counter- 
balancing. 


Summary of the Design 


Two degrees of emotional uncertainty in the face 
of a threat of electrical shock were induced. This was 
accomplished by displaying to the subject a steady 
and an unsteady meter reading which purported to 
indicate to the subject the level of his emotionality. 

Each of these treatments was run under two levels 
of information about the others. Under one, he was 
given no information about the others whereas under 
the other he was shown data indicating that the 
others were similar to himself in emotionality level. 
Two additional treatments of discrepant information 
were run only under the uncertain condition. 
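
The layout of the six conditions can be written out explicitly. A small sketch; the labels and the function name `design_cells` are illustrative only:

```python
from itertools import product

METER = ("unsteady", "steady")
INFO_CROSSED = ("no info", "others-same")
INFO_DISCREPANT = ("others-higher", "others-lower")


def design_cells() -> list[tuple[str, str]]:
    """List every (meter display, information) cell that was run."""
    # The 2 x 2 counterbalanced core of the design.
    cells = list(product(METER, INFO_CROSSED))
    # Discrepant information was run under the unsteady display only.
    cells += [("unsteady", info) for info in INFO_DISCREPANT]
    return cells


for meter, info in design_cells():
    print(f"{meter:9s} {info}")
```

The two missing cells (steady display with discrepant information) are what leave the discrepancy comparison without counterbalancing.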


RESULTS 


Since affiliation varies with level of threat 
(Gerard & Rabbie, 1961; Schachter, 1959), 
it is important to establish at the outset that 
the manipulation of differential uncertainty 
did not induce a difference in emotionality. 
The change in base level conductance during 
the uncertainty induction was taken as a 
measure of the change in emotionality pro- 
duced by the induction (Gerard & Rabbie, 
1961). A comparison of the change in skin 
conductance from the beginning to the end 
of the induction shows no differential increase 
between the certain and uncertain treatments. 
We are therefore relatively safe in assuming 
a lack of difference in emotionality between 
the two treatments. 


Induction of Uncertainty


During the 3-minute uncertainty induction, 
a continuous record was made of the subject’s 
skin conductance. In order to determine if this 
induction produced a differential in emotional 
variability, we measured the variability of 13 
arbitrary points taken at 10-second intervals 
from the beginning to the end of the un- 
certainty induction. The unsteady needle 
treatment does show somewhat greater vari-
ability (p < .15 by a t test comparing the
average variance per subject in each uncer-
tainty treatment).²
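
The variability check can be sketched as follows. The report specifies a t test on per-subject variances but not which form of the test. The pooled-variance (Student) version below, the function names, and the sample numbers are assumptions for illustration only and are not the study's data.

```python
from math import sqrt
from statistics import mean, variance


def subject_variability(conductance: list[float]) -> float:
    """Sample variance of one subject's sampled conductance readings."""
    return variance(conductance)


def pooled_t(group_a: list[float], group_b: list[float]) -> tuple[float, int]:
    """Student's t for two independent groups, with degrees of freedom."""
    na, nb = len(group_a), len(group_b)
    df = na + nb - 2
    # Pool the two sample variances, weighting each by its degrees of freedom.
    pooled_var = ((na - 1) * variance(group_a)
                  + (nb - 1) * variance(group_b)) / df
    se = sqrt(pooled_var * (1 / na + 1 / nb))
    return (mean(group_a) - mean(group_b)) / se, df


# Made-up per-subject variances, one value per subject (not the study's data).
uncertain = [4.1, 3.2, 5.0, 2.8]
certain = [2.9, 3.0, 2.2, 3.5]
t, df = pooled_t(uncertain, certain)
print(f"t = {t:.2f}, df = {df}")
```

Each subject contributes a single number (the variance of his sampled readings), and the test then compares the means of those numbers across the two uncertainty treatments.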


Affiliation 


Table 1³ presents the mean desire for affilia-
tion for the six experimental conditions. For
the four counterbalanced conditions we see an
effect of uncertainty in the predicted direction
(p < .01 by t) as well as an interaction with
information. Information that the others are
at the same emotional level reduces affiliation
under the steady meter (p < .01 by t) but
not under the unsteady meter display.⁴ Both


2 Two “maverick” scores were exceptionally large
and added considerably to the standard error. Were
it not for these two scores, the difference would have
been significant at beyond the .05 level. Unfortu-
nately no other check on emotional variability could
be taken immediately after the uncertainty induction
since the interposition of a questionnaire measure
might have further complicated what is a somewhat
awkward procedure to begin with. The “take” of the
variable might have been adversely affected. We
therefore only have the conductance variability data
to rely on.

3 The data for those six subjects who chose to
be alone are not presented in this table. In all of
the experiments thus far, a constant proportion of
approximately 10% of the subjects choose to be
alone.

4 There is a good deal of interest currently in
certain phenomena which appear to be linked with 
the order of birth in the family (Dittes, 1961; Gerard 
& Rabbie, 1961; Glass, Horwitz, Firestone, & 
Grinker, 1963; Sarnoff & Zimbardo, 1961; Schachter, 
1959). Schachter, using all female subjects, found 
greater affiliation among first as compared to later 
borns. Gerard and Rabbie, on the other hand, found 
an interaction with sex. Their females behaved like 
those of Schachter whereas males showed the opposite 
trend. As Glass et al. have pointed out, it is prob-
ably misleading to consider birth order as a variable
in psychological research independently of the socio- 
cultural context of the particular family since there 
are such wide variations in child rearing practices. 

TABLE 1

AFFILIATION PRODUCED BY DEGREE OF CERTAINTY
(Steadiness of meter display) AND INFORMA-
TION ABOUT THE OTHER’S EMOTIONALITY

Meter display    Information      Mean affiliationᵃ
Unsteady         No info          79.6 ([illegible])
Unsteady         Others-same      79.6 (12)
Unsteady         Others-higher    51.1 ([illegible])
Unsteady         Others-lower     [illegible] (12)
Steady           No info          69.0 (10)
Steady           Others-same      54.2 ([illegible])

ᵃ Ns are indicated in parentheses.


discrepant conditions show some reduction in 
affiliation. However, only the Others-higher 
condition is significantly different (p < .01
by t) from the Others-same condition.


DISCUSSION


We shall first discuss the data from the four 
counterbalanced conditions reserving some 
remarks at the end for the other two condi- 
tions. When fear is aroused we find greater 
affiliation the greater the uncertainty. This 
basic finding is direct support for the assump- 
tion underlying comparison theory. However, 
we had predicted that information indicating 
that the others were similar to the subject 
would have a greater effect on reducing affilia- 
tion under the uncertain rather than the 
certain treatment. The data indicate that the 
opposite obtained. How can we account for 
this reversal? 

One possible interpretation is that the mere 
fact of knowing the level of the others’ emo- 
tion is not sufficient to reduce comparison 
motivation in the uncertain treatment. The 
general level of uncertainty may be much too 
high for this information to have had any 
effect. The steady meter treatment on the 
other hand, does have at least some uncer- 
tainty associated with it. Perhaps this level 
is sufficiently low for the information to 


However, for the record, birth order data were 
collected. Once again we find the same interaction 
found in the Gerard-Rabbie experiment. That is, 
first born females want to affiliate more than later 
borns whereas for males, the relationship is reversed 
(p < .05 for the interaction). In response to the
Gerard-Rabbie results, some have argued (inform- 
ally) that the interaction may be due to the fact 
that the subjects were from a large urban college 
with a large Jewish population. The present results 
were for subjects from a small Methodist suburban 
college. 
produce a change in comparison motivation.
Admittedly, this is not a very compelling 
interpretation. 

There is a more plausible, although some- 
what more complicated, interpretation which 
makes three assumptions: the individual not 
only compares his emotional level with that 
of the others but makes variability com- 
parisons as well, variability comparison occurs 
only when the individual’s own variability 
is high, and variability comparison takes 
precedence over level comparison. Thus, under 
the unsteady display, information about the
others’ level failed to reduce affiliation since 
this information has only second priority, as 
it were. The high degree of affiliation here 
presumably reflects the desire for variability 
information. Following our assumptions, the
subject who saw the steady meter was more 
concerned with level than with variability 
since his variability was low. The information 
as to the others’ level was information which 
he could utilize in reducing whatever uncer- 
tainty there was regarding his own level. 

Variability comparison is assumed to have 
priority over the comparison of positions or 
levels since it is tantamount to the attribution, 
by the individual, of reliability or credibility 
"weightings" to his own and the others’ posi- 
tions. This says that relative rather than 
absolute uncertainty is the prime mover in 
social comparison. Under circumstances where 
there is easy access to information from others 
it is conceivable that the individual will at- 
tempt to check his self-evaluation against 
direct “physical” facts even when these facts 
are quite difficult to obtain. He could do this 
if he attributed low credibility to the informa- 
tion available from others.

There is still another explanation for the
reversal which relates to some data from a 
recent experiment (Gerard, 1962) where it 
was found that a comparison person takes on 
positive value for the individual to the extent 
that he serves to reduce uncertainty. The rela- 
tively high affiliation level for the unsteady 
meter subject in the Others-same condition 
may reflect such an increased attraction.
There may be two opposing tendencies at
work, a reduction in comparison motivation
due to having information which has reduced
uncertainty and an increase in affiliation due
to the attraction produced by uncertainty
reduction. Further research is necessary to 
examine these tendencies separately. Such 
research would, however, require a measure 
of comparison motivation which is purer than 
simple affiliation since affiliation contains 
motivational components other than com- 
parison motivation which are probably af- 
fected by the experimental manipulations of 
the kind that we impose upon the situation.

Since comparison theory does predict re-
duced affiliation where the others differ in
level, the evidence from the treatments in 
which the others were higher or lower than 
the subject in emotionality does suggest that 
levels were being compared under the un-
certain treatment. The fact that affiliation is
higher in the Others-low condition suggests 
the possibility that, in addition to a desire to 
compare levels, the subject was seeking reas- 
surance that the prospects were not as dire 
as he anticipated. Since the design was not 
counterbalanced, the findings here cannot be 
properly evaluated. If we are correct in 
assuming that variability comparison was 
salient in the uncertain treatment then a dis- 
crepancy would have relatively little effect 
on affiliation as compared with the steady 
meter display where the individual is prima- 
rily concerned with information as to the 
others’ level. Therefore, if the design were to 
be filled out by running these variations under 
the steady display we would predict an 
even greater decrement in affiliation due to a 
discrepancy in emotionality. 

TOWARD AN ADEQUATE TAXONOMY OF
PERSONALITY ATTRIBUTES: 


REPLICATED FACTOR STRUCTURE IN PEER NOMINATION 
PERSONALITY RATINGS¹


WARREN T. NORMAN 
University of Michigan 


A rationale and procedures for the development of a taxonomic basis for 
personality research and theory construction are outlined. Some prior 
research efforts in this area are briefly reviewed and evaluated. Results of 
a series of studies are presented which employed primarily groups of uni- 
versity male Ss who varied in length and intimacy of previous associations. 
Data were obtained using peer nomination rating methods. These analyses 
yielded clear and consistent evidence for the existence of 5 relatively orthogonal, 
easily interpreted personality factors. Behavioral correlates of selected patterns 
of scores on these factors are briefly discussed and suggestions are made
concerning subsequent research efforts warranted by these findings.


The arguments and research findings to be 
discussed in this paper focus on the problems 
involved in developing an adequate descriptive 
or observation language for personality as an 
area of psychological research and theory 
construction. The basic commitment which 
underlies the research efforts and program- 
matic considerations presented is that the con-
struction of more effective theories of the
development, structure, and functioning of 
personality will be facilitated by having 
available an extensive and well-organized 
vocabulary by means of which to denote the 
phenotypic attributes of persons. 

It is explicitly not assumed that complete
theories of personality will simply emerge 
automatically from such taxonomic efforts. In 
the opinion of this writer, there is a good 
deal more to theory construction and refine- 
ment than the development of an observation 
language—even a good one. Neither are the 
arguments, empirical results, and assessment 
methods described below presented to support 
the notion that a complete and adequate, de- 
scriptive, taxonomic basis for personality is 
now a fait accompli. Rather they are offered 


1 Portions of this paper were presented at the 
meetings of the American Psychological Association 
in New York, September 2, 1961. The material pre- 
sented in this report stems in part from a project 
sponsored by the Personnel Laboratory, Aeronautical 
Systems Division, Air Force Systems Command, 
Lackland Air Force Base, Texas.


as a contribution toward such a goal, as a 
rudimentary basis from which further work 
can profitably proceed and, by implication, 
as an assessment of our current status in this 
enterprise. 


LOGIC AND PROCEDURES FOR THE DEVELOP-
MENT OF A TAXONOMIC BASIS
FOR PERSONALITY


Attempts to construct taxonomies of per- 
sonality characteristics have ordinarily taken 
as an initial data base some set of perceptible 
variations in performance and appearance be- 
tween persons or within individuals over time 
and varying situations. By far the most gen-
eral efforts to specify the domain of phenom-
ena on which to base such a system have pro- 
ceeded from an examination of the natural 
language. The argument in its essential form 
has been that perceptible differences between 
persons in their characteristic appearance or
manner of behaving or changes over time and 
situations of single individuals in these regards 
become codified as a subset of the descriptive 
predicates of the natural language in the 
course of its development. Thus we find 
Allport and Odbert (1936) reporting the re- 
sults of their exhaustive search for all “trait 
names” in a standard English dictionary. 
From such a compendium (theirs comprised
some 18,000 terms altogether), presumably,
would be drawn the descriptive predicates of 
the observation language for any theory of 
personality. 

The second step in the taxonomic effort 
would consist in selecting from the original 
set those terms suitable for inclusion in a 
scientific data language; that is, that subset 
which possesses unambiguous denotative refer- 
ence to limited classes of relevant observable 
phenomena. There is no necessary reason why 
every trait descriptive term that occurs in the 
natural language should be required. Nor is 
it likely even that all of those terms, admis- 
sible as members of the data language vo- 
cabulary, will refer to sufficiently distinct and 
independent classes of phenomena to warrant 
including all of them. Thus a third aspect of
the taxonomic problem would be to cull from
the admissible set of predicates, those which 
are not needed in order parsimoniously to 
span the domain of phenomena referred to by 
the entire collection. 

On completion of this step the taxonomic 
effort could be, and typically has been, termi- 
nated and the focus of the research program 
directed toward other goals. However, it would 
seem important to carry the endeavor a step 
or two further. In the first place, if the cate- 
gorizing of terms has been done on the basis 
of direct judgmental comparisons among 
them, some assurance is needed that in actual 
denotative use, terms in a given category are 
functionally interchangeable, at least to a high 
degree. Nor should there be terms in two 
different classes which are functionally syn- 
onymous in denotative reference. Second, 
distinctness of the classes does not imply inde- 
pendence of occurrence of the attributes 
denoted by the terms in the various categories. 

The implication of these last two points is 
that the development of an adequate tax- 
onomy involves more than an exhaustive cata- 
loguing of designators into distinct homogene- 
ous classes—it also requires an organization 
of the classes based on the mutual relation- 
ships or joint occurrences of the phenomena 
or attributes denoted by the terms in the 
various sets. This entails the collection of data 
from representative groups of persons from 
the language community concerning their use 
of these terms to describe persons or their 
actions, and the analysis of these data to 
determine the degree of relationship or joint
usage of terms in the several categories. 


SOME PRIOR RESEARCH FINDINGS


The list of trait names which resulted from 
the search by Allport and Odbert (1936) was
used by Cattell (1947, 1957) in his con- 
ceptualization of the Language Personality 
Sphere and in his development of the con- 
densed Personality Sphere set of rating scales. 
In brief, Cattell limited his consideration first 
to the subset of 4,504 terms which are con- 
tained in Column I of the Allport-Odbert 
monograph—those characterized by Allport 
(1937) as, 

. . . most clearly "real" traits of personality. They
designate generalized and personalized determining
tendencies—consistent and stable modes of an indi-
vidual’s adjustment to his environment. . . . These
terms do not imply merely temporary and specific
behavior as do the terms in Column II . . . they
are more neutral and less censorial than those in
Column III; and they are less metaphorical and
remote in their applicability to personality than
those in Column IV [p. 366].


Having thus initially curtailed his con- 
sideration to those terms denoting relatively
"consistent and stable" aspects of personality,
a further reduction (à la Phase 3 above) was
accomplished by Cattell (1957, pp. 71-72). 
Based on the notion of “just distinguishable” 
differences in judgments of semantic meaning, 
some 171 terms were selected to represent 
“synonym groups” presumably spanning the 
“personality sphere.” Ratings obtained on 
these “trait elements” were then correlated 
and another reduction in the set was accom- 
plished by cluster analysis methods. Each of 
the 36 clusters so identified (and subsequently 
augmented to number 42-46) was converted 
into bipolar descriptions and this collection 
was dubbed the standard reduced personality 
sphere set of rating scales. 

Thus we see that by a combined use of 
semantic judgments and correlational evidence 
the initial set of trait descriptive terms was 
rather severely condensed by Cattell and con- 
verted into a comparatively small number of 
rating scales. These scales were then intended 
for use in subsequent investigations to deter- 
mine the organizational or structural features 
of personality—the final phase of a taxonomic 
program of the sort discussed above. These
scales have in fact been employed by Cattell 
and by others in a number of such studies. 
The results of factor analyses of the obtained
data have not been unequivocal, however. 
Cattell (1957) contends that, 


. . . at the level of behavior ratings, an almost
complete structuring has been obtained. . . . The re-
sults of four consecutive and interrelated studies yield
twelve very stable and two or three less definite pri- 
mary personality factors [p. 73]. 


By way of contrast, Tupes and Christal 
(1961) report the results of analyzing data 
collected on these scales from eight separate 
samples (including reanalyses of two of 
Cattell's own matrices):


These samples differed in length of acquaintanceship 
from three days to a year or more; in kind of ac- 
quaintanceship from assessment programs in a mili- 
tary training course to a fraternity house situation; 
in type of subject from airmen with only a high
school education to male and female undergraduate
students to first-year graduate students; and in type
of rater from very naive persons to clinical psychologists
and psychiatrists with years of experience in
the evaluation of personality. 

In all solutions except one there appeared to be 
five relatively strong and recurrent factors and 
nothing more of any consequence [italics added]. In 
one solution, based on data from undergraduate 
women, the fifth factor split into two highly related 
factors [pp. 13-14]. 


Thus there appears to be a considerable dis- 
parity between the results presented by 
Cattell and those reported by Tupes and 
Christal—even for the two sets of data which 
both analyzed.

One possible explanation lies in the rotational
methods employed: an oblique simple
structure criterion by Cattell but an orthogo- 
nal approximation to simple structure by 
Tupes and Christal. This would not account, 
of course, for the differences in the numbers 
of factors extracted. In the male college 
sample analyzed by both, Cattell (1947)
extracted 13 centroid factors whereas Tupes 
and Christal could find no basis for extracting 
more than 8 factors from these same data. 

This writer has several reasons for prefer- 
ring the Tupes and Christal version to that of 
Cattell. First, Cattell (1947) presents a 
"representative set" of reliability coefficients 
for 12 of the 36 scales he employed in this
study. Eleven of these 12 are lower (sic) in
value than the corresponding communality 
values he reports for the unrotated, centroid 
factors from these data—over half of them 
appreciably lower. If one grants the appropri- 
ateness of the method of reliability estimation 
used and considers the relation between com- 
mon factor variance and reliable variance in 
the factor analytic model, Cattell’s results 
present at least a quandary. 
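
The quandary can be stated compactly. The sketch below uses the usual partition of unit test variance in the common factor model; the symbols $b_j^2$ (specific variance) and $e_j^2$ (error variance) are notation assumed here for illustration and are not taken from Cattell's report.

```latex
1 = h_j^2 + b_j^2 + e_j^2, \qquad r_{jj} = h_j^2 + b_j^2 = 1 - e_j^2
\quad\Longrightarrow\quad h_j^2 \le r_{jj}
```

On this reading, a communality $h_j^2$ that exceeds the reliability $r_{jj}$ of the same scale would imply a negative specific variance, which is why 11 such cases out of 12 are hard to reconcile with the model.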

Second, despite the fact that oblique rota- 
tions were used by Cattell, the defining vari- 
ables or salients for his rotated factors have 
loadings which are remarkably small in magni- 
tude when compared either with their un- 
rotated communalities or with the loadings 


obtained by Tupes and Christal for these. 


scales on their orthogonal factors, While inter- 
pretation is certainly facilitated by high 
hyperplane counts, it is also easier when the 
salients load both strongly and uniquely on a 
single factor. In Cattell’s analysis no variable 
loads as high as .60 on any factor and over 
half of the variables which are salient on one 
factor are salient on two or more. 

It has been argued, however, that factorial
invariance is an important result of using a 
complete simple structure criterion for rota- 
tion. The four studies reported by Cattell 
(1957) are in fact construed by him as pro- 
ducing evidence of invariance for the structure 
he presents. But the Tupes and Christal 
analyses certainly present no less convincing 
evidence of invariance. One of the major 
objectives of the studies and analyses to be 
presented below, in fact, was to determine 
numerical estimates of the degree of factor 
similarity for factor solutions obtained using 
a set of scales drawn from the reduced per- 
sonality sphere set with diverse samples of 
subjects. 

A second objective was to determine 
whether the use of completely analytical 
factor extraction and rotation procedures 
would yield a factor structure similar to those 
obtained by Tupes and Christal who used 
simplified calculation and visual rotation 
methods.² A third objective was to determine
whether even further reduction in the number
of scales included in the analyses would alter
markedly the factor structures obtained.

2 For one of their samples, Tupes and Christal also
rotated the matrix using an IBM 650 normal varimax
program. The two solutions obtained were extremely
similar.


METHOD AND PROCEDURE 


In the present studies, the peer nomination rating 
scales selected for data collection were drawn from 
the reduced personality sphere set on the basis of 
the results of several analyses presented by Tupes 
and Christal (1958). The four scales with the highest 
median factor loadings for each of the five factors 
identified in these earlier analyses were selected. Ab- 
breviated descriptions of these scales together with 
suggested names for the factors they represent are 
listed in Table 1. The actual trait descriptions em- 
ployed in the data collection were those presented 
by Cattell (1947) in his Psychometrika article and 
also may be found in the Appendix of a previous 
report by Norman (1961). The polar descriptions
for these scales were typed, one scale to a page, and 
assembled in a booklet form. The scales representing 
a given factor were systematically alternated with 
scales for other factors throughout the booklet to 
minimize the effects of serial response sets. 

Four samples of male college students from the 
University of Michigan were selected for study as 
follows:

A. 84 senior ROTC students
B. 82 fraternity and co-op residence men (largely seniors)
C. 215 fraternity men (largely sophomores and juniors)
D. 241 dormitory residence hall men (largely freshmen, sophomores, and juniors)


In each of the four samples, rating groups number- 
ing 6-16 men per group were formed in each 
residence or ROTC unit so as to maximize interrater 
familiarity and to minimize status differences owing 
to academic class standing. All subjects in each of 
the samples were volunteers who agreed to par- 
ticipate in their respective groups in a large scale 
psychological testing study. Each subject was paid 
at the rate of $1.00 an hour to insure his co- 
operation and participation throughout all phases of 
the testing program. 

Each rating group was administered the set of 
20 peer nomination scales. The administrative in- 
structions required each person in a rating group 
to nominate one-third of the other members of his 
group (that is, excluding self) on Pole “A” and one- 
third on Pole “B” of each scale. Scores for each 
person in the group on each scale were determined 
by summing nominations received by him on Pole 
A, subtracting the number of nominations received 
on Pole B, adjusting the score for group size, and
adding a constant to eliminate negative values. The
actual formula used in all computations of rating
scale scores was,

$$R_{SS} = 10 + \frac{10}{N}\,(X_A - X_B)$$

where $N$ = number of raters in the group, $X_A$ =
number of “A” nominations received by the subject
from all raters in the group on the given
scale, and $X_B$ = number of “B” nominations
received on the scale.


TABLE 1

ABBREVIATED DESCRIPTIONS OF THE PEER NOMINATION RATING SCALES AND THEIR A PRIORI
FACTOR DESIGNATIONS

                                        Abbreviated scale labelsᵇ
Factor nameᵃ                   Number   Pole A-Pole B
I. Extroversion or Surgency       1     Talkative-Silent
                                  2     Frank, Open-Secretive
                                  3     Adventurous-Cautious
                                  4     Sociable-Reclusive
II. Agreeableness                 5     Goodnatured-Irritable
                                  6     Not Jealous-Jealous
                                  7     Mild, Gentle-Headstrong
                                  8     Cooperative-Negativistic
III. Conscientiousness            9     Fussy, Tidy-Careless
                                 10     Responsible-Undependable
                                 11     Scrupulous-Unscrupulous
                                 12     Persevering-Quitting, Fickle
IV. Emotional Stability          13     Poised-Nervous, Tense
                                 14     Calm-Anxious
                                 15     Composed-Excitable
                                 16     Not Hypochondriacal-Hypochondriacal
V. Culture                       17     Artistically Sensitive-Artistically Insensitive
                                 18     Intellectual-Unreflective, Narrow
                                 19     Polished, Refined-Crude, Boorish
                                 20     Imaginative-Simple, Direct

Note. Based on the findings of Tupes and Christal (1958).
a Pole A.
b For the actual scale labels and instructions to raters employed in data collection see Norman (1961, Appendix).
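
The scoring rule just given can be sketched in a few lines of Python. This is an illustration only, not the program used in these studies: the function name, the group of seven men, and the nomination counts are all hypothetical, and N is taken to be the number of raters who can nominate a given person (an assumption).

```python
def rating_scale_scores(a_counts, b_counts, n_raters):
    """Return R_SS = 10 + (10 / N) * (X_A - X_B) for every group member."""
    if len(a_counts) != len(b_counts):
        raise ValueError("need one A count and one B count per member")
    return [10 + 10 * (a - b) / n_raters for a, b in zip(a_counts, b_counts)]


# Hypothetical group of 7 men: each is rated by the 6 others, and each rater
# names 2 of the 6 on Pole A and 2 on Pole B (14 nominations per pole in all).
a_received = [5, 3, 2, 2, 1, 1, 0]
b_received = [0, 1, 2, 2, 2, 3, 4]

scores = rating_scale_scores(a_received, b_received, n_raters=6)
group_mean = sum(scores) / len(scores)
print([round(s, 2) for s in scores], round(group_mean, 2))
```

Because every rater gives out as many "A" as "B" nominations, the differences sum to zero within a group, so the group mean is 10 whatever the group size.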

There are several points worth noting concerning 
the effects of these procedures for data collection 
and scoring (which are essentially the same as 
those used in prior studies by Tupes and Christal 
and only slightly different from those used by 
Cattell). The first is that they preclude any dif- 
ferences between rating groups in average scores on 
the rating scales. However, the variances and higher
moments of the group distributions are not so 
constrained. Second, if scores are obtained from 
rating groups of markedly different sizes and then 
(as is customary) are combined in a single distribu- 
tion, those scores obtained from the larger groups, 
being based on judgments from more raters, will 
possess higher reliabilities. The effect of this is that
two persons with the same “true score,” one of
whom is a member of a small rating group and
the other of a large one, can be expected to have
systematically different observed scores with the 
person from the smaller group being more extreme. 
The details of this argument, some empirical data 
confirming the existence of this effect, and a pro- 
posed solution to the problem are presented by 
Willingham (1959a, 1959b). However, since the 
magnitude of this bias is relatively small for differences
in group size such as those which obtain
in the present studies, Willingham’s correction was
not employed. And finally the point should be
made that although group mean profiles across 
scales are necessarily congruent and flat, a profile 
of scores for any given individual is in no such 
way constrained, either as to its elevation, scatter, 
or shape components.

For each of the four samples, rating scale scores 
were intercorrelated and separate factor analyses 
were carried out analytically using a principal axes
method of factoring and a normalized varimax
rotation procedure. Initial estimates of communalities
were obtained from the highest correlations
in the columns and communalities were
iterated until they stabilized. The computer program
was set to extract 100% of the common factor
variance but no specification of the number of
factors to be extracted was given. In addition, 
and D (Residence 


distribution of academic class standing. The rating 
scale scores for this regrouping of Samples C and D 


Warren T. Norman 


were also analyzed according to the specifications 
described above. 
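
For readers unfamiliar with the rotation step, a minimal sketch of a normalized varimax rotation follows. It covers only the rotation, not the principal axes extraction, and it is not the program used for these analyses. It assumes the familiar pairwise planar-rotation form of the criterion, in which the angle for each pair of factors satisfies tan 4φ = (D - 2AB/n) / (C - (A² - B²)/n); the function and variable names are hypothetical.

```python
import math


def normalized_varimax(loadings, max_sweeps=50, tol=1e-10):
    """Rotate a loading matrix (list of rows) toward the normalized varimax criterion."""
    n_vars = len(loadings)
    n_factors = len(loadings[0])
    # Normalization: scale each row to unit length (divide by root communality).
    h = [math.sqrt(sum(v * v for v in row)) for row in loadings]
    rows = [[v / hj if hj > 0 else 0.0 for v in row] for row, hj in zip(loadings, h)]
    for _ in range(max_sweeps):
        largest_angle = 0.0
        for p in range(n_factors - 1):
            for q in range(p + 1, n_factors):
                u = [r[p] ** 2 - r[q] ** 2 for r in rows]
                v = [2.0 * r[p] * r[q] for r in rows]
                a, b = sum(u), sum(v)
                c = sum(ui * ui - vi * vi for ui, vi in zip(u, v))
                d = 2.0 * sum(ui * vi for ui, vi in zip(u, v))
                num = d - 2.0 * a * b / n_vars
                den = c - (a * a - b * b) / n_vars
                phi = 0.25 * math.atan2(num, den)  # angle maximizing the criterion
                cos_p, sin_p = math.cos(phi), math.sin(phi)
                for r in rows:
                    x, y = r[p], r[q]
                    r[p] = x * cos_p + y * sin_p
                    r[q] = -x * sin_p + y * cos_p
                largest_angle = max(largest_angle, abs(phi))
        if largest_angle < tol:  # no pair of factors needed further rotation
            break
    # Undo the normalization so that communalities are unchanged.
    return [[v * hj for v in row] for row, hj in zip(rows, h)]


# Hypothetical check: a perfect simple structure, deliberately rotated by 30 degrees.
simple = [[0.8, 0.0], [0.7, 0.0], [0.0, 0.6], [0.0, 0.9]]
t = math.radians(30)
mixed = [[x * math.cos(t) + y * math.sin(t), -x * math.sin(t) + y * math.cos(t)]
         for x, y in simple]
rotated = normalized_varimax(mixed)
```

In this toy case the rotation returns each scale to a single factor and leaves every communality unchanged, which is the property appealed to when the rotated solutions are compared with the a priori salients.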

Factor relationship indices were computed be- 
tween all pairs of factors resulting from the six 
analyses using a method recently developed by 
Henry F. Kaiser.³ In addition, rough approximations
of factor scores were computed by simply summing 
the scores obtained on the four rating scales rep- 
resenting a given factor for each subject. These 
"factor scores" were then intercorrelated separately 
within each of the six analysis samples. 
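
The unit-weighted "factor scores" described above amount to simple sums over the four a priori salient scales of each factor. A minimal sketch follows, assuming the scale numbering of Table 1; the function names are hypothetical, and the small Pearson correlation helper is included only so that the example is self-contained.

```python
import math

# A priori salient scales (1-based scale numbers) for each factor, as in Table 1.
SALIENTS = {
    "I": (1, 2, 3, 4),
    "II": (5, 6, 7, 8),
    "III": (9, 10, 11, 12),
    "IV": (13, 14, 15, 16),
    "V": (17, 18, 19, 20),
}


def unit_weighted_factor_scores(scale_scores):
    """Sum one subject's 20 rating scale scores over each factor's four salients."""
    if len(scale_scores) != 20:
        raise ValueError("expected scores on all 20 scales")
    return {factor: sum(scale_scores[n - 1] for n in numbers)
            for factor, numbers in SALIENTS.items()}


def pearson_r(x, y):
    """Product-moment correlation between two equally long lists of scores."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical subject whose score on scale k is simply k.
example = unit_weighted_factor_scores(list(range(1, 21)))
```

Correlating such sums across the subjects of a sample, one pair of factors at a time, gives a matrix of the kind reported for the "factor scores."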


RESULTS 


The factor tables for Samples C and D are 
presented in Table 2, together with the com- 
munality values for each scale in each of 
these two analyses.⁴ At the bottom of the
table are recorded also the percentage of 
common factor variance attributable to each 
factor in each analysis and an index of factor 
purity. This latter index was computed by 
taking the ratio of the sum of the squared 
loadings for the a priori factor salients di- 
vided by the latent root for that factor, the 
ratio multiplied by 100. 
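
Both summary indices can be computed directly from a rotated factor table. The sketch below assumes that the "latent root" of a rotated factor is the sum of its squared loadings over all scales; the small loading matrix and the function names are hypothetical.

```python
def latent_roots(loadings):
    """Sum of squared loadings for each factor (column) of a loading matrix."""
    n_factors = len(loadings[0])
    return [sum(row[k] ** 2 for row in loadings) for k in range(n_factors)]


def percent_common_variance(loadings):
    """(Latent root / sum of the latent roots) x 100, for each factor."""
    roots = latent_roots(loadings)
    total = sum(roots)
    return [100.0 * r / total for r in roots]


def purity_index(loadings, factor, salient_rows):
    """(Sum of squared loadings for a priori salients / latent root) x 100."""
    root = latent_roots(loadings)[factor]
    salient_ss = sum(loadings[i][factor] ** 2 for i in salient_rows)
    return 100.0 * salient_ss / root


# Hypothetical 4-scale, 2-factor table: rows 0-1 are salient on the first
# factor, rows 2-3 on the second.
toy = [[0.8, 0.1], [0.7, 0.2], [0.1, 0.6], [0.2, 0.9]]
pct = percent_common_variance(toy)
purity_first = purity_index(toy, 0, [0, 1])
purity_second = purity_index(toy, 1, [2, 3])
```

A purity index near 100 means that nearly all of a factor's variance comes from the scales chosen in advance to mark it; lower values mean that other scales are contributing appreciably.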

It can be seen that the communality values
are generally quite high. However, the most 
striking feature of the results in Table 2 is 
the very close approximation to orthogonal 
simple structure achieved in each analysis by 
the varimax rotation procedure. This is most 
easily observed by running down the column 
of factor loadings and observing the difference 
between the loadings of the a priori salients 
(in bold face type) on that factor and the 
loadings of all other scales in that column. 
The sole exception to the general pattern in
any of the six analyses occurred for the
loading of Scale 19 on Factor III in the
factor matrix for Sample D. In that one instance,
a non-a-priori salient has a loading
exceeding two of the a priori salients. In
this one instance, apparently, the obliquity
between Factors III and V was sufficiently
great to make the orthogonal varimax solution
a rather bad fit to the actual data. The
factor purity indices in the last line of the
table also indicate that the a priori salients
for each factor are in fact accounting for
most of the variance attributable to that
factor.

3 The method is described in a dittoed paper entitled
“Relating Factors between Studies Based upon
Different Individuals,” Bureau of Educational Research,
University of Illinois, July 1960. The analyses
reported here were made possible through the kind
cooperation of Henry F. Kaiser.

4 Factor tables, Kaiser factor relationship matrices,
and correlations among unit-weighted-salient factor
scores based on the six analysis samples have been
deposited as Tables A, B, and C with the American
Documentation Institute. Order Document No.
7429, from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress;
Washington 25, D. C., remitting in advance $1.25
for microfilm or $1.25 for photocopies. Make
checks payable to Chief, Photoduplication Service,
Library of Congress.


TABLE 2

FACTOR TABLES BASED ON THE PEER NOMINATION RATING SCALE DATA FROM TWO OF THE
SIX ANALYSIS SAMPLES

                  Sample Cᵃ                                Sample Dᵇ
Scale    I    II   III    IV     V    h²       I    II   III    IV     V    h²
    1  [90]   02   -02    04   -00    81     [90]  -07    06   -04    03    82
    2  [78]  -08    07   -03    07    62     [79]   03   -15    03    05    65
    3  [79]   15   -20    32    01    79     [79]  -03   -16    30   -00    73
    4  [86]   01   -18   -01   -02    77     [90]  -03    01    09   -09    83
    5   17   [80]   17    12    07    72      16   [85]  -04    15    11    78
    6  -10   [64]   20    49    07    70     -20   [72]  -05    40    06    73
    7  -20   [80]   27    19    10    80     -37   [79]   15    07    13    80
    8   33   [74]   28    13    11    76      28   [73]   00    14    12    65
    9  -33   -08   [66]  -35    20    70     -45    02   [43]  -31    34    60
   10  -03    32   [86]   08    18    89     -25    57   [42]   04    41    74
   11  -30    44   [68]  -02    20    79     -53    55   [37]  -10    26    80
   12  -05    28   [74]   12    27    72     -31    39   [33]   12    45    57
   13   01    56    15   [61]   05    70      01    45    11   [58]   17    58
   14   06    21   -10   [82]  -07    73      14    17   -04   [81]  -03    70
   15   13    06    16   [71]   24    60      16    02    02   [71]   27    60
   16   21    27   -00   [65]  -09    55      10    30   -05   [65]  -08    54
   17  -04    08    39   -10   [75]   73     -20    08    27   -08   [60]   48
   18  -04    05    47    04   [74]   78     -01    14    10    08   [84]   78
   19   15    25    53    16   [46]   60      13    32    41    10   [56]   61
   20   12    19    03    10   [68]   52      04    17   -20    09   [70]   56

Percentage of varianceᶜ
        23    22    22    18    15            29    27     7    18    19
Factor Purity Indexᵈ
        85    70    69    77    86            73    65    60    81    72

Note. Decimal points omitted for factor loadings and communalities. All values
rounded to two significant digits. Loadings for a priori salients are enclosed in
brackets (bold face type in the original). Principal axes solutions with
normalized varimax rotations.
a Fraternity groups (N = 215).
b Residence Hall groups (N = 241).
c (Latent root ÷ sum of the latent roots) × 100.
d (Sum of squared loadings for a priori salients ÷ latent root) × 100.


The percentage of variance entries at the
bottom of the factor tables indicates that in
each analysis the factors are accounting for
roughly equivalent proportions of the total
common factor variance, the most pronounced
exception in any of the six analyses
again being Factor III in Sample D. By no 
means is there any indication of a large gen- 
eral factor, for instance, one attributable to 
general evaluative tendencies or social de- 
sirability, present anywhere in these results. 
This is not to say, of course, that these 
dimensions of personality are "value-free"; 
only that the methods of data collection and 
analysis used have rather successfully elim- 
inated the influence of this pervasive and 
well-known class of response sets. Important 
as these sorts of response determiners may 
be as objects of study in their own right, 
the intent here was to get at other differential 
aspects of personality as clearly and as in- 
dependently of “desirability” or “evaluation” 
effects as possible. The forced-choice nomination
method of data collection and the normalized
varimax rotation procedures were
chosen, in fact, partly because they are known
to militate against the influence of such
general factors, though they certainly do
not preclude finding such effects if they are
operative. It is thus reassuring to this writer,
at least, to observe that no factor was found
which even closely resembled such a general
evaluative response set effect in any of these
analyses.


TABLE 3

KAISER COEFFICIENTS OF FACTOR SIMILARITY BASED
ON THE ANALYSES OF THE PEER NOMINATION
RATING SCALE DATA FROM TWO OF THE
SIX ANALYSIS SAMPLES

Sample Dᵃ                  Sample Cᵇ Factor
Factor         I       II      III      IV       V
I           [990]    -023    -136    -009    -029
II           056    [964]     256     006    -049
III          110    -264    [910]     030    -300
IV           004     002    -037    [999]    -014
V            068    -034     295     024    [952]

Note. Decimal points omitted. All values rounded to three
significant digits.
a Residence Hall groups (N = 241).
b Fraternity groups (N = 215).


The factor similarity indices between 
analysis Samples C and D reported in Table 3 
(and those between all other pairs of samples 
as well; cf. Footnote 4, p. 578) also clearly 
support this impression of relative independ- 
ence and clear definition of the five per- 
sonality factors. These values are strictly 
interpretable as correlation coefficients be- 
tween factors after the common factor pro- 
jections of the test vectors (not the factors!) 
have been matched as closely as possible be- 
tween the two studies. 

Of the 75 corresponding factor values ob- 
tained (by relating factor structures between 
all pairs of samples), none was less than .888 
in value. Of the 300 other indices (between 
noncorresponding factors) the greatest mag- 
nitude obtained was a value of .304. This 
method of relating factors is comparatively 
new and an extensive empirical basis for interpreting
these indices has not yet accumulated.
This writer, however, has seen about a
dozen other matrices of this sort computed
between diverse samples using a variety of
rating and self-report personality measures.
In no case has one been seen in which the
correspondence was so close as those obtained 
in the studies reported here. If one simply 
uses as a basis for interpretation the fact that 
these indices are linear correlations, the dif- 
ference between the lowest (of 75) corre- 
sponding factor values, .888, and the mag- 
nitude of the largest (of 300) noncorrespond- 
ing factor values, .304, certainly indicates a 
high degree of correspondence between the 
factor structures across these samples.

One final indication of the degree of rela- 
tionship among these factors is presented in 
Table 4. Here the “factor scores” obtained 
by simply summing each individual’s rating
scale scores for the four a priori salients on
each of the five factors have been correlated
between each pair of factors across persons
within each of the two analysis samples, C
and D (cf. Footnote 4, p. 578). In the matrix
for the fraternity seniors (Sample B), only
the correlation between Factor II and Factor
IV was of appreciable magnitude (.46). In
the remaining five matrices various combina- 
tions of factors did relate somewhat appreci- 
ably, notably II with III, II with IV, and III 
with V. These results, together with the findings
from the factor analyses where occasionally
a scale was found to load moderately
on a factor for which it was not an a priori
salient, do seem to indicate that some degree
of obliquity may exist among certain pairs of
the dimensions of personality for which these
factor measures have been developed.


TABLE 4

"FACTOR SCORE" INTERCORRELATIONS BASED ON THE
PEER NOMINATION RATING SCALE DATA FROM
TWO OF THE SIX ANALYSIS SAMPLES

                       Factor
Factor        I     II    III     IV      V
Sample Cᵃ
I
II           08
III         -27     48
IV           20     55     10
V            02     37     63     18
Sample Dᵇ
I
II          -02
III         -49     49
IV           22     44     05
V           -02     31     59     19

Note. Decimal points omitted. All values rounded to two
significant digits. See text for method used to compute "factor
scores."
a Fraternity groups (N = 215).
b Residence Hall groups (N = 241).


DISCUSSION 


Despite the reservations just mentioned, 
however, it is clear that a relatively orthogo- 
nal and highly stable structure of personal 
characteristics has been identified and that 
reasonably good measures of these character- 
istics are available in situations where the 
peer nomination method of data collection 
can be reasonably employed. It is also reassuring
to note that in our one
sample where the subjects had lived together
for periods of 1-3 years in a very intimate
interpersonal context (the Fraternity Senior
study) the factor structure was somewhat
“cleaner” and the factor scores considerably
less highly intercorrelated. By way of contrast,
the ROTC Senior study employed persons
who had been together only in formal
classroom and military drill situations, but 
who had not, in general, shared common resi- 
dence experiences. In a preliminary analysis 
of the data from this latter study where only 
98% of the common factor variance had been 
called for, a fifth factor failed to emerge. An 
inspection of the factor table indicated that 
Factors II and IV had failed to separate. The
subjects in this study complained more than 
usual that they felt uncomfortable about fill- 
ing out the rating forms and felt unable to 
make the discriminations among some of their 
peers called for by the rating procedure. In- 
spection of the interscale correlations and the 
rating scale scores for given individuals within 
this sample, in fact, did reflect the presence 
of some halo effect. Nevertheless, sufficiently 
differential discriminations apparently were 
made by the respondents despite these diffi- 
culties to produce a reasonably clear and in- 
terpretable five factor structure when ana- 
lyzed according to the revised specifications. 

Some complaints of the sort mentioned 
above are of course to be expected, even from 
groups of intimate associates, when they are 
required to use a forced-nomination rating
procedure. But the question can be raised as
to whether the results obtained here are truly 
a reflection of discriminations made by the 
respondents of the attributes of their peers. 
Perhaps the structures obtained might more 
simply be attributed to just the administra- 
tive and analysis procedure used. If there had 
been no degree of agreement among raters, 
however, or if the nominations (being forced) 
were actually assigned at random, then the 
variance to be expected for such a distribu- 
tion can be, and was, calculated. In every 
sample, the obtained variances on every scale 
were larger than would be expected for such 
a random nominating process. Nor did the 
interscale correlations in any sample cluster 
homogeneously about zero as would be ex- 
pected under such conditions. If, alternatively, 
the ratings had been made on the basis of 
some single general attribute (e.g., “prestige”)
then the correlations among scales would have 
been homogeneous but nonzero and would 
have yielded only a single factor instead of 
the clearly articulated and replicated five fac- 
tor structure actually obtained. It is of course 
still possible that the attributes of the persons 
being rated were not what the raters were 
using as a basis for their nominations. But 
if not, it is hard to imagine what sorts of 
alternative mechanisms could have produced 
the degree of differentiated consensus neces- 
sary to generate these results. 
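
The variance to be expected under purely random nominations can be approximated as follows. This is a sketch under stated assumptions, not the calculation actually carried out for these studies: each of N raters is taken, independently, to place a given subject on Pole A with probability 1/3, on Pole B with probability 1/3, and on neither with probability 1/3, and scores are formed as 10 + (10/N)(X_A - X_B). Each rater then contributes a variance of 2/3 to X_A - X_B, so the expected score variance is (10/N)² × (2/3)N = 200/(3N). The function names are hypothetical.

```python
import random


def expected_variance_random(n_raters):
    """Score variance expected when nominations are assigned at random."""
    return 200.0 / (3.0 * n_raters)


def simulated_variance_random(n_raters, trials=20000, seed=0):
    """Monte Carlo check of the same quantity for a single rated subject."""
    rng = random.Random(seed)
    scores = []
    for _ in range(trials):
        net = 0  # running value of X_A - X_B
        for _ in range(n_raters):
            draw = rng.random()
            if draw < 1.0 / 3.0:
                net += 1      # nominated on Pole A
            elif draw < 2.0 / 3.0:
                net -= 1      # nominated on Pole B
        scores.append(10 + 10 * net / n_raters)
    mean = sum(scores) / trials
    return sum((s - mean) ** 2 for s in scores) / (trials - 1)


expected = expected_variance_random(9)
simulated = simulated_variance_random(9)
```

Obtained variances well above this baseline on a scale indicate that raters agreed with one another about whom to nominate, which is the comparison drawn in the text.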

One fact does remain, however—that even 
groups with rather limited histories of inter- 
personal association (such as those compos- 
ing the ROTC sample or the sample of Peace 
Corps Volunteers to be discussed below) pro- 
duce a highly similar factor structure. This 
may well indicate that the "level" or “depth” 
of the contacts required to produce such re- 
sults may not be very great. (One associate 
has remarked that these factors sound very 
much like the sorts of things she sees in let- 
ters of recommendation!) But this hardly 
seems a defect of a method and set of measures
for assessing and determining the organization
of phenotypic attributes of persons.
To the contrary, what seems somewhat
surprising (and encouraging) is that relatively
naive subjects come to such consensual and
differentiated views of one another on so
many separate dimensions on the basis of
such limited amounts and variety of interpersonal
contact.

Relative to extending the class of subject 
populations in which these five factors recur, 
these analyses go little beyond what has 
been demonstrated previously by Tupes and 
Christal (1958, 1961) in their series of studies.
However, results obtained from one additional
sample recently tested do enlarge
somewhat the scope of these findings. A sample
of 55 Peace Corps Volunteers, composed
of 16 women and 39 men, who were administered
the peer ratings toward the end of their
2-month training period, yielded the same
clear five factor pattern described above. Although
ratings were collected in groups composed
only of like-sex subjects, for the analysis
these groups were combined to yield a
suitable (if minimal) sample size. Consider- 
ing that the period of association for these 
persons had been fairly brief, that some mem- 
bers in each rating group had had only scant 
contact with the others, and that these per- 
sons felt rather strongly that they would not 
be able to rate each other meaningfully on 
these scales, the fact that the analysis did 
yield the same five, clearly defined factors 
augurs well for the general utility of this
method of assessment and for the invariance 
of this factor structure. 

With regard to other behavioral correlates 
of these dimensions, Tupes (1957) has dem- 
onstrated the predictive validity of a certain 
pattern of these characteristics against cri- 
terion ratings of military officer effectiveness. 
These results were obtained on a large sam- 
ple of Air Force Officer Candidate students 
highly selected initially on the basis of meas- 
ured abilities and military aptitudes and de- 
spite the attenuation caused by a consider- 
able dropout rate during the training period. 
More recently, Isaacson, McKeachie, and Milholland⁵
have found that peer nomination
measures on these characteristics account for 
substantial proportions of the variance in stu- 
dent ratings of teaching effectiveness. The 
subjects in these studies were graduate stu- 
dent teaching fellows in the introductory psy- 
chology course at the University of Michigan. 
Rather distinct patterns of scores on these
five factors seem to be indicative of success
in these two contexts and yet a third appears
optimal to account for prognostic judgments
of success in the field by the Peace Corps
Volunteers.

5 Personal communications, 1961.

A major objective of the current research 
program of the author is to develop self-re- 
port or individual performance measures of 
these five personality factors (and for what- 
ever additional factors might be identified 
subsequently). Some degree of success in this 
effort has been achieved to date primarily by 
use of self-report, forced-choice inventory 
methods of data collection and a newly proposed
method for scoring key construction.
A preliminary report of this work is currently 
in press (Norman, 1963). 

Before concluding, it is well to note a couple 
of points made by Tupes and Christal (1961) 
concerning their findings based on the peer 
nomination rating data: 


In many ways it seems remarkable that such sta- 
bility should be found in an area which to date has 
granted anything but consistent results. Undoubtedly 
the consistency has been hidden by the inconsistency 
of factorial techniques and philosophies, the lack of 
replication using identical variables, and disagree- 
ment among analysts as to factor titles. None of the 
factors identified in this study are new. They have 
been identified many times in previous analyses, al- 
though they have not always been called by the 
same names .. . 


It is unlikely that the five factors identified are the 
only fundamental personality factors. There are quite 
likely other fundamental concepts involved among 
the Allport-Odbert adjectives on which the variables 
used in the present study were based [p. 12]. 


It seems to this writer on the basis of the 
results from previous studies and those pre- 
sented here that it is time to return to the 
total pool of trait names in the natural lan- 
guage—there to search for additional person- 
ality indicators not easily subsumed under 
one or another of these five recurrent factors. 
If one were then to attempt the sort of extensive
categorization of these terms described
above; if he were then to collect and analyze 
data based on scales constructed from such 
terms including well defined markers from 
each of the five previously isolated factors in 
each analysis; if samples from diverse sub- 
ject populations were routinely employed; and 
if the length, intimacy, and variety of contacts
subjects had had with one another prior
to data collection were systematically varied 
—then perhaps an adequate taxonomy of per- 
sonal attributes and a useful “observation 
language” for personality as an area of re- 
search and theory building might well result.
DEVALUATION OF FORBIDDEN BEHAVIOR ¹
If a person is induced to cease performing a desired action through the threat
of punishment, he will experience dissonance. His cognition that he is not 
performing the action is dissonant with his cognition that the action is 
desirable. An effective way of reducing dissonance is by derogating the 
action. The greater the threat of punishment the less the dissonance—since 
a severe threat is consonant with ceasing to perform the action. Thus, the 
milder the threat, the greater will be a person's tendency to derogate the 
action. In a laboratory experiment 22 preschool children stopped playing 
with a desired toy in the face of either a mild or severe threat of punishment. 
The mild threat led to more derogation of the toy than the severe threat. 


If a ruler, a parent, or a psychologist wishes 
to elicit or prevent the occurrence of a par- 
ticular response from a citizen, a child, or 
a pigeon, his problem is not a difficult one. 
All he must do is offer a salient reward or 
threaten to inflict a salient punishment. 
Clearly, the more attractive the reward or 
the more severe the punishment, the greater 
the likelihood that the organism will comply. 
But such induced compliance is an inefficient 
method of social control, for one must con- 
tinue to reward or to punish the response in 
order to ensure continued compliance. A much 
more effective technique would entail some- 
how getting the organism to enjoy (or abhor) 
the performance of the act.3 Such a tech-
nique has been suggested by the theory of
cognitive dissonance (Festinger, 1957, 1961; 
Festinger & Aronson, 1960). According to the 
theory, dissonance occurs when an individual 
simultaneously holds two incompatible cog- 
nitions. Dissonance is assumed to be an un- 
pleasant drive state; thus, when an individual 
experiences dissonance he attempts to reduce 
it by changing one or both of his cognitions, 
adding new cognitions, etc. 


1 This research was supported by grants from the
National Science Foundation (NSF-G-16838 and
NSF-G-22316) to Elliot Aronson. The experiment
was conducted while Merrill Carlsmith was on the
tenure of an NSF fellowship. The authors wish to
thank the staff of the Harvard Preschool for their
kind cooperation.

2 Now at Yale University.

3 See Kelman (1961) for an interesting discussion
of this issue.


One situation which often arouses disso-
nance involves the performance of an unpleas-
ant or effortful task for little or no reason. 
That is, if a person finds himself doing some- 
thing which he does not like to do and is in- 
sufficiently rewarded, his cognition that he 
performed an unpleasant task is dissonant 
with his cognition that he received little or 
no compensation for it. He can reduce disso- 
nance in this situation by seeking some other 
justification for having performed the act. 
Previous research in this area has demon- 
strated that an effective way of justifying an 
insufficiently rewarded action is by cognitively 
magnifying the attractiveness of the goal.4
In one experiment, subjects who expended a 
high degree of effort to attain an unattractive 
goal convinced themselves that the goal was 
indeed attractive—whereas subjects who ex- 
pended little effort saw the goal as it was— 
that is, unattractive (Aronson & Mills, 1959). 
Theoretically, the opposite effect should occur 
for punishment. That is, one should be able 
to induce a strong distaste for a previously 
desired action by getting an individual to 
cease performing that action following a mild 
rather than a severe threat of punishment. 
Specifically, if a person is induced to cease 
the performance of a desired act by the threat 
of punishment, his cognition that the act is 


4 This is not to imply that this is the one means
of reducing dissonance in this kind of situation. Al- 
ternative methods of reducing dissonance in similar 
situations have been investigated by Festinger and 
Carlsmith (1959), Mills (1958), and Aronson (1961). 



SEVERITY OF THREAT 


desirable is dissonant with his cognition that 
he is not performing it. A threat of severe 
punishment, in and of itself, provides ample 
cognitions consonant with ceasing the action. 
If a person ceases to perform a desired action 
in the face of a mild threat, however, he lacks
these consonant cognitions and, therefore, 
must seek additional justification for not per- 
forming the desired act. One method of justi- 
fication is to convince himself that the de- 
sired act is no longer desirable. Thus, if a per- 
son is induced to cease performing a desired 
action by a threat of punishment, the milder 
the threat the greater will be his tendency to 
derogate the action. 


METHOD 


The general procedure involved having young chil-
dren evaluate several toys, issuing either a mild or a
severe threat of punishment for playing with one
specific toy, and then asking the children to re-evaluate
the toys at the close of the experiment. Through this
technique we could compare the effect of a mild 
threat with that of a severe threat on the attractive- 
ness of playing with the forbidden toy. 

The subjects were 22 children at the Harvard Pre- 
school, 11 girls and 11 boys, ranging in age from 3.8 
to 4.6 years.5 The experimental room was a large 
playroom familiar to all subjects. It contained a one- 
way observation mirror and a low table on which 
the experimenter could display five toys. The toys 
used were a battery-powered tank, a steam shovel, 
a set of plastic gears, a battery-powered fire engine, 
and a set of dishes and pans. The toys were all at- 
tractive to the children, and an opportunity to play 
with them was met with enthusiasm. Prior to the 
beginning of the experiment, the experimenter spent 
several weeks at the nursery school playing with the 
children, so that all the children knew him well when 
the experiment began. 

The experimenter led each subject into the experi- 
mental room, closed the door, and showed the sub- 
ject the toys. He demonstrated how each toy worked, 
and allowed the subject to play with it briefly be- 
fore moving on to the next one. After the subject 
was familiar with all the toys, the experimenter sug- 
gested a “question game,” following which the sub- 
ject would have a chance to play with the toys. The 
experimenter placed all the toys on the floor and sat 


5 An additional six children were run through part, 
but not all, of the experiment: two failed to com- 
plete the experiment because they moved from town; 
three were unable to make consistent rankings by 
the method of paired comparisons, and so were dis- 
carded; one did not wish to continue with the ex- 
periment. Only data from those children who com- 
pleted all of the experiment are included in the 
analysis.

on the opposite side of the low table from the sub-
ject. Putting two of the toys on the table (for ex- 
ample, the steam shovel and the tank) he asked: 


Suppose you could either play with the steam 
shovel [picking it up], or the tank [picking it up]. 
Which one would you rather play with? 


After the subject had responded, the experimenter 
replaced the two toys on the floor, put two others 
on the table, and continued until the subject had 
made choices between all 10 pairs. By this procedure, 
a ranking was elicited, from the most preferred toy 
(1) to the least preferred toy (5). With children this 
young, it was inevitable that there would be some 
inconsistencies in the paired comparisons. Three sub-
jects gave judgments which were completely incon-
sistent; they were not run through the remainder of 
the experiment and their results were discarded. In 
a few other cases, the reversal of one paired-com- 
parison judgment led to a tie in the ranking of three 
of the toys. In these cases, the three toys were placed 
on the table and the experimenter pestered the child 
until the tie was broken. Such cases were surprisingly 
rare. The great majority of subjects were able to 
rank the toys in a consistent manner. 
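
The paired-comparison procedure described above can be illustrated with a minimal sketch. It assumes, as one plausible reading of the method, that a toy's rank is determined by the number of pairs in which it was chosen; the article does not spell out the tallying rule, and the children's choices below are hypothetical. The sketch shows why five toys yield 10 pairs and why reversing a single judgment can produce a three-way tie.

```python
from itertools import combinations

# The five toys named in the Method section.
TOYS = ["tank", "steam shovel", "gears", "fire engine", "dishes"]


def rank_from_choices(choices):
    """Rank toys by number of wins in paired comparisons.

    `choices` maps each pair (a, b) to the preferred toy. This win-count
    rule is an assumption made for illustration.
    Returns (toys ordered from most to least preferred, list of tied groups).
    """
    wins = {toy: 0 for toy in TOYS}
    for preferred in choices.values():
        wins[preferred] += 1
    ordered = sorted(TOYS, key=lambda toy: -wins[toy])
    groups = [
        [toy for toy in TOYS if wins[toy] == w]
        for w in sorted(set(wins.values()), reverse=True)
    ]
    tied = [group for group in groups if len(group) > 1]
    return ordered, tied


pairs = list(combinations(TOYS, 2))  # every unordered pair of toys

# Hypothetical, perfectly consistent child: prefers the toy listed earlier.
consistent = {pair: pair[0] for pair in pairs}
ranking, tied = rank_from_choices(consistent)

# Hypothetical child who reverses one judgment (tank versus gears).
one_reversal = dict(consistent)
one_reversal[("tank", "gears")] = "gears"
ranking_rev, tied_rev = rank_from_choices(one_reversal)

print(len(pairs), ranking, tied)
print(tied_rev)
```

With the single reversal, the tank, steam shovel, and gears each win three comparisons, which is the kind of tie the experimenter had to break by presenting the three toys together.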

After the subject ranked the toys, the experimenter 
picked up the second-ranked toy and placed it on 
the table in the center of the room. He arranged the 
remaining toys on the floor, and said: 


I have to leave now for a few minutes to do an 
errand. But why don’t you stay here and play with 
these toys while I am gone? I will be right back. 
You can play with this one [pointing], this one, 
and this one. But I don’t want you to play with 
the ————— [indicating the second-ranked toy]. 


At this point the experimental conditions were in- 
troduced. In the Mild Threat condition, the experi- 
menter continued: 


I don’t want you to play with the ————. If 
you played with it, I would be annoyed. But you 
can play with all the others while I am gone, and 
I will be right back. 


In the Strong Threat condition, the experimenter 
continued: 


I don’t want you to play with the ————. If 
you played with it, I would be very angry. I 
would have to take all of my toys and go home 
and never come back again. You can play with all 
the others while I am gone, but if you played with 
the ————, I would think you were just a baby.
I will be right back. 


The experimenter then left the room, observed the 
subject for 10 minutes through the one-way mirror, 
returned, and again allowed the subject to play 
briefly with all the toys, including the forbidden toy. 
After the subject had played with all the toys, the 
experimenter suggested that they play the question 
game again, after which they would play together 
with all the toys. The experimenter administered the
paired-comparison procedure exactly as before, and
then played with the subject for a short while. 

Each subject was tested in both conditions, with a 
period averaging 45 days separating the two condi- 
tions. The order in which the subjects were run 
through the two conditions was randomized—one 
half of the subjects were run in the Mild Threat 
condition first and one half were run in the Strong 
Threat condition first. No subject played with the 
forbidden toy during the 10 minutes he was alone 
with it. One or two subjects reached out a hand and 
tentatively touched the toy, but none went so far as 
to pick it up, much less operate it. 

The second-ranked toy was chosen as the crucial 
(forbidden) toy because this toy was very attractive 
to the subject, thus creating appreciable dissonance 
if he did not play with it; and this allowed the sub- 
ject the opportunity to change his evaluation in either 
direction. 


RESULTS AND DISCUSSION 


Our hypothesis was that a mild threat of 
punishment for playing with a desired toy 
would lead to a devaluation of that toy while 
a severe threat would not. The dependent 
variable in this experiment is the change in the 
subject's relative ranking of the crucial toy, 
that is, the difference between its attractive- 
ness before and after the threat was adminis- 
tered. The results are presented in Table 1 
which indicates the number of subjects in the 
two experimental conditions who decreased 
their liking for the toy, increased their liking 
for the toy, or did not change their liking for 
the toy. In the Mild Threat condition, for 
8 of the 12 subjects whose preferences 
changed at all, the crucial toy underwent a de- 
crease in attractiveness. In the Severe Threat 
condition, however, none of the 14 subjects 
who changed their evaluations of the toy 
showed a decrease. To test the significance 
of this difference, we computed a difference 
score for each subject. His second ranking of 
the crucial toy in the Severe Threat condi- 
tion was subtracted from his second ranking 
of the crucial toy in the Mild Threat condi-
tion.6 These difference scores yielded highly
significant results (p < .003 by randomiza-
tion test).
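
The logic of a randomization test on paired difference scores can be sketched as follows. This is only an illustration: the article does not report the individual difference scores or the exact variant of the test, so the data below are hypothetical and the sign-flip scheme is an assumption. Under the null hypothesis that threat severity makes no difference, each subject's difference score is equally likely to be positive or negative, so the observed sum is compared with the sums obtained under every possible assignment of signs.

```python
from itertools import product


def sign_flip_p_value(diffs):
    """One-sided exact randomization p-value for paired difference scores.

    Counts the share of sign assignments whose summed score is at least
    as large as the observed sum.
    """
    observed = sum(diffs)
    as_extreme = 0
    total = 0
    for signs in product((1, -1), repeat=len(diffs)):
        total += 1
        if sum(s * d for s, d in zip(signs, diffs)) >= observed:
            as_extreme += 1
    return as_extreme / total


# HYPOTHETICAL scores (second ranking under mild threat minus second ranking
# under severe threat); positive values mean more derogation under mild threat.
hypothetical_diffs = [2, 1, 1, 0, 3, 1, 2, 0, 1, 1]
p_value = sign_flip_p_value(hypothetical_diffs)
print(p_value)
```

Full enumeration is practical only for small samples; with 22 subjects there are more than four million sign assignments, so a random sample of assignments would be a reasonable substitute.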

Although the difference between the two 
conditions is striking, a good deal of this dif- 


6 Since the same subject participated in both ex-
perimental conditions, it should be pointed out that
there were no systematic effects due to order of
testing.

TABLE 1
CHANGE IN ATTRACTIVENESS OF FORBIDDEN TOY

                              Rating
Strength of threat    Increase    Same    Decrease
Mild                      4        10        8
Severe                   14         8        0


ference is due to the fact that, in the Severe 
Threat condition, the subjects actually came 
to increase their liking for the crucial toy. 
This increase in attractiveness tends to intro- 
duce some ambiguity in the interpretation of 
the results. It must be determined whether 
this increase was a function of the severe
threat or of some other aspect of the experi-
mental situation. A parsimonious and highly
plausible explanation for this result is that 
the actual baseline in the experiment was not 
zero change. That is, there may be something 
in the specific procedure which caused a con- 
stant increase in the attractiveness of the for- 
bidden toy, irrespective of the degree of threat 
imposed. For example, it might be that by 
calling attention to that toy, the experimenter 
had enhanced its value either by mere em- 
phasis or by creating the impression that he, 
personally, was interested in that toy. More- 
over, by playing with the remaining four toys, 
the subject may have become satiated with 
them, thus increasing the relative attractive- 
ness of the crucial toy. It should be empha- 
sized that, if either of these processes oc-
curred, their effect would have been identical
in both experimental conditions; thus, if such 
a process took place, the tendency for the sub- 
jects to derogate the toy in the Mild Threat 
condition occurred in spite of a general tend- 
ency for the toy to become more attractive. 
To determine whether either of these proc- 
esses did, in fact, occur, it was necessary to 
run a condition in which no threat was ad- 
ministered but in which the experimenter did 
something to call the subject’s attention to 
one of the toys, and in which the subject was 
prevented from playing with that toy. This 
was accomplished in the following manner. 
Several weeks after the subjects had com- 
pleted both experimental conditions, the ex- 
perimenter randomly selected one half of the
subjects and ran each through an experi-
mental procedure which was identical except 
for one change: instead of forbidding the sub- 
ject to play with the second-ranked toy, the 
experimenter merely picked it up and took it 
with him when he left the room. It can be 
seen that this control condition establishes a 
proper baseline for the experiment. If, in this
condition (No Threat), the subjects did not 
change their rankings of the crucial toy in a 
systematic direction, then it would be clear 
that the proper baseline was zero change. This 
would mean that the positive shift which oc- 
curred in the Severe Threat condition was a 
function of the severe threat. If, on the other 
hand, the subjects in the No Threat condi- 
tion systematically increased their liking for 
the crucial toy, this would indicate that the 
apparent increase in the Severe Threat condi- 
tion was not due to the severity of the threat. 
Rather, it would suggest that the increase was 
attributable to emphasis, satiation with the 
other toys, or some similar process which was 
common to both the Severe Threat and the 
Mild Threat conditions. The results support 
the second interpretation. Of the 11 subjects 
in the No Threat condition, 7 increased their 
evaluations of the toy, 4 did not change, and 
none decreased. The results are virtually 
identical with those in the Severe Threat con- 
dition and significantly different from those 
in the Mild Threat condition. This indicates 
that in the Mild Threat condition, the trend 
toward devaluation of the crucial toy oc- 
curred in spite of the general tendency to 
overrate the toy. 
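
The claim that the No Threat results are virtually identical with the Severe Threat results can be checked by converting the reported counts to proportions. The sketch below uses only the counts given in Table 1 and in the text; the function and variable names are illustrative.

```python
# (increase, same, decrease) counts as reported in Table 1 and in the text.
counts = {
    "Mild Threat": (4, 10, 8),
    "Severe Threat": (14, 8, 0),
    "No Threat": (7, 4, 0),
}


def proportions(row):
    """Convert a row of counts to proportions rounded to three decimals."""
    total = sum(row)
    return tuple(round(count / total, 3) for count in row)


summary = {name: proportions(row) for name, row in counts.items()}
for name, (inc, same, dec) in summary.items():
    print(f"{name:14s} increase={inc:.3f} same={same:.3f} decrease={dec:.3f}")
```

The Severe Threat and No Threat rows reduce to the same proportions (14 of 22 and 7 of 11 both increased), whereas only the Mild Threat row contains any decreases.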

The results are consistent with the theory 
of cognitive dissonance. In the Severe Threat 
condition, an individual's cognition that he 
did not play with an attractive toy was con-
sonant with his cognition that he would have
been severely punished if he had played with 
the toy. There was no need for him to pro- 
vide further justification for his abstinence. 
However, when he refrained from playing with 
the toy in the absence of a severe threat, he 
experienced dissonance. His cognition that he 
did not play with the toy was dissonant with 
his cognition that it was attractive. In order 
to reduce dissonance, he derogated the toy. 

We have implied that through the medium 
of dissonance reduction, a lasting change in
values could be evoked. Thus far we have
demonstrated that devaluation does occur 
where dissonance is aroused and does not 
occur in the absence of dissonance. But just 
one mild threat, although arousing dissonance 
and leading to devaluation, is a tiny fraction 
of a child’s life. Accordingly, it would seem 
ambitious indeed to expect this event to have 
produced a lasting distaste for the crucial toy. 
In order to achieve a more or less permanent
devaluation, consistently mild threats over an 
extended period of time would appear to be 
essential. Nevertheless, it would be of interest 
to examine the duration of this effect. Data 
are available to test for duration differences 
between the experimental conditions. The 
reader will recall that the same subjects par- 
ticipated in both experimental conditions. One 
half of the subjects were run first in the Severe 
Threat condition, and about 45 days later, 
were run in the Mild Threat condition; one 
half of the subjects were run in the reverse 
sequence. Thus, it is possible to assess the 
relative duration of effects due to mild and 
severe threats by comparing the rankings 
made by the subjects who were first run in 
the Mild Threat condition with those of the 
subjects who were first run in the Severe 
Threat condition. The child's premanipula- 
tion ranking of the toys during the second 
experimental session provides the necessary 
data. Thus, it is possible to see where the for-
bidden toy, originally ranked second, was
ranked 45 days after the experimental ma-
nipulation. A rank lower than second in the
dissonance condition (Mild Threat) indicates
a long-lasting derogation of the toy. The re-
sults are summarized in Table 2. 

In the Mild Threat condition, of the six 
subjects who changed at all, five showed a 
decrease after 45 days. In the Severe Threat 


TABLE 2
CHANGE IN ATTRACTIVENESS OF FORBIDDEN TOY

                              Rating
Strength of threat    Increase    Same    Decrease
Mild                      1         5        5
Severe                    3         6        2

Note. Forty-five days later.

condition, two of the five subjects who changed
showed a decrease. These results represent a 
trend in the expected direction, although with 
an N of only 11, they do not reach an ac-
ceptable level of significance.

It is interesting to note that the long-term 
effect of a mild threat was very similar to the 
immediate effect. When the threat was severe, 
however, this did not appear to be the case. 
In the Severe Threat condition, although 64% 
of the subjects initially increased their rank- 
ing of the forbidden toy, only 27% ranked 
their toy higher after 45 days, suggesting only 
an ephemeral effect. 
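
The percentages quoted above follow directly from Tables 1 and 2, as the short check below shows. The comparison of immediate and delayed effects in the Mild Threat condition, expressed as the share of subjects who changed at all and who showed a decrease, is an illustrative summary and not a statistic reported by the authors.

```python
def pct(part, whole):
    """Percentage rounded to the nearest whole number."""
    return round(100 * part / whole)


# Severe Threat: share of all subjects whose ranking of the toy increased.
severe_immediate = pct(14, 22)  # Table 1
severe_delayed = pct(3, 11)     # Table 2

# Mild Threat: among subjects who changed at all, share showing a decrease.
mild_immediate = pct(8, 12)     # Table 1: 8 of the 12 who changed
mild_delayed = pct(5, 6)        # Table 2: 5 of the 6 who changed

print(severe_immediate, severe_delayed, mild_immediate, mild_delayed)
```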

Hopefully, the effect demonstrated in this 
experiment may be generalizable beyond mere 
toy preferences; that is, mild rather than 
severe threats of punishment may be an ef- 
fective means of inducing the formation of a 
system of values in children. For example, the 
results of this experiment suggest that if a 
parent were to administer a mild threat of 
punishment for aggressive behavior it might 
induce children to derogate aggressiveness.
Thus, a mild threat might be more effective 
than a severe threat in the ultimate reduction 
of aggressive behavior. This speculation is 
consistent with data from research in child 
development which indicates a positive corre- 
lation between aggressiveness in children and
the severity of parental punishment for ag-
gression (for example, Sears, Maccoby, &
Levin, 1957; Sears, Whiting, Nowlis, & Sears,
1953).

724 Ss of ages 6-22 were told that they were to be tested for imaginative
ability, and were then given 8 standardized test suggestions as follows: Arm 
Lowering, Arm Levitation, Hand Lock, Thirst “Hallucination,” Verbal In- 
hibition, Body Immobility, "Posthypnotic-Like" Response, and Selective 
Amnesia. The sexes did not differ in response to the suggestions. Ss between 
6 and 12 were more “suggestible” than adults. Children of 8-10 showed the 
highest level of response. No differences in “suggestibility” were found among 
Ss of ages 14-22. Suggestibility in both children and adults was inversely 


related to “conscious” (verbalizable) resistance to the test suggestions. 


A consensus exists that children are more
responsive than adults to the type of sug-
gestions that have been historically associ- 
ated with the word “hypnosis.” This con- 
sensus is based on a number of older studies, 
summarized by Weitzenhoffer (1953, 1959) 
and Tromator (1961), which used varying 
criteria of “suggestibility” or “hypnotiza- 
bility” and which were conducted under non- 
standardized conditions. To our knowledge, 
children and adults were assessed on re- 
sponses to a series of suggestions of the type 
associated with the word hypnosis in only 
two experimental studies performed under 
carefully standardized conditions. In one of 
these studies, presented by Stukát (1958), 
three standardized tests of "primary" or 
“hypnotic-like” suggestibility—body sway, 
arm lowering, and Chevreul pendulum—were 
administered to a large group of subjects of 


1 This research was supported by a grant
(MY4825) from the National Institute of Mental 
Health, United States Public Health Service. 

The authors wish to express their appreciation to 
the following persons of Massachusetts schools for 
their cooperation in the conduct of the study: John 
H. Gotschall, Superintendent of Auburn schools; 
Alfred L. Jones, Principal of the Julia Bancroft 
Elementary School, and J. Arden Woodall, Prin- 
cipal of Auburn Junior High School; Harry Y. 
Hilyard, Superintendent of Grafton schools; Waldren 
P. Lojko, Principal, and Wallace Johnson, Vice 
Principal, of Grafton Junior-Senior High School; 
Doris Heusel, Supervisor of Student Nurses at Med- 
field State Hospital; John Elberfeld, Dean, and 
James Eteson, President, of Worcester Junior 
College; and the teachers at the Auburn schools, 
the Grafton schools, and at Worcester Junior 
College. 


different age levels. This worker, however, 
presented his results separately for the chil- 
dren and the adults and failed to make a 
direct comparison between the two groups. 
In the second study, presented by London 
(1962a, 1962b), 57 subjects of ages 6-16 
were assessed on the Stanford Hypnotic Sus- 
ceptibility Scale as revised and adapted for 
children. The 6-16 age group was found to 
be more responsive to the test suggestions 
than 124 college students. Three considera- 
tions, however, suggest caution in drawing 
conclusions from the London study: only a 
small number (4-6 subjects) were tested at 
each age level from 6 to 16; the younger 
subjects were tested by investigators at 
Illinois while the college students were tested 
by other investigators at Stanford; the 
younger subjects were “volunteered” by their 
parents to be “hypnotized” and this selection 
process may have rendered the sample un- 
representative of American children. 

To formulate a general theory of “hypnotic 
behavior,” reliable data are required con- 
cerning the development of “hypnotic-like” 
suggestibility. It is extremely difficult, how- 
ever, to obtain such data if children and 
their parents and teachers are told that 
hypnosis is to be employed and if a “trance 
induction procedure” is actually used. Few 
parents and teachers are willing to permit 
their children or their pupils to be hypno- 
tized and if some subjects are obtained by 
this method it appears unlikely that they 
will constitute a representative sample. It is 
possible, however, to approach the problem
by assessing response to suggestions without
formally defining the situation as hypnosis 
and without administering a formal trance 
induction procedure. Employing this method, 
we were able to secure a large group of 
middle-class American children and adults as 
experimental subjects. The investigation was 
conducted as follows. 


METHOD 
Subjects 


The subjects were 724 students (388 males and 
336 females) of ages 6-22. Of these 484 were 
elementary school students (ages 6-13); 119 were 
high school students (ages 14-17); and 121 were 
college students and nursing students (ages 18-22). 
The number of subjects in each subgroup, classified 
in terms of year in school, approximate age, and 
sex, is presented in Table 2 (Columns 1-3). 

All subjects were of the Caucasian race. The 
elementary and high school students resided in 
middle-class suburbs composed primarily of profes- 
sional people, white-collar workers, and skilled tech- 
nicians; with a few possible exceptions, the college 
students and nursing students also came from 
middle-class backgrounds. 

None of the subjects were volunteers. The ele- 
mentary school students participated routinely after 
arrangements for the experiment were made with 
the school principal and with the teachers of each 
class. The high school students were selected at 
random from study hall classes after arrangements 
were made with the high school principal and with 
the study hall teachers. The college students par-
ticipated in the experiment to fulfill the require-
ments of a course in elementary psychology. The 
nursing students were secured by arrangements made 
with the nursing supervisor. 


Assessment of Response to Suggestions 


All subjects, and all teachers and school adminis- 
trators, were told that a test of imagination was to 
be administered. All subjects were tested in indi-
vidual sessions by one experimenter (DSC). Each
experimental session subsumed a period of about
10-12 minutes. The experimental sessions were held
in one room at each school provided by the school
administrator. During regular school hours, the sub-
jects were sent to the experimental room one at a
time by the teachers.

Children of ages 6-7 were given the following
preliminary instructions:

We are going to play a game of imagination
or make-believe. I want you to close your eyes
and to keep them closed and to think about what
I say.

The other subjects (ages 8-22) were instructed
as follows:

I am going to test your ability to imagine.
Since you can imagine better with your eyes
closed, I want you to close your eyes and to keep
them closed and to try to imagine the things
I say.

Each subject was tested individually on response to the
Barber Suggestibility Scale. This scale includes
eight standardized test suggestions, administered as
follows:2


1. Arm Lowering
Hold your right arm straight out in front of
you like this. [The experimenter guides the subject
to extend the right arm directly in front of the
body at shoulder height and parallel to the floor.]
Concentrate on your arm and listen to me.
[Begin timing] Imagine that your right arm is
feeling heavier and heavier, and that it’s moving
down and down. It’s becoming heavier and heavier
and moving down and down. It weighs a ton!
It’s getting heavier and heavier. It's moving down
and down, more and more, coming down and
down, more and more; it’s heavier and heavier,
coming down and down, more and more, more
and more. [End 30 seconds]
You can relax your arm now.
Scoring criterion: 1 point for response of 4
inches or more.
2. Arm Levitation
Keep your eyes closed and put your left arm
straight out in front of you in the same way.
Concentrate on your arm and listen to me.
[Begin timing] Imagine that the arm is be-
coming lighter and lighter, that it’s moving up
and up. It feels as if it doesn’t have any weight
at all, and it’s moving up and up, more and
more. It’s as light as a feather, it’s weightless
and rising in the air. It’s lighter and lighter,
rising and lifting more and more. It’s lighter
and lighter and moving up and up. It doesn't
have any weight at all and it's moving up and
up, more and more. It's lighter and lighter,
moving up and up, more and more, higher and
higher. [End 30 seconds]
You can relax your arm now.
Scoring criterion: 1 point for response of 4
inches or more.
3. Hand Lock
Keep your eyes closed. Clasp your hands to-
gether tightly. [Experimenter explains interlacing
the fingers of both hands with palms together.]
Put them in your lap. Concentrate on your hands
and hold them together as tightly as you can.
[Begin timing] Imagine that your hands are
two pieces of steel that are welded together so
that it’s impossible to get them apart. They’re
stuck, they’re welded, they’re clamped. When I


2 The Barber Suggestibility Scale, which was
originally constructed by adapting items and adding
new items to two scales previously used by Barber
(1956a, 1956b, 1956c, 1957a, 1960), has also been
used in other recent studies (Barber & Calverley,
1962, 1963, in press; Barber & Glass, 1962; Barber
& Hahn, 1962, 1963; Glass & Barber, 1961).

ask you to pull your hands apart they'll be
stuck and they won't come apart no matter how 
hard you try. They're stuck together; they're 
two pieces of steel welded together. You feel as
if your fingers were clamped in a vise. Your
hands are hard, solid, rigid! The harder you
try to pull them apart the more they will stick
together! It's impossible to pull your hands
apart! The more you try the more difficult it
will become. Try, you can't. [End 45 seconds]

[5-second pause] Try harder, you can’t. [10-
second pause] You can unclasp your hands now.

Scoring criteria: 1/2 point for incomplete sepa-
ration of hands after 5-second effort; 1 point
for incomplete separation after 15-second effort.

4. Thirst “Hallucination” 

Keep your eyes closed. 

[Begin timing] Imagine that you’ve just fin- 
ished a long, long walk in the hot sun. You've 
been in the hot sun for hours, and for all that 
time you haven't had a drink of water. You've 
never been so thirsty in your life. You feel
thirstier and thirstier. Your mouth is parched,
your lips are dry, your throat is dry. You have
to keep swallowing and swallowing. You need
to moisten your lips. [3-second pause] You feel
thirstier and thirstier, drier and drier. Thirstier
and thirstier, dry and thirsty. You're very, very
thirsty! Dry and thirsty! Dry and thirsty! [End
48 seconds] Now, imagine drinking a cool, re-
freshing glass of water. [5-second pause]

Scoring criteria: 1/2 point if subject shows
swallowing, moistening of lips, or marked mouth
movements; additional 1/2 point if the subject
indicates during the “post-experimental” question-
ing that he became thirsty during this test.

5. Verbal Inhibition 

Keep your eyes closed. 

[Begin timing] Imagine that the muscles in 
your throat and jaw are solid and rigid, as if 
they're made of steel. They're so solid and rigid, 
that you can't speak. Every muscle in your throat
and mouth is so tight and so rigid that you can't
say your name. The harder you try to say your
name the harder it becomes! You can't talk! Your
larynx has tightened up; your throat and jaw
feel as if they are in a vise. Your throat is
clamped so tightly that you can't talk; you can't
say your name. The harder you try the harder
it will be. It’s useless, the words won't come
out; you can't speak your name; it's impossible
to talk! The harder you try to say your name
the harder it will become. Try, you can't! [End
45 seconds]

[5-second pause] Try harder; you can’t.
[10-second pause] You can say your name now.

Scoring criteria: 1/2 point if the subject does
not say his name after 5-second effort; 1 point
if he does not say his name after 15-second
effort.

6. Body Immobility 

Keep your eyes closed. 

[Begin timing] Imagine that for years and 
years you’ve been sitting in that chair just as 


you are now. Imagine that you’ve been sitting 
in that chair so long that you’re stuck to it! 
It’s as if you're part of the chair. Your whole 
body is heavy, rigid, solid and you weigh a ton. 
You're so heavy that you can’t budge yourself. 
It’s impossible for you to stand up, you're stuck 
right there! Vour body has become part of the 
chair. When I ask you to stand up you won't 
be able to do it! Vou're stuck tight. The harder 
you try the tighter you'll be stuck and you won't 
be able to get up. Youtre heavy in the chair! 
Stuck in the chair; you can't stand up. You're 
so heavy and stuck so tight. You can't stand up; 
you're stuck. Try, you can’t! [End 45 seconds] 

[5-second pause] Try harder; you can’t. [10- 
second pause] You can relax [or sit down] now. 

Scoring criteria: ½ point if the subject is not 
standing fully erect after 5-second effort; 1 point 
if not standing fully erect after 15-second effort. 

7. “Posthypnotic-Like” Response 

[Begin timing] When this experiment is over in 
a few minutes and your eyes are open, I'll click 
like this [experimenter taps once on the metal 
back of a stopwatch with a fountain pen] and 
you'll cough automatically. At the moment I 
click [experimenter presents auditory stimulus] 
you'll cough. It will happen automatically. When 
I click like this [experimenter presents auditory 
stimulus] you'll cough immediately; I'll click and 
you'll cough. When your eyes are open I'll click 
[auditory stimulus is presented] and you'll cough. 
When I click you'll cough. [End 30 seconds] 

Scoring criterion: 1 point if subject coughs or 
clears his throat “post-experimentally” when pre- 
sented with the auditory stimulus. 

8. Selective Amnesia 

Your eyes are still closed but I’m going to ask 
you to open them in a minute. When they're open 
I'm going to ask you to tell me about these 
tests. 

[Begin timing] You'll remember all the tests 
and be able to tell me about them, all except 
for one. There's one that you'll completely forget 
about as if it never happened! That's the one 
where I said your arm was becoming lighter 
and moving up and up. You'll forget all about 
that and when you try to think about it, it will 
slip even further away from your mind. You 
will forget completely that I told you that your 
arm was becoming lighter. This is the one test 
that you cannot remember! You will remember 
that I said your arm was heavy and all the other 
tests will be perfectly clear but the harder you 
try to remember that I told you your arm was 
rising the more difficult it will become. You will 
not remember until I give you permission by 
saying, “Now you can remember,” and then, and 
only then, you will remember that I said your 
arm was rising. [End 45 seconds] 

Scoring criterion: 1 point if the subject does 
not refer to the amnesic task (Test Suggestion 2) 
but recalls at least four other test suggestions 
and then verbalizes Test Suggestion 2 (Arm Levi- 
tation) in response to the cue words. 

“Post-experimental” scoring of Test Suggestions 
4, 7, and 8. The subject is next told: “Open your 
eyes, the experiment is over.” After the subject 
has opened his eyes and before conversation com- 
mences, the experimenter scores Test Suggestion 7 
(“Posthypnotic-Like” Response) by presenting the 
auditory stimulus. 

Test Suggestion 8 (Selective Amnesia) is then 
scored as follows: The experimenter asks: “How 
many of the tests can you remember?” The experi- 
menter prompts the subject by asking, “Were there 
any others?” “Can you think of any more?” and 
“Is that all?” until the subject mentions at least 
four of the test suggestions. If the subject verbal- 
izes Test Suggestion 2 (Arm Levitation) during his 
recital, he receives a score of 0 on Test Suggestion 8 
(Selective Amnesia). If the subject does not include 
Test Suggestion 2 in his enumeration, the experi- 
menter states, “Now you can remember,” and, if 
the subject still does not verbalize Test Suggestion 2, 
“You can remember perfectly well now!” The sub- 
ject receives a score of 1 point on Test Suggestion 8 
(Selective Amnesia) if he mentions at least four 
of the test suggestions, but does not mention Test 
Suggestion 2 before he is given the cue words, and 
verbalizes Test Suggestion 2 when given the cue 
words, “Now you can remember,” or, “You can 
remember perfectly well now!” 

The final scoring of Test Suggestion 4 (Thirst 
“Hallucination”) occurs when the subject refers 
to this test suggestion during his recital. At this 
point the experimenter asks: “Did you become 
thirsty during this test?” If the subject answers 
Yes to this question he receives the additional ½ 
point on Test Suggestion 4. If the subject answers 
Yes but adds a qualifying statement, e.g., he had 
been thirsty to begin with, he is asked: “Did the 
imaginary glass of water help quench your thirst?” 
If the subject now answers Yes he receives the 
additional ½ point. 

The maximum score obtainable on the test 
suggestions was 8 points. 
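
The scoring criteria for Test Suggestions 4 through 8 can be sketched in code. This is a minimal illustration under stated assumptions, not part of the published scale: the function names, argument names, and the example subject are hypothetical, and Test Suggestions 1 through 3 are omitted because their criteria are given elsewhere in the article.

```python
from fractions import Fraction

HALF = Fraction(1, 2)
AMNESIC_ITEM = 2  # Arm Levitation, the suggestion targeted by Test Suggestion 8


def score_thirst(showed_signs, reported_thirst):
    """Item 4: 1/2 point for observed swallowing or mouth movements,
    plus 1/2 point for a post-experimental report of thirst."""
    observed = HALF if showed_signs else Fraction(0)
    reported = HALF if reported_thirst else Fraction(0)
    return observed + reported


def score_challenge(failed_at_5s, failed_at_15s):
    """Items 5 and 6: 1/2 point if the act is not performed after the
    5-second effort, 1 point if it is still not performed after 15 seconds."""
    if failed_at_5s and failed_at_15s:
        return Fraction(1)
    if failed_at_5s:
        return HALF
    return Fraction(0)


def score_posthypnotic(coughed_on_click):
    """Item 7: 1 point if the subject coughs or clears his throat at the click."""
    return Fraction(1) if coughed_on_click else Fraction(0)


def score_selective_amnesia(named_before_cue, named_after_cue):
    """Item 8: 1 point only if Item 2 is omitted before the cue words,
    at least four other items are named, and Item 2 is named after the cue."""
    before = set(named_before_cue)
    if AMNESIC_ITEM in before:
        return Fraction(0)
    recovered = AMNESIC_ITEM in set(named_after_cue)
    return Fraction(1) if len(before) >= 4 and recovered else Fraction(0)


# Hypothetical subject, for illustration only.
total_items_4_to_8 = (
    score_thirst(True, False)
    + score_challenge(True, False)   # Verbal Inhibition
    + score_challenge(True, True)    # Body Immobility
    + score_posthypnotic(False)
    + score_selective_amnesia([1, 3, 4, 5, 6], [2])
)
print(total_items_4_to_8)
```

Exact fractions are used so that half points add without rounding error.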


Subjective Reports 


After the test suggestions were scored, all subjects 
were asked two questions orally: 


1. Subjective Involvement 

Which of these tests [test suggestions passed] 
did you actually feel? [The experimenter de- 
scribes the test suggestions that the subject had 
passed with either ½ or 1 point and asks, e.g., 
with respect to Test Suggestion 3, “Did you actu- 
ally feel that you couldn’t take your hands apart 
or did you keep your hands together in order 
to follow instructions or to please me?”] 

8 The Barber Suggestibility Scale was administered 
as described above to all subjects with the exception 
that, for the elementary school students, the follow- 
ing substitutions were made in Test Suggestion 3 
(Hand Lock) and Test Suggestion 5 (Verbal 
Inhibition): The word “stuck” was substituted for the 
words “welded” and “clamped”; the word “mouth” was 
substituted for the word “larynx”; and the phrase 
“stuck together” was substituted for the phrase “in a 
vise.” 

Scoring criterion: 1 point for each test sug- 
gestion passed which the subject claimed that he 
had “actually felt.” 

2. Verbalized Resistance 

Which of these eight tests did you try to 
resist? [The experimenter describes each of the 
test suggestions and asks, e.g., with respect to 
Test Suggestion 1, “Did you try to keep your 
arm from coming down?”] 

Scoring criterion: 1 point for each of the 
eight test suggestions which the subject said that 
he had attempted to resist. 


After these questions were answered, the subject 
was admonished not to discuss the experiment with 
the other subjects and was then dismissed. 


Retest 


Twelve subjects of age 7, 22 subjects of age 10, 
and 29 subjects of ages 18-22 were retested indi- 
vidually on the Barber Suggestibility Scale after 
an interval of 6 weeks. 


Analyses of Data 


To attain a minimum of 50 subjects in each 
subgroup, the scores of subjects of the following 
ages were combined: 6-7, 14-15, 16-17, and 18-22. 
Separate two-way analyses of variance for unequal 
subclasses were performed on the test suggestions 
and on the subjective reports by the approximation 
method presented by Walker and Lev (1953, pp. 381- 
382). Duncan’s (1957) multiple range test for unequal 
subclasses was used to compare the group means.⁴ 
Product-moment coefficients of correlation were 
computed between scores on the test suggestions 
(Objective scores) and Verbalized Resistance scores, 
between Subjective Involvement scores and Verbal- 
ized Resistance scores, and between test-retest scores. 
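
A product-moment (Pearson) coefficient of the kind described above can be computed as in the following sketch. The function name and the six pairs of scores are hypothetical and serve only to show the calculation; they are not data from this study.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares of deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical Objective scores and Verbalized Resistance scores.
objective = [7.0, 5.5, 6.0, 3.0, 2.5, 1.0]
resistance = [0, 1, 1, 4, 5, 7]
r = pearson_r(objective, resistance)
print(round(r, 2))
```

A negative r in data of this form would mean that subjects who report resisting more suggestions tend to obtain lower Objective scores.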


RESULTS AND DISCUSSION 
Sex Differences 


The analyses of variance, summarized in 
Table 1, indicate that the sexes did not differ 
significantly in response to the test sugges- 
tions and in subjective reports. Since no sex 
differences were found, the scores for males 
and females at each age level were combined 
giving the resultant means in Table 2. 


Age Differences 


The analyses of variance (Table 1) show 
highly significant differences between age 
groups on the test suggestions and on the 
subjective reports. 

⁴ The error mean squares used in the Duncan 
tests were computed from one-way analyses of 
variance performed on the test suggestions and the 
subjective reports. 

TABLE 1 
SUMMARY OF ANALYSES OF VARIANCE FOR TEST SUGGESTIONS AND SUBJECTIVE REPORTS 

                       Test Suggestions     Subjective Involvement     Verbalized Resistance a 
Source            df     MS       F            MS       F              df     MS       F 
Sex (A)            1    .325     1.72         .264     1.86             1    .634     2.73 
Age Groups (B)     9   1.56     10.76***     1.52     10.70***          8   1.607     6.93*** 
A × B              9    .19      1.31         .39      2.75**           8    .467     2.01* 
Error            704    .145                  .142                    651    .232 
Total            723                                                  668 

Note.—The Walker-Lev approximation method for unequal subclasses was used in these analyses. 
a The 6- and 7-year-olds are not included in the Verbalized Resistance analysis, since in many 
instances subjects at these age levels gave answers which indicated that they did not understand this question. 
** p < .01. 
*** p < .001. 

Objective scores. Duncan range test, ap- 
plied to the mean scores on the total eight 
test suggestions (Table 2, Column 4), showed 
the following: subjects between 6 and 12 
years of age showed significantly higher 
scores than subjects of 14 years and older; 
subjects of ages 8-10 obtained the highest 
scores; and the scores of subjects of 14 
years and above did not differ significantly 
from each other. These relationships can be 
observed visually by inspection of Objective 
scores in Figure 1 which show increasing 
responsiveness from ages 6-8, maximal re- 
sponse levels at ages 8-10, gradually de- 
creasing responsiveness from ages 10-14, and 
a stable plateau of response from ages 14 
and above. 

Table 3 presents the mean scores on each 
of the eight test suggestions. As can be 
determined by inspection of this table, the 
mean scores on Test Suggestions 1, 2, 4, 7, 
and 8 followed the overall curvilinear pattern, 
that is, the means tended to increase from 
ages 6-8, were high at around ages 9 and 10, 
and then tended to decrease gradually from 
ages 10-14. Test Suggestions 3, 5, and 6 did 
not follow this pattern in that subjects of 
ages 6-7 scored as high as subjects of 
ages 8-10. 

These results are in line with a series of 
previous reports, summarized by Weitzen- 
hoffer (1953, 1959) and Tromator (1961), 


TABLE 2 

MEAN SCORES, STANDARD DEVIATIONS, AND RESULTS OF DUNCAN TESTS APPLIED TO 
TEST SUGGESTIONS AND SUBJECTIVE REPORTS 

Age | Grade (school years) | Number of subjects: Male | Female | Total | Eight test suggestions: M | SD | Subjective Involvement: M | SD | Verbalized Resistance: M | SD 
6-7 1-2 35 20 55 4.545. 1.76 3.18. 1.78 — — 
8 3 43 36 79 5.36ab 1.92 4.20, 2.01 1.48, 2.48 
9 4 23 31 54 5.91, 1.70 5.35. 1.89 2.41y 2.89 
10 5 23 28 51 5.76. 1.94 5.16. 2.04 2.57» 3.13 
11 6 37 46 83 4.77bc 2.30 4.36, 2.34 3.18 ab 2.81 
12 7 47 40 87 4.61. 2.23 4.25, 2.28 3.37 a» 2.58 
13 8 44 31 75 4.24 2.36 3.895. 2.28 3.84.5 2.52 
14-15 9-10 28 31 59 3.564. 2.70 3.41. 2.55 3.305 2.73 
16-17 11-12 29 31 60 3.55ae 2.38 3.23ca 2.60 2.98.5. 2.76 
18-22 13-15 79 42 121 3.53. 2.28 2.604 2.28 3.50, 2.46 
Total 388 336 724 4.50 2.35 3.86 2.38 2.87 2.72 


Note.—Means in the same column which do not contain a common letter in the subscript differ significantly from each other 
at the .05 level by Duncan test. 


594 THEODORE XENOPHON BARBER AND DAVID SMITH CALVERLEY 

TABLE 3 
MEAN SCORES ON TEST SUGGESTIONS 

                                                            Age 
Test suggestion                  | 6-7  | 8    | 9    | 10   | 11   | 12   | 13   | 14-15 | 16-17 | 18-22 
1. Arm Lowering                  | .454 | .645 | .759 | .666 | .783 | .632 | .680 | .610  | .550  | .404 
2. Arm Levitation                | .254 | .392 | .518 | .509 | .469 | .448 | .426 | .338  | .416  | .338 
3. Hand Lock                     | .909 | .848 | .861 | .872 | .704 | .810 | .673 | .602  | .583  | .674 
4. Thirst “Hallucination”        | .636 | .727 | .898 | .814 | .819 | .655 | .640 | .576  | .500  | .558 
5. Verbal Inhibition             | .781 | .759 | .824 | .755 | .548 | .586 | .500 | .440  | .458  | .495 
6. Body Immobility               | .818 | .772 | .814 | .794 | .560 | .563 | .453 | .406  | .375  | .504 
7. “Posthypnotic-Like” Response  | .254 | .405 | .481 | .549 | .313 | .402 | .400 | .305  | .333  | .347 
8. Selective Amnesia             | .436 | .810 | .759 | .803 | .578 | .517 | .466 | .288  | .333  | .214 


which appeared to indicate that the sexes 
do not differ significantly in response to sug- 
gestions of the type traditionally associated 
with the word hypnosis and children are 
more responsive than adults to suggestions 
of this type. Two additional findings merit 
comment: 

1. Response to standardized test sugges- 
tions appears to be highest around ages 
8-10. Children at these age levels obtained 
mean scores above 5 points on the criterion 
suggestions. It should be noted, however, that 
adults, in general, also obtain mean scores 
above 5 points on the same test suggestions 
after they have received either explicit in- 
structions designed to produce positive moti- 
vation to perform maximally on the sug- 
gested tasks or a formal trance induction 
procedure consisting of eye fixation on the 
blinking light of a metronome and repetitive 
suggestions of relaxation, drowsiness, and 
sleep (Barber & Calverley, 1962, in press). 
It thus appears that the high level of 
“suggestibility” manifested under ordinary 
conditions by children of ages 8-10 can also 
be manifested by adults, provided that 
the adults have received either explicit 
motivating instructions or a formal trance 
induction. 

2. The “adult” level of response to the 
standardized test suggestions is apparently 
attained at approximately 14-15 years of 
age. Subjects of ages 14-15 did not differ in 
“suggestibility” from subjects of ages 18-22. 
The mean score on the standardized sug- 
gestions of the 14-15 age group was also 
comparable to the mean scores attained on 
the same test suggestions by college students 
and graduate students participating in a 
previous investigation (Barber & Glass, 
1962). 

Subjective Involvement. Duncan test ap- 
plied to the mean scores on Question 1 indi- 
cated that subjects of ages 8-12 showed 
higher Subjective Involvement scores (stating 
that they “actually felt” more of the sug- 
gested effects) than subjects of ages 14-22, 
and subjects of ages 9 and 10 showed the 
highest scores of all (see Table 2, Column 
5). As can be determined by inspection of 
Figure 1, subjects of ages 9-17 showed a 
close correspondence between their Objective 
scores and their Subjective Involvement scores 
but subjects at the youngest age levels (ages 
6-8) and at the oldest age levels (ages 18-22) 
showed a discrepancy between objectively 
scored responses and Subjective Involvement 
scores; the youngest and oldest subjects on 
the average passed one test suggestion which 
they said that they did not "actually feel" 
but “went along with to follow instructions 
or to please the experimenter.” The findings 
with respect to the youngest age group are 
in line with data obtained by London 
(1962b) indicating that children of ages 6-8 
tend to manifest less subjective involvement 
in the suggested effects than subjects of 
ages 9-16. The findings with respect to the 
oldest group are in line with recent studies 
(Barber & Calverley, 1962, in press) which 
found that college students of ages 18-22, 
who were given the same eight test sugges- 
tions in the same way as in the present 
experiment, showed a discrepancy of about 
1 point between objectively scored responses 
and Subjective Involvement scores. 

Table 1 shows a significant Sex × Age 
Groups interaction on Subjective Involve- 
ment. A clear pattern was not discerned in 
this interaction: At age 10 the boys stated 
that they “felt” more of the suggested effects 
than the girls; at ages 12, 14-15, and 18-22, 
the girls stated that they “felt” more of 
the suggested effects than the boys. 

Verbalized Resistance. Duncan range test 
applied to the mean scores on Question 2 
indicated that the youngest subjects (ages 
8-10) stated that they “tried to resist” a 
significantly smaller number of the test sug- 
gestions than the oldest subjects (ages 18-22) 
(see Table 2, Column 6). Table 4 indicates 
that: all age groups showed highly signifi- 
cant negative correlations ranging from —.36 
to —.83 and averaging —.54 between Objec- 
tive scores and Verbalized Resistance scores; 
and all age groups showed highly significant 
negative correlations ranging from —.31 to 
—.66 and averaging —.48 between Subjective 
Involvement scores and Verbalized Resistance 
scores. Fisher z transformations showed sig- 
nificant differences (p< .02) between age 
groups on both sets of correlations. 

There was a significant Sex × Age Groups 
interaction on Verbalized Resistance (see 
Table 1). This interaction did not appear 
amenable to a clear interpretation: At ages 
16-17, the girls stated that they “tried to 
resist” more of the test suggestions than 
the boys; at ages 12, 13, 14-15, and 18-22, 
the boys stated that they “tried to resist” 
more than the girls. 

TABLE 4 
PEARSON CORRELATION COEFFICIENTS 

Age     | N   | Objective scores-Verbalized Resistance | Subjective Involvement-Verbalized Resistance 
8       | 79  | —.39 | —.33 
9       | 54  | —.44 | —.60 
10      | 51  | —.68 | —.66 
11      | 83  | —.68 | —.63 
12      | 87  | —.42 | —.47 
13      | 75  | —.36 | —.34 
14-15   | 59  | —.83 | —.59 
16-17   | 60  | —.72 | —.52 
18-22   | 121 | —.37 | —.31 
Average | 669 | —.54 | —.48 

Note.—All correlations are significant beyond the .01 level. 

The major findings with respect to sub- 
jective reports can be summarized as follows: 

1. In general, a close correspondence was 
found between objectively scored responses 
to the test suggestions and Subjective In- 
volvement scores; however, subjects in the 
youngest age group (ages 6-8) and in the 
oldest age group (ages 18-22) showed a 
discrepancy between their Objective scores 
and Subjective Involvement scores, stating 
that they “went along with” but did not 
“actually feel” some of the test suggestions 
which they passed. 

2. To the extent that reliance can be 
placed on subjective reports, it appears that 
some of the variance in “suggestibility” 
can be accounted for by differences in the 
degree to which the various age groups tried 
to resist the suggested effects; the less the 
subjects tried to resist, the higher their 
Objective scores on the test suggestions and 
the higher their subjective involvement in 
the suggested effects. 


Retest Findings 


The three groups retested on the Barber 
Suggestibility Scale after a 6-week interval 
(12 subjects of age 7, 22 subjects of age 10, 
and 29 subjects of ages 18-22) showed 
significant correlations of .62, .66, and .82, 
respectively, between test-retest scores. Al- 
though there appeared to be a trend toward 
greater reliability in response on the part of 
the older subjects, Fisher z transformation 
indicated that the three correlation coef- 
ficients did not differ significantly (χ² = 2.03, 
df = 2, p > .30). The three groups main- 
tained their relative ranks on the retest: 
subjects of age 10 showed the highest mean 
score (5.18); subjects of age 7 showed the 
next highest mean score (4.58) and the older 
subjects of ages 18-22 showed the smallest 
mean score (3.26). 
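
The article reports only the outcome of the Fisher z comparison, not the formula. The sketch below assumes the usual homogeneity test for independent correlations: each r is converted to z = atanh(r), weighted by n − 3, and the weighted sum of squared deviations from the weighted mean z is referred to χ² with k − 1 degrees of freedom. The function name is hypothetical; the correlations and group sizes are those given in the text.

```python
import math


def fisher_z_homogeneity(rs, ns):
    """Chi-square test that k independent correlations are equal.

    Assumed form: chi2 = sum((n_i - 3) * (z_i - z_bar)**2), df = k - 1,
    where z_i = atanh(r_i) and z_bar is the (n_i - 3)-weighted mean.
    """
    zs = [math.atanh(r) for r in rs]
    weights = [n - 3 for n in ns]
    z_bar = sum(w * z for w, z in zip(weights, zs)) / sum(weights)
    chi2 = sum(w * (z - z_bar) ** 2 for w, z in zip(weights, zs))
    return chi2, len(rs) - 1


# Test-retest correlations and group sizes reported above.
chi2, df = fisher_z_homogeneity([0.62, 0.66, 0.82], [12, 22, 29])

# For df = 2 the upper-tail chi-square probability is exactly exp(-chi2 / 2).
p = math.exp(-chi2 / 2)
print(round(chi2, 2), df, round(p, 2))
```

Under these assumptions the computation gives a χ² of about 2.03 with 2 df, in agreement with the reported value and with p > .30.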


Limitations of the Investigation and Sugges- 
tions for Further Research 


The subjects in this investigation were 
white Americans of middle-class background. 
Whether the findings apply to other cultures 
and to other subgroups in the American 
culture can be determined only by further 
research. 

The experimenter defined the procedure to 
the subjects (and to the teachers and school 
administrators) as a "test of imagination." 
To what extent the children and adult sub- 
jects defined the procedure to themselves as 
hypnosis and to what extent their private 
definition of the situation influenced their 
responses is not known. Further studies are 
needed to clarify this point. 

A formal trance induction procedure was 
not used in this investigation. Further, the 
experimenter did not attempt to form a 
friendly or close relationship with the sub- 
jects and did not attempt to motivate the 
subjects to perform maximally on the sug- 
gested tasks. These variables should be 
manipulated experimentally in further re- 
search. Three considerations are pertinent: 

1. Recent experiments (Barber & Calverley, 
1962, in press; Barber & Glass, 1962; 
Weitzenhoffer & Sjoberg, 1961) demonstrate 
that a standardized trance induction pro- 
cedure which includes eye fixation and 
repetitive suggestions of relaxation, drowsi- 
ness, and sleep is effective in enhancing 
suggestibility in adults. Does such a trance 
induction procedure also produce heightened 
response to test suggestions in children? An 
affirmative answer to this question cannot be 
assumed; children may differ from adults in 
that they show no change in suggestibility 
after a formal trance induction. 

2. Recent experiments (Barber & Calverley, 
1962, in press) also demonstrate that adults 
show enhanced response to test suggestions 
when given explicit task motivating instruc- 
tions. It appears likely that suggestibility in 
children can also be enhanced by instructions 
designed to produce positive motivation to 
perform well on suggested tasks. However, it 
cannot be assumed that the same instruc- 
tions which are task motivating for adults 
are also task motivating for children. Fur- 
ther studies should determine what type of 
instructions are effective in motivating chil- 
dren to perform optimally on test suggestions. 

3. There is evidence to indicate that sug- 
gestibility in adults is affected by the nature 
of the interpersonal relationship between the 
subject and the experimenter (Barber, 1957b, 
1958, 1961, 1962a, 1962b, 1962c, 1963, in 
press). It appears likely that children also 
show variations in suggestibility when tested 
by experimenters with whom they do and do 
not have a close relationship. Carefully con- 
trolled studies are needed to test this point. 


REFERENCES 


Barber, T. X. Comparison of suggestibility during 
“light sleep” and hypnosis. Science, 1956, 124, 
405. (a) 

Barber, T. X. A note on “hypnotizability” and 
personality traits. J. clin. exp. Hypn., 1956, 4, 
109-114. (b) 

Barber, T. X. “Sleep” and “hypnosis”: A reap- 
praisal. J. clin. exp. Hypn., 1956, 4, 141-159. (c) 

Barber, T. X. Experiments in hypnosis. Scient. 
American, 1957, 196, 54-61. (a) 

Barber, T. X. Hypnosis as perceptual-cognitive 
restructuring: III. From somnambulism to auto- 
hypnosis. J. Psychol., 1957, 44, 299-304. (b) 

Barber, T. X. The concept of “hypnosis.” J. 
Psychol., 1958, 45, 115-131. 

Barber, T. X. The necessary and sufficient condi- 
tions for hypnotic behavior. Amer. J. clin. Hypn., 
1960, 3, 31-42. 

Barber, T. X. Antisocial and criminal acts induced 
by “hypnosis”: A review of clinical and experi- 
mental findings. Arch. gen. Psychiat., 1961, 5, 
301-312. 

Barber, T. X. Experimental controls and the 
phenomena of “hypnosis”: A critique of hypnotic 
research methodology. J. nerv. ment. Dis., 1962, 
134, 493-505. (a) 

Barber, T. X. Toward a theory of hypnosis: Post- 
hypnotic behavior. Arch. gen. Psychiat., 1962, 
7, 321-342. (b) 

Barber, T. X. Toward a theory of “hypnotic” be- 
havior: The “hypnotically induced dream.” J. 
nerv. ment. Dis., 1962, 135, 206-221. (c) 

Barber, T. X. A theory of “hypnotic” behavior. 
Medfield, Mass.: Medfield Foundation, 1963. 
(Mimeo) 

Barber, T. X. The effects of “hypnosis” on pain: 
A critical review of experimental and clinical 
findings. Psychosom. Med., in press. 

Barber, T. X., & Calverley, D. S. “Hypnotic 
behavior” as a function of task motivation. J. 
Psychol., 1962, 54, 363-389. 

Barber, T. X., & Calverley, D. S. Toward a 
theory of hypnotic behavior: Personality vari- 
ables, attitudinal-motivational variables, and hyp- 
notizability. Medfield, Mass.: Medfield Founda- 
tion, 1963. (Mimeo) 

Barber, T. X., & Calverley, D. S. The relative 
effectiveness of task motivating instructions and 
trance induction procedure in the production of 
“hypnotic-like” behaviors. J. nerv. ment. Dis., 
in press. 

Barber, T. X., & Glass, L. B. Significant factors in 
hypnotic behavior. J. abnorm. soc. Psychol., 1962, 
64, 222-228. 

Barber, T. X., & Hahn, K. W., Jr. Physiological 
and subjective responses to pain producing 
stimulation under hypnotically-suggested and 
waking-imagined “analgesia.” J. abnorm. soc. 
Psychol., 1962, 65, 411-418. 

Barber, T. X., & Hahn, K. W., Jr. “Hypnotic in- 
duction” and “relaxation”: An experimental study. 
Arch. gen. Psychiat., 1963, 8, 295-300. 

Duncan, D. B. Multiple range tests for correlated 
and heteroscedastic means. Biometrics, 1957, 13, 
164-176. 

Glass, L. B., & Barber, T. X. A note on hypnotic 
behavior, the definition of the situation and the 
placebo effect. J. nerv. ment. Dis., 1961, 132, 
539-541. 

London, P. The Children's Hypnotic Susceptibility 
Scale. Urbana: University of Illinois, Department 
of Psychology, 1962. (Mimeo) (a) 

London, P. Hypnosis in children: An experimental 
approach. Int. J. clin. exp. Hypn., 1962, 10, 79- 
91. (b) 

Stukát, K.-G. Suggestibility: A factorial and ex- 
perimental analysis. Stockholm: Almqvist & 
Wiksell, 1958. 

Tromator, Florence T. Some developmental cor- 
relates of hypnotic susceptibility. Unpublished 
master's thesis, University of Illinois, 1961. 

Walker, Helen M., & Lev, J. Statistical inference. 
New York: Holt, 1953. 

Weitzenhoffer, A. M. Hypnotism: An objective 
study in suggestibility. New York: Wiley, 1953. 

Weitzenhoffer, A. M. A bibliography of hypnotism 
in pediatrics. Amer. J. clin. Hypn., 1959, 2, 92-95. 

Weitzenhoffer, A. M., & Sjoberg, B. M., Jr. 
Suggestibility with and without “induction of 
hypnosis.” J. nerv. ment. Dis., 1961, 132, 204-220. 


(Early publication received November 23, 1962) 


Journal of Abnormal and Social Psychology 
1963, Vol. 66, No. 6, 598-600 


CRITIQUE AND NOTES 


PERSONALITY SIMILARITY AND FRIENDSHIP: 
A FOLLOW-UP STUDY ¹ 


CARROLL E. IZARD 


Vanderbilt University 


This study repeated the procedures of an earlier study (Izard, 1960b) that 
showed similarity of objectively measured (Edwards Personal Preference 
Schedule) personality characteristics to be a significant correlate of friend- 
ship. Experiment I tested the repeatability of this finding with a similar 
sample (college freshmen) and the results again supported the similarity 
principle. Experiment II tested the generalizability of the earlier finding. The 
procedures were applied to a different population (college seniors), and the 
results failed to confirm the similarity hypothesis. Personality differences be- 
tween freshmen and seniors can be viewed as a function of increased social 
and emotional maturity on the part of seniors (Izard, 1962). Perhaps the 
more “mature” person has less need to see his personality characteristics 
reflected in his friends. 


A number of investigators have been con- 
cerned with the role of personality similarity in 
the formation of interpersonal affective ties— 
more particularly in the development of friend- 
ship. Several studies supported in varying degrees 
the hypothesis that similarity was a determinant 
of friendship or interpersonal attraction (Beier, 
Rossi, & Garfield, 1961; Izard, 1960b, 1960c; 
Morton, 1959; Reilly, Commins, & Stefic, 1960). 
However, the findings of these studies were far 
from consistent with respect to all variables 
considered, and in the second and fifth studies 
cited different results were obtained with the 
same measure of personality characteristics. 
This suggests that there may be situational, de- 
velopmental, or interpersonal variables in inter- 
personal attraction that are peculiar to the par- 
ticular populations sampled. 

The present study had two aims. The first was 
to repeat one of the earlier studies (Izard, 
1960b) to see if the shaky similarity hypothesis 
could be shored up by demonstrating repeat- 
ability of supporting data (Experiment I). The 
second was to repeat the study in a different 
population, testing the generalizability of the 
earlier findings (Experiment II). 

¹ This study was supported by Vanderbilt-ONR 
Contract Nonr 2149(03), C. E. Izard, principal 
investigator. The opinions and conclusions do not 
necessarily reflect those of the United States Depart- 
ment of the Navy. 

The change in populations for the test of 
generalizability consisted in shifting from high 
school seniors or beginning college freshmen 
to second semester college seniors. Since it is 
known (Izard, 1961) that personality character- 
istics as measured in these studies change sig- 
nificantly during college years, it was felt that 
a sample of older subjects from the same gen- 
eral socioeconomic group might throw additional 
light on the problem of interpersonal attraction 
and friendship. However, the developmental 
factor cannot be viewed as the only one that 
might account for a change in similarity within 
dyads. Homogeneity with respect to race, sex, 
and social class was not a significant factor in 
dyad similarity among underclassmen from dif- 
ferent colleges (Izard, 1961c), but there is the 
possibility that increasing homogeneity over 
time within a given college group might result 
in increased personality similarity among all 
possible pairs, regardless of friendship. It is 
certainly reasonable to expect homogeneity of 
student groups to increase with time and with 
progress through the 4-year curriculum. The case 
is even stronger for professional schools, such 
as engineering where students are known to 
differ from the general population of arts and 
science students on several of the personality 
characteristics with which we are concerned 
(Izard, 1960a). For all college groups at the 
senior level attrition has certainly narrowed the 
original freshman group on variables relating 
to educational status and academic achievement. 
There probably are numerous other environ- 
mental press operating to increase conformity 
and similarity in a broad sense. However, in- 
creased personality profile similarity will result 
only with increased homogeneity of the variances 
of the personality characteristics being studied. 


PROCEDURE 


To obtain subjects for Experiment I, 323 pledges 
were asked to list their three closest personal friends. 
They were also asked to list the three people whom 
they felt were least likely to become their close 
personal friends. From these data it was possible 
to set up 19 pairs of subjects who had mutually 
chosen each other as friends and 17 pairs of sub- 
jects who had placed each other in the “least likely” 
category. After the 36 pairs of friends and rejects 
were eliminated from the pool of subjects, a control 
group of 30 pairs was drawn at random. As in the 
earlier study, personality was measured by Edwards’ 
Personal Preference Schedule (EPPS). The EPPS 
was administered at the time the students entered 
college and about 4 months prior to obtaining the 
friendship data. 

To obtain the data for Experiment II, the friend- 
ship rating form and the EPPS were administered 
during class periods to the graduating seniors in the 
school of engineering. From the test data it was 
possible to assemble 20 pairs of friends among the 
engineering seniors. As in the previous study, a 
control group was established by randomly pairing 
subjects who had not named each other on the 
friendship rating form. The analysis of variance for 
comparing profiles of pairs (Izard, 1960b) was 
applied to the EPPS data of Experiment I and 
Experiment II. 

RESULTS 

Table 1 presents the analysis of variance 
comparing the EPPS profiles of the pairs of 
friends and the pairs drawn at random from the 
freshman sample of Experiment I. The analysis 
indicated that the variance between pairs of 
friends (Variables X Pairs) was significantly 
greater than the variance within pairs of friends ? 


? The formula for sum of squares for Variables 
X Subjects (Izard, 1960b) can be applied to the 


TABLE 1 


ANALYSIS OF VARIANCE COMPARING EPPS PROFILES 
OF FRIEND AND RANDOM PAIRS DRAWN FROM 
FRATERNITY PLEDGES (Freshmen) 


Source                             df       MS        F 

Between variables                  14     326.71 
Variables X Groups                 14      18.21 
Variables X Pairs (Friends)       252      22.43     1.243* 
Variables X Pairs (Random)        406      23.07     0.906 
Variables X Subjects (Friends)    266      17.80 
Variables X Subjects (Random)     420      25.45 


*p = .05. 


TABLE 2 


ANALYSIS OF VARIANCE COMPARING EPPS PROFILES 
OF FRIEND AND RANDOM PAIRS DRAWN FROM 
SCHOOL OF ENGINEERING SENIORS 


Source                             df       MS        F 

Between variables                  14     620.43 
Variables X Groups                 14      14.56 
Variables X Pairs (Friends)       280      24.00      .988 
Variables X Pairs (Random)        406      25.40     1.098 
Variables X Subjects (Friends)    294      24.28 
Variables X Subjects (Random)     420      23.14 


(Variables X Subjects, p < .05). This confirms 
the earlier study and supports the similarity 
thesis. The comparable F ratio for the random 
pairs was less than 1. In addition, a separate 
analysis was run for the sociometric rejects: 


$$F = \frac{23.39\ (\text{Variables} \times \text{Pairs})}{21.84\ (\text{Variables} \times \text{Ss})} = 1.070, \quad p > .05.$$


Table 2 presents the analysis of variance comparing 
the EPPS profiles of friend and random 
pairs drawn from the engineering seniors. This 
analysis indicated that the variance between 
pairs of friends was no greater than the variance 
within pairs of friends. This fails to support the 
similarity principle. The F ratio comparing the 
between and within variances for random pairs 
was also insignificant. 
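
Each F ratio reported in this section is the between-pairs mean square (Variables X Pairs) divided by the corresponding within-pairs mean square (Variables X Subjects). A minimal sketch of that arithmetic follows, using several of the mean squares printed in the tables and text; the function and dictionary names are illustrative assumptions.

```python
def f_ratio(ms_between_pairs, ms_within_pairs):
    """F = MS(Variables x Pairs) / MS(Variables x Subjects)."""
    return ms_between_pairs / ms_within_pairs


# (between-pairs MS, within-pairs MS) as printed in the report.
comparisons = {
    "freshman random pairs": (23.07, 25.45),
    "senior friend pairs": (24.00, 24.28),
    "senior random pairs": (25.40, 23.14),
    "sociometric rejects": (23.39, 21.84),
}

ratios = {name: f_ratio(*ms) for name, ms in comparisons.items()}
for name, f in ratios.items():
    print(f"{name}: F = {f:.3f}")
```

A ratio near or below 1 means that members of a pair differ from each other about as much as profiles differ between pairs, which is the pattern expected when pairing carries no similarity.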


Discussion 


This study repeated the procedures of an 
earlier study (Izard, 1960b) that showed 
similarity of objectively measured (EPPS) per- 
sonality characteristics to be a significant cor- 
relate of friendship. Experiment I was a test of 
the repeatability of the findings of the previous 
study. The procedures of the earlier study were 
applied to a similar sample and the findings again 
supported the similarity principle. However, it 
should be noted that the contrast between the 
F ratio for friend and nonfriend pairs is not 
great, and the former barely reached significance 
at the .05 level. The support for the similarity 
hypothesis is fairly weak. 

Experiment II was a test of the generalizability 
of the finding of the earlier study. In Experiment II 
the procedures were applied to a different 
population, and the results failed to confirm 
the hypothesis that similarity is a significant 
determinant of friendship. 


data of a single pair to obtain an index of profile 
similarity that is numerically one-half the similarity 
score described by Cronbach and Gleser (1953) as 
$D^2$ and discussed by Cronbach (1958) as AD*. 

Although it may be difficult to select among 
alternative explanations of the failure of Experi- 
ment II to support the similarity thesis, the 
shift in populations might well be expected to 
have an effect on the results. The populations 
yielding the positive and negative results differ 
substantially in one important respect. College 
seniors are known to differ considerably from 
freshmen in personality profiles as measured by 
the EPPS. This difference can be reasonably 
viewed (Izard, 1962) as a function of increased 
social and emotional maturity on the part of 
seniors. It might be argued that as a result of 
this, personality similarity becomes less im- 
portant as a determinant of interpersonal at- 
traction. Perhaps the more “mature” person has 
less need to see his personality characteristics 
reflected in his friends. 

The notion that the seniors might be more 
homogeneous with respect to EPPS personality 
variables appears untenable. The estimates of 
the population variance for pairs of seniors 
(Table 2, MS column) are somewhat larger than 
those for freshmen (Table 1, MS column). 

Regardless of the cause, the failure of Ex- 
periment II to repeat the finding of Experiment I 
and the earlier study restricts the generalizability 
of the conclusion that similarity of objectively 
measured personality characteristics is a significant 
correlate of friendship. 


ELECTRODERMAL REACTIVITY AND RATED BEHAVIOR IN 
EMOTIONALLY DISTURBED CHILDREN 1 


MALCOLM M. HELPER, SOL L. GARFIELD, AND ROBERT C. WILCOTT 2 
Nebraska Psychiatric Institute 


A group of 34 emotionally disturbed children was subdivided according to 2 
measures of electrodermal reactivity and the resulting subgroups were compared 
on 18 rating scales. The High reactors showed less appropriate affectivity, 
and tended to be less active motorically. It was concluded that 
Jones’ concept of internalization should be redefined to exclude the implication 
that high autonomic reactivity is typically accompanied by socially adjustive 
behavior. 


1 This research was supported by Research Grant 
MY 2318 from the National Institute of Mental 
Health, United States Public Health Service. 

2 Now at Western Reserve University. 


On the basis of extensive work with the GSR, 
Jones (1950, 1960) concludes that high autonomic 
reactivity is associated with “internalization” 
of emotional processes. As correlates of 
high reactivity Jones reports motor restraint, 
stable mood, responsible social behavior, and 
freedom from attention getting maneuvers, which 
he attributes to the “sustaining influence of 
inner emotional responses [Jones, 1960, p. 17].” 
From this generally favorable picture of the 
“high reactive," Jones concludes that increasing 
autonomic reactivity may be a fairly basic con- 
comitant of successful socialization. However, 
Block (1957) reports that high GSR reactivity 
in college subjects is associated with passivity, 
dependency on others for approval, overconcern 
about ethical problems, somatic anxiety symp- 
toms, and dreaminess, whereas the low reactors 
were found to be more independent, skeptical, 
practical, realistic, and opportunistic. He con- 
cluded that GSR reactivity should be regarded as 
an index of emotional responsiveness and not as a 
predictor of whether the individual will use 
“internal” or “external” channels to discharge 
the tension after it is aroused. 

The two investigators thus seem to disagree 
empirically as to the social and personal cor- 
relates of GSR reactivity, and to disagree theo- 
retically as to the mechanism underlying the 
correlations that do exist. All the data sum- 
marized seem to have been collected from emo- 
tionally normal subjects. Data from disturbed 
subjects may aid in clarifying the empirical 
picture and conceivably could help choose be- 
tween the theoretical positions. 

The present note reports on the rated behavior 
of children in a psychiatric hospital who showed 
high and low reactivity on two electrodermal 
variables. 

METHOD 


As part of a study of chlorpromazine, behavioral 
ratings and skin conductance measures were ob- 
tained on 34 children hospitalized for emotional dis- 
turbance. All data to be reported were obtained 
prior to administration of any tranquilizing medica- 
tion. The subjects represented all of the cases ad- 
mitted to the children’s ward of the Nebraska Psy- 
chiatric Institute between June 1958, and June 1961, 
who met the following criteria: between 6 and 13 
years of age, sufficiently cooperative to participate in 
test procedures, not physically ill, no marked depres- 
sive reaction or active convulsive disorder. The re- 
sulting group ranged in age from 6 years and 9 
months to 13 years and 2 months of age, and had 
the following diagnoses: adjustment reaction of child- 
hood, 14; psychoneurotic disorders, 11; psychotic re- 
action, 7; personality trait disturbance, 2. 


GSR Reactivity Measurement 


Subjects were seen individually in the psycho- 
physiological laboratory about 1 month after admis- 
sion. In addition to skin conductance, electromyo- 
graphically recorded tension of the frontalis muscles 
and heart rate were simultaneously obtained with 
each subject. Only the skin conductance data are reported 
here. During the recordings the subject was 
lying on a bed in a darkened, sound insulated room, 
in which one investigator (MMH) was present. Skin 
conductance was recorded with 1-inch diameter rubber 
cups with zinc plate mounted on top. A zinc 
sulfate-bentonite paste was used. The electrodes were 
mounted in the center of the palm of each hand. 
Conductance was recorded continuously by a Wheat- 
stone bridge feeding into the power amplifier of an 
Offner Type D EEG machine. The recording period 
was divided into three 5-minute segments: a quiet 
period in which the subject was asked to relax as 
much as possible; an interview period in which the 
subject was asked a standard series of emotionally 
loaded questions about himself by the investigator 
who was in the room with him; and a final quiet 
period, in which the subject was again asked to 
relax as much as possible. Data from the last period 
were not used in this study. 

The stimulus material for the interview period consisted 
basically of 15 adjectives: WEAK, SAD, DIRTY, 
DUMB, UGLY, ANGRY, AFRAID, GIRLISH,3 LOVABLE, LOUD, 
NAUGHTY, WILD, DISHONEST, CRUEL, LAZY. It can be 
seen that all of these adjectives, with the exception 
of LOVABLE, have negative connotations, and might 
thus constitute something of a threat. The subject 
was asked orally how much each word was charac- 
teristic of him and was supplied with the response 
alternatives of “a lot,” “a little bit,” or “not at all.” 

A dry run of the recording procedure was carried 
out with each subject about a week before the actual 
testing was done, so that the recording situation it- 
self would not arouse anxiety simply because of its 
novelty. 

Two measures of changes in skin conductance dur- 
ing the recording session were used. For the first 
measure GSRs were counted in both the initial quiet 
period and the interview period. A GSR was arbi- 
trarily defined as a drop in resistance of at least 
2000 ohms occurring over an interval of not less 
than .6 second nor more than 2 seconds. The 2000-ohm 
criterion was adopted to aid reliability of judgment. 
Decreases of this size occurring in less than .6 
second were excluded to prevent the inclusion of 
artifacts, while those requiring longer than 2 seconds 
were judged to be hard to discriminate from base- 
line shifts. Only one GSR was counted for any given 
6-second interval. The first 30 seconds of both 5- 
minute periods were disregarded because of the rapid 
shifts in baseline that were occurring at these times. 
The final GSR reactivity measure consisted of the 
difference between the number of GSRs in the inter- 
view period and that in the quiet period. 
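
The scoring rules in the paragraph above can be expressed as a short sketch. Several details are assumptions made for illustration: candidate resistance drops are taken as already detected and given as (onset, duration, size) tuples, and "one GSR for any given 6-second interval" is read as fixed, non-overlapping 6-second bins. The function names are hypothetical.

```python
def count_gsrs(drops, skip_s=30.0, bin_s=6.0):
    """Count GSRs among candidate resistance drops in one 5-minute period.

    `drops` is a list of (onset_s, duration_s, drop_ohms) tuples, a
    hypothetical pre-detected format. Fixed 6-second bins are an assumed
    reading of the one-GSR-per-interval rule.
    """
    counted_bins = set()
    for onset_s, duration_s, drop_ohms in drops:
        if onset_s < skip_s:
            continue  # first 30 seconds of each period are disregarded
        if drop_ohms < 2000:
            continue  # below the 2000-ohm criterion
        if not (0.6 <= duration_s <= 2.0):
            continue  # too fast (possible artifact) or too slow (baseline shift)
        counted_bins.add(int(onset_s // bin_s))
    return len(counted_bins)


def gsr_reactivity(quiet_drops, interview_drops):
    """Reactivity score: interview-period GSRs minus quiet-period GSRs."""
    return count_gsrs(interview_drops) - count_gsrs(quiet_drops)


# Toy records, not data from the study.
quiet = [(12.0, 1.0, 2500), (45.0, 1.2, 2100), (47.0, 0.8, 3000)]
interview = [(40.0, 0.4, 4000), (62.0, 1.5, 2600), (90.0, 1.9, 2000),
             (130.0, 2.5, 5000), (200.0, 0.7, 2200)]

score = gsr_reactivity(quiet, interview)
print("GSR reactivity score:", score)
```

In the toy data the two quiet-period drops at 45 and 47 seconds fall in the same bin and count once, so the quiet period contributes one GSR and the interview period three.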

This scoring procedure is more like that of Block, 
who used visual judgments of number of GSRs dur- 
ing a lie detection situation, than it is like that of 
Jones, who used average magnitude of GSR to dis- 
crete stimuli. While the GSRs during the interview 
period in the present study could not be related to 
individual stimuli, as Jones’ were, the subtraction of 
the quiet period GSRs had the effect of making the 
resulting score specific to the interview period as a 
whole. This difference score was found to correlate 
.81 with the interview GSRs alone. Changes in number 
of GSRs among the 34 subjects ranged from an 
increase of 14 to a decrease of 4, with exactly half 
of the subjects showing increases and half showing 
decreases or no change. 


3 The term BOYISH was used with female subjects. 

A second measure dealt with overall changes in 
basal conductance during the interview periods. 
Changes in basal conductance have been shown to 
be related to stress and to performance (Woodworth 
& Schlosberg, 1954), and it seems justifiable to as- 
sume that the longer term shifts reflect aspects of 
autonomic functioning that may be as significant for 
personality as the shorter term GSRs. The measure 
used in this study was the percentage of change in 
average skin conductance from the last minute of 
the quiet period to the last minute of the interview 
period. Average skin conductance was determined 
from 10 conductance measurements evenly spaced 
through the minute in question. Only the last minute 
of each period was used because it was thought that 
the effects of the two conditions, especially of quiet, 
would become most stable during this time. Changes 
in this conductance measure ranged from an increase 
of 213% to a decrease of 29%, with all but four subjects 
showing increases. The correlation between the 
two electrodermal measures (change in GSRs and 
change in conductance) was found to be +.34, indicating 
little overlap between them. 
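
The basal conductance measure can be illustrated as follows. This is a sketch under stated assumptions: the readings are invented, the unit is arbitrary, and the function name is hypothetical. Only the rule itself (percentage change between the means of 10 evenly spaced readings from the last minute of each period) comes from the text.

```python
from statistics import mean


def percent_conductance_change(quiet_last_minute, interview_last_minute):
    """Percentage change in mean conductance, quiet period to interview period.

    Each argument holds the 10 evenly spaced readings taken during the
    last minute of the period in question.
    """
    baseline = mean(quiet_last_minute)
    return 100.0 * (mean(interview_last_minute) - baseline) / baseline


# Toy readings in arbitrary conductance units, not data from the study.
quiet = [10.0, 10.2, 9.8, 10.1, 9.9, 10.0, 10.0, 10.3, 9.7, 10.0]
interview = [12.5, 12.4, 12.6, 12.5, 12.5, 12.3, 12.7, 12.5, 12.5, 12.5]

change = percent_conductance_change(quiet, interview)
print(f"basal conductance change: {change:.1f}%")
```

Expressing the shift as a percentage of the quiet-period level, rather than as a raw difference, makes scores comparable across subjects whose resting conductance differs.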


Behavioral Ratings 


As part of the study mentioned above, 18 five- 
point rating scales were developed to cover various 
aspects of ward behavior observable by nurses and 
aides. Each point was defined by one or more 
phrases. For example, the Activity-Inactivity scale 
was defined as follows: 


1. Extremely nonactive, never initiates activity, 
stays in one place 

2. Rarely active, occasionally institutes activity, 
usually inactive 

3. Neither outstandingly active or passive 

4. Quite active, initiates most of his activities, 
infrequently still 

5. Overactive, on the go practically all the time 


The titles of the other scales were: Anxiety Level, 
Verbal Behavior, Physical Aggression, Verbal Hos- 
tility, Mood, Affectivity, Personal Appearance, So- 
cialization, Cooperation, Orientation to Environment 
and Others, Reactions to Injury, Eating Habits, Abil- 
ity to Dress Self, Toilet Habits, Responsiveness and 
Alertness, Likeability, Appropriateness of Affect. 
Each child was rated on each scale by from 6 to 12 
raters. Reliability was evaluated by splitting the pool 
of raters into two groups and correlating the mean 
ratings on each item for 35 subjects. Reliability co- 
efficients for the 18 scores, after Spearman-Brown 
correction, ranged from .67 to .95 with a median of 
teh For the Activity-Inactivity scale, the value was 
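
The reliability procedure just described (correlate the mean ratings from two halves of the rater pool, then apply the Spearman-Brown correction) can be sketched as below. The ratings are invented and the function names are assumptions; the correction shown is the standard two-half form, r' = 2r / (1 + r).

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Estimated reliability of the full rater pool from a split-half r."""
    return 2.0 * r_half / (1.0 + r_half)


# Toy mean ratings of five children by two halves of the rater pool.
half_a = [1, 2, 3, 4, 5]
half_b = [2, 1, 4, 3, 5]

r_half = pearson_r(half_a, half_b)
r_full = spearman_brown(r_half)
print(f"split-half r = {r_half:.2f}, corrected = {r_full:.3f}")
```

The correction is needed because each half-pool mean is based on only half the raters, so the raw split-half correlation understates the reliability of the mean rating from the whole pool.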


The ratings to be reported were collected within 
the same week in which the GSR measures were 
made. Ratings from all raters were combined to yield 
a mean value for each subject on each variable. The 
personnel making the ratings had no knowledge of 
GSR results, and the GSR measurements were made 
before the ratings were known to the experimenters. 


RESULTS 


For both reactivity variables, the 17 high 
scorers were compared with the 17 low scorers 
on the 18 behavioral ratings, resulting in a total 
of 36 t tests. Of these comparisons, three were 
significant at the .05 level. These included the 
two involving Appropriateness of Affect, and one 
comparison involving Activity Level. While the 
total number of significant differences is not much 
larger than would be expected by chance, the 
consistency of the findings for Appropriateness of 
Affect and the obvious relevance of Activity Level 
for Jones’ theoretical position make the results 
seem meaningful. Table 1 contains the mean ratings 
of the reactivity subgroups on these two 
variables. 


TABLE 1 


MEAN BEHAVIOR RATINGS FOR SUBJECTS ABOVE AND 
BELOW THE MEDIAN ON TWO MEASURES OF 
ELECTRICAL REACTIVITY OF THE SKIN 


                                                  Basal conductance 
                               GSR increase            increase 
Rating variable                Low      High       Low      High 

Activity Level                 3.6      3.3        3.7      3.2* 
Appropriateness of Affect      4.5      4.1*       4.5      4.1* 

*p < .05. 
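
The remark that three significant results out of 36 tests is "not much larger than would be expected by chance" can be checked with a short calculation. The sketch assumes the 36 tests are independent, which they are not (the rating scales are intercorrelated and the two reactivity measures correlate +.34), so the figures are only a rough guide. The function name is hypothetical.

```python
from math import comb


def prob_at_least(k, n, alpha):
    """Binomial probability of k or more 'significant' results in n tests."""
    return sum(comb(n, i) * alpha ** i * (1 - alpha) ** (n - i)
               for i in range(k, n + 1))


n_tests, alpha = 36, 0.05
expected = n_tests * alpha            # expected number of chance findings
p_three_or_more = prob_at_least(3, n_tests, alpha)

print(f"expected by chance: {expected:.1f}")
print(f"P(3 or more of {n_tests}) = {p_three_or_more:.3f}")
```

Under the independence assumption about 1.8 significant differences are expected by chance, and three or more would occur in roughly a quarter of such studies, which is why the authors lean on the consistency of the Appropriateness of Affect result rather than on the count alone.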

The nature of these mean differences can be 
summarized briefly. High reactors by either cri- 
terion are rated as less appropriate in affective 
expression than low reactors. High reactors on the 
basal conductance measure are rated as less ac- 
tive motorically than low reactors. High and Low 
reactors on number of GSR likewise differ in 
motor activity, but not to a significant extent. 


DISCUSSION 


How do these findings fit with past results? 
The tendency toward less motor activity in the 
High reactor, even though statistically significant 
only for a longer term measure of reactivity than 
that of Jones, must be said to fit in nicely with 
his findings. The less appropriate affect in the 
High reactors within our disturbed sample is 
harder to compare with empirical findings on normals, 
because normals, almost by definition, show 
little inappropriateness. At a theoretical level, 
however, inappropriate affective expressions can 
be seen as responses to private, primarily in- 
ternal cues, thus linking the present finding with 
Jones’ general notion that High reactors are more 
strongly influenced by inner emotional responses 
than are Low reactors. In the case of these dis- 
turbed children, however, the inwardly deter- 
mined emotional responses are obviously not part 
of a good social adjustment, as they appeared to 
be in Jones’ normal subjects. In this respect, the 
present findings would seem to agree with those 
of Block, who also found High reactors to have 
affective responsiveness of a type that seemed to 
impair rather than promote good adjustment. 
However, the present results would not entirely 
support Block’s theoretical suggestion that the 
basic personality correlate of GSR reactivity is 
amount of anxiety and/or emotionality. In the 
present study, ratings of amount of anxiety and 
amount of affectivity did not differentiate the re- 
activity groups, while a qualitative variable (ap- 
propriateness) did. In addition, it would seem 
that an increased amount of undifferentiated ten- 
sion in the High reactives should have led, if 
anything, to increased motor activity, instead of 
less, as the results actually suggest. 

The present study thus agrees with Jones’ work 
to the extent of suggesting greater emotional responsiveness 
to internal cues in High GSR reactors, 
and with Block’s study in indicating that 
the social responses of High reactors may not be 
particularly effective. 

Theoretically, there would seem to be no reason 
why Jones’ concept of internalization could 
not allow for negative consequences of inwardly 
determined emotional responses as well as the 
more favorable ones that occurred in his studies. 
If internalization were defined strictly as the in- 
ward direction of emotional sensitivity, and freed 
of the corollary that internalized cues neces- 
sarily or typically produce socially constructive 
responses, it could handle the present findings, 
and probably much of Block’s data as well. Sev- 
eral of the characteristics reported by Block for 
his High reactors, including undue concern with 
ideals and ethical problems, dreaminess, and lack 
of social aggressiveness, would seem as amenable 
to explanation in terms of response to conflicting 
internal cues as in terms of a quantitative excess 
of emotional tension. High autonomic reactivity 
would thus be seen as accompanying the exist- 
ence of numerous and vivid internal cues, with- 
out in any way specifying the nature of those 
cues. It would seem reasonable to hypothesize 
that the exact content of the internalized cues 
and their adjustive value will depend heavily 
upon individual learning experiences. 


PERSISTENCE AT A DIFFICULT TASK WITH ALTERNATIVE 
TASK OF INTERMEDIATE DIFFICULTY 


N. T. FEATHER 
University of New England, Australia 


60 male Ss were tested individually for persistence at an insoluble task pre- 
sented to them as very difficult. They experienced repeated failure at the task 
but could turn to a similar task described as intermediate in difficulty when- 
ever they wished. Differences in persistence at the initial task were examined 
between Ss high in n Achievement and low in Test Anxiety (HL) and Ss low 
in n Achievement and high in Test Anxiety (LH). Results show that: 
(a) persistence is positively related to initial estimates of probability of 
success ($P_s$) at the task for HL Ss (p < .005) but there is no relationship 
between persistence and initial estimates of $P_s$ for LH Ss; (b) estimates of 
$P_s$ tend to rise initially and then to fall following repeated failure (p < .005); 
(c) Ss who persisted at the initial task show a later preference for that task 
(p < .05). Results are discussed in terms of the “motive-expectancy-value” 


model. 


The present investigation of persistence is con- 
cerned with a situation in which an individual 
works at an achievement task presented to him 
as very difficult. He knows that he can turn, 
whenever he so chooses, to another task of the 
same type which is described as intermediate in 
difficulty (i.e., 50/50 probability of success). The 
initial achievement task is, in fact, objectively 
insoluble and the person undergoes repeated 
failure in his attempts to get the solution. Within 
this context, persistence is studied in relation 
to the relative strength within the individual of 
the motives to achieve success and to avoid 
failure. Persistence is measured by the total 
number of trials the person works at the initial 
achievement task before turning to the alter- 
native. 

Results of an earlier study (Feather, 1961) 
indicate that subjects classified as high in n 
Achievement (n Ach) and low in Test Anxiety 
persist longer at an achievement task when it 
is presented to them as easy than as very dif- 
ficult. In contrast, subjects classified as low in 
n Ach and high in Test Anxiety do just the re- 
verse. No attempt was made in this earlier study 
to specify the difficulty level of the alternative 
to which the subject could turn when he so 
desired. The present study is designed to fill this 
gap and specifies an alternative of intermediate 
difficulty. The investigation also provides a fur- 
ther test of the theoretical approach adopted. 

Two hypotheses are investigated. Hypothesis 1 
states that, when the initial achievement task is 
presented as very difficult (subjective probability 
of success, $P_s < .50$) and the alternative achievement 
task as intermediate in difficulty ($P_s = .50$), 
subjects in whom the motive to avoid failure 
is stronger than the motive to achieve success 
($M_{af} > M_s$) should persist longer at the initial 
task than subjects in whom the motive to achieve 
success is stronger than the motive to avoid 
failure ($M_s > M_{af}$). In accordance with the 
earlier theoretical derivation (Feather, 1961, 
p. 555) total motivation to perform the initial, 
subjectively very difficult task should increase 
for subjects in whom $M_{af} > M_s$ as they undergo 
repeated failure, but should decrease for subjects 
in whom $M_s > M_{af}$. Furthermore, since the 
alternative task is described as intermediate in 
difficulty, achievement related motivation to 
perform this task should have a maximum 
negative value for subjects in whom $M_{af} > M_s$, 
and a maximum positive value for subjects in 
whom $M_s > M_{af}$, when $P_s = .50$ (Atkinson, 
1957; Feather, 1961, p. 553). Hence, for the 
former subjects in whom $M_{af} > M_s$, total 
motivation to perform the alternative task should 
be relatively weak; the alternative task should 
provoke a high level of anxiety about failure 
which dominates any achievement interest. In 
marked contrast, for the latter subjects in whom 
$M_s > M_{af}$, total motivation to perform the alternative 
task should be relatively strong; the alternative 
task should elicit a high degree of achievement 
interest which dominates any anxiety about 
failure. It follows from these considerations 
that subjects in whom $M_{af} > M_s$ should persist 
at the initial achievement task longer than subjects 
in whom $M_s > M_{af}$. Since the alternative 
task is presented as intermediate in difficulty we 
would expect these differences in persistence 
between the two types of subjects to be quite 
pronounced. 
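
The derivation above rests on how achievement related motivation varies with $P_s$. The following sketch assumes the standard form of the Atkinson (1957) model, which is cited but not written out in the text: the tendency to approach success is $M_s P_s (1 - P_s)$ and the tendency to avoid failure is $-M_{af} (1 - P_s) P_s$, so their sum is $(M_s - M_{af}) P_s (1 - P_s)$. The motive strengths used are arbitrary, extrinsic motivation is ignored, and the function name is hypothetical.

```python
def resultant_motivation(m_s, m_af, p_s):
    """Resultant achievement motivation, assuming Atkinson's (1957) form.

    Approach tendency:  m_s  * p_s * (1 - p_s)
    Avoidance tendency: -m_af * (1 - p_s) * p_s
    """
    return (m_s - m_af) * p_s * (1.0 - p_s)


grid = [i / 20 for i in range(1, 20)]  # P_s from .05 to .95

# Where is motivation most positive (M_s > M_af) or most negative (M_af > M_s)?
peak_for_success_oriented = max(grid, key=lambda p: resultant_motivation(2.0, 1.0, p))
trough_for_failure_oriented = min(grid, key=lambda p: resultant_motivation(1.0, 2.0, p))
print(peak_for_success_oriented, trough_for_failure_oriented)

# Effect of repeated failure on a very difficult task: P_s falls from .05 to .02.
for label, m_s, m_af in [("M_s > M_af", 2.0, 1.0), ("M_af > M_s", 1.0, 2.0)]:
    before = resultant_motivation(m_s, m_af, 0.05)
    after = resultant_motivation(m_s, m_af, 0.02)
    print(f"{label}: {before:+.4f} -> {after:+.4f}")
```

Under these assumptions the extreme value falls at $P_s = .50$ for both kinds of subject, and a drop in $P_s$ below .50 lowers motivation when $M_s > M_{af}$ but raises it (makes it less negative) when $M_{af} > M_s$, which is the asymmetry Hypothesis 1 relies on.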

Hypothesis 2 states that, when the initial 
achievement task is presented as very difficult 
($P_s < .50$), persistence should be positively related 
to initial expectation of success among 
subjects in whom $M_s > M_{af}$, but there should be 
no relationship between persistence and initial 
expectation of success among subjects in whom 
$M_{af} > M_s$. Although the initial task is described 
to subjects as very difficult, we would expect 
their initial expectations of success to differ, 
some subjects having higher expectations than 
others. Persistence should relate positively to 
initial expectation of success among subjects in 
whom $M_s > M_{af}$ since, for these subjects, total 
motivation to perform the initial task should decrease 
as expectations of success fall with repeated 
failure. The higher the initial expectation 
of success the more unsuccessful attempts at 
the task it should take to reduce expectation to 
the low level at which total motivation to perform 
the initial task becomes less than total 
motivation to perform the alternative (Feather, 
1961, p. 554). However, there is no basis in the 
present theory for predicting a definite relationship 
between persistence and initial expectation 
of success among subjects in whom $M_{af} > M_s$, 
when their initial expectations are very low 
($P_s < .50$). In fact, under such conditions, these 
subjects should persist at an achievement task 
indefinitely 1 (p. 555). 

Both Hypothesis 1 and Hypothesis 2 assume 
that initial expectations for success for the initial 
achievement task are very low ($P_s < .50$) among 
the subjects tested. 


METHOD 


The basic procedure is essentially the same as 
that used in the earlier investigation. n Ach scores 
($M_s$) and Mandler-Sarason Test Anxiety scores 
($M_{af}$) were obtained for 60 male students in introductory, 
undergraduate courses at the University 
of New England. The author administered the 
Thematic Apperception Test (TAT) under neutral 
conditions according to the standard procedure (McClelland, 
Atkinson, Clark, & Lowell, 1953). Six 
pictures were presented in the following order 
(using numbers assigned by Atkinson, 1958): 2, 48, 
1, 7, 100, 24. Interscorer reliability 2 for scoring 
n Ach was .89. In subjects classified as high in 
n Ach and low in Test Anxiety (using median 
splits) it was assumed $M_s > M_{af}$; in subjects classified 
as low in n Ach and high in Test Anxiety it 
was assumed $M_{af} > M_s$. A score of 8 and above 
was designated High n Ach; a score below 8 was 
designated Low n Ach. A score of 83 and above was 
designated High Test Anxiety; a score below 83 
was designated Low Test Anxiety. 


1 Future elaboration of the theory of persistence 
in achievement contexts will need to explain why 
subjects in whom $M_{af} > M_s$ ever stop working at a 
very difficult achievement task (Feather, 1960, pp. 
183-186). Such an elaboration might consider as 
possible factors a decrease in extrinsic motivation 
to perform the initial task with performance and the 
development of other motivational tendencies associated 
with alternatives. 


FIG. 1. Item 1 (insoluble) and Item 2 (soluble) from 
the Perceptual Reasoning test. 
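
The median-split classification used in this section can be written out as a small sketch. The cut points (8 for n Ach, 83 for Test Anxiety) and the group labels HL and LH come from the report; the function name and the handling of the remaining subjects (returned as None) are assumptions for illustration.

```python
def classify(n_ach, test_anxiety, n_ach_cut=8, anxiety_cut=83):
    """Assign a subject to the HL or LH motivation group.

    HL: High n Ach, Low Test Anxiety  (assumed M_s > M_af)
    LH: Low n Ach, High Test Anxiety  (assumed M_af > M_s)
    Subjects high on both or low on both return None in this sketch.
    """
    high_ach = n_ach >= n_ach_cut
    high_anxiety = test_anxiety >= anxiety_cut
    if high_ach and not high_anxiety:
        return "HL"
    if high_anxiety and not high_ach:
        return "LH"
    return None


# Toy scores, not data from the study.
for scores in [(8, 82), (7, 83), (8, 83), (7, 82)]:
    print(scores, classify(*scores))
```

Only the two extreme groups enter the comparisons between motivation groups, which is why the tables that break results down by motivation are based on a subset of the 60 subjects tested.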

A subject's Test Anxiety score was obtained by 
summing his scores on the individual items of the 
Test Anxiety Questionnaire. As in the previous 
investigation these item scores were obtained by 
dividing the rating scale for each item into five 
equal parts and by scoring responses from one to 
five in the direction of increasing anxiety. 

Some 3 months after this initial testing the test 
of persistence was administered to each of the 60 
subjects in individual sessions. The procedure 
adopted ensured that the experimenter was unaware, 
when testing a subject for persistence, of his n Ach 
and Test Anxiety scores. The first two items of 
the Perceptual Reasoning test were used in the 
present study. These two items consist of the 
unicursal puzzles presented in Figure 1. For each 
item the subject is required to trace over all the 
lines of the diagram without lifting his pencil from 
the paper and without tracing over any line twice. 
Detailed instructions for administration of this test 
are presented in the earlier paper (Feather, 1961, 
p. 556). Item 1 is, in fact, insoluble and the subject 
can work at it for as many trials as he chooses, 
failing at each attempt, before he decides to turn 
to Item 2. Each subject was allowed to persist at 
Item 1 for up to 20 minutes (if he so desired) so 
as to increase the possibility of obtaining a wide 
variance in persistence scores. It was necessary to 
suggest to 12 subjects who were still persisting at 
Item 1 after 20 minutes that they move on to 
Item 2. This second item is soluble and quite easy. 
All subjects were able to solve it. A subject could 
not see the content of Item 2 until he began to 
work at it but he knew that the item was similar in 
kind to Item 1. 


2 The author wishes to thank Catherine Macleod 
for her assistance in scoring the TAT protocols and 
for her administration of the test of persistence. The 
present study was conducted as part of her Honors 
program (Macleod, 1961). The author also wishes 
to thank Graeme Halford for scoring the Test 
Anxiety Questionnaires. 

Initial subjective probabilities of success for 
Items 1 and 2 were induced by reporting fictitious 
group norms. The following instructions were given 
to the subject before he began working at Item 1: 


The two items that I’m going to give you 
today vary in difficulty. The first item is very 
difficult. In fact, only about 5% of University 
students are able to pass it, so the chances of 
success are very low indeed. The second item is 
of average difficulty. About 50% of University 
students are able to pass it, so the chances of 
success are 50/50. 


Thus, in contrast to the previous study, a subject 
knew the difficulty level of the alternative task in 
advance. 

As in the previous investigation, estimates of 
probability of success and ratings of confidence in 
these estimates were obtained from subjects prior 
to performance at Item 1 as a check on the effective- 
ness of the fictitious group norm procedure. Prob- 
ability estimates and confidence ratings related to 
the middle and terminal stages of performance at 
Item 1, along with other information relevant to 
assumptions involved in the hypotheses, were ob- 
tained from a postperformance questionnaire ad- 
ministered at the end of the experimental session. 


RESULTS 

Analysis of Persistence Data 


The distribution of persistence trials at Item 1 
for the 60 subjects tested in the present investigation 
has a range from 1 to 48+³ and a 
median of 9.5. Persistence trials for subjects 
classified as High n Ach-Low Test Anxiety and 
for subjects classified as Low n Ach-High Test 
Anxiety are designated as high and low in terms 
of whether they are above or below this median. 
The resulting frequencies are presented in 
Table 1. 

Table 1 does not provide definite support for 
Hypothesis 1. The results indicate that the 
Proportion of subjects who show high per- 
sistence at Item 1 is greater for those subjects 
classified as low in n Ach and high in Test 
Anxiety than for those subjects classified as high 
in n Ach and low in Test Anxiety. This difference 
in Proportions is consistent with Hypothesis 1 in 
implying that, when the initial task is presented 
as very difficult and the alternative task is 


i 

2 The + indicates that the subject had to be inter- 
rupted. Due to the interruption procedure some 
scores are open-ended. Hence, results are classified 
on the basis of median splits and relationships in 
the resulting frequency tables analyzed using x2 as 
the basic statistic, 
TABLE 1 

NUMBER OF SUBJECTS WHO WERE HIGH AND LOW IN 
PERSISTENCE ON ITEM 1 IN RELATION TO STATED 
DIFFICULTY OF THE TASK AND NATURE 
OF MOTIVATION 
(N = 36) 

      Motivation          Stated difficulty    Persistence trials 
n Ach    Test Anxiety     of task              High(a)    Low(b) 
High     Low              5% (difficult)          6         12 
Low      High             5% (difficult)         10          8 

Note. χ² = 1.80, df = 1, ns (p < .10, one-tailed test). 
a Above median. 
b Below median. 

described as intermediate in difficulty, subjects 
in whom M_AF > M_S tend to persist longer at the 
initial task than subjects in whom M_S > M_AF. 
However, the difference in proportions is not 
statistically significant (χ² = 1.80, df = 1, p < 
.10, one-tailed test). 
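
The χ² reported for Table 1 can be checked directly from the four cell frequencies. The sketch below is an illustration rather than the author's own computation: it assumes an uncorrected Pearson χ² (no continuity correction) and takes the one-tailed probability as half the upper-tail area for 1 df. The function names are hypothetical.

```python
import math


def chi_square_2x2(table):
    """Pearson chi-square for a 2x2 table of counts, without continuity correction."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    return chi2


def upper_tail_df1(chi2):
    """Upper-tail probability of a chi-square variate with 1 df."""
    return math.erfc(math.sqrt(chi2 / 2.0))


# Rows: High n Ach-Low Test Anxiety, Low n Ach-High Test Anxiety.
# Columns: high persistence, low persistence (Table 1).
table_1 = [[6, 12], [10, 8]]
chi2_table_1 = chi_square_2x2(table_1)
p_one_tailed = upper_tail_df1(chi2_table_1) / 2.0
print(round(chi2_table_1, 2), round(p_one_tailed, 3))
```

Under these assumptions the statistic comes to 1.80 and the one-tailed probability falls between .05 and .10, which agrees with the note to Table 1.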

Table 2 presents data concerning the relation- 
ship of persistence to estimates of probability of 
success given prior to performance at Item 1 by 
subjects classified as high in n Ach and low in 
Test Anxiety, and by subjects classified as low 
in n Ach and high in Test Anxiety. These prob- 
ability estimates are designated as high or low in 
terms of whether they are above or below the 
median of the distribution for all subjects. This 


TABLE 2 

NUMBER OF SUBJECTS WHO WERE HIGH AND LOW IN 
PERSISTENCE ON ITEM 1 IN RELATION TO INITIAL 
ESTIMATES OF PROBABILITY OF SUCCESS FOR 
ITEM 1 AND NATURE OF MOTIVATION 
(N = 36) 

     Motivation         Initial probability     Persistence trials 
n Ach   Test Anxiety    estimate for Item 1     High(a)   Low(b) 
High    Low             High                       6         3 
                        Low                        0         9 
Low     High            High                       5         4 
                        Low                        5         4 

Partition of χ² 
Source                                              χ²       df 
Motivation × Persistence                            1.80      1 
Probability Estimate × Persistence                  4.05*     1 
Motivation × Probability Estimate × Persistence     4.05*     1 
Total                                               9.90**    3 

a Above median. 
b Below median. 
* p < .05. 
** p < .02. 


TABLE 3 

NUMBER OF SUBJECTS WHO WERE HIGH AND LOW IN 
PERSISTENCE ON ITEM 1 IN RELATION TO INITIAL 
ESTIMATES OF PROBABILITY OF SUCCESS 
FOR ITEM 1 
(N = 60) 

Initial probability         Persistence trials 
estimate for Item 1         High(a)    Low(b) 
High(a)                       20         10 
Low(b)                         9         21 

Note. χ² = 8.08, df = 1, p < .01, two-tailed test. 
a Above median. 
b Below median. 


median probability estimate is 16.5. Table 2 also 
presents the results of a partitioned χ² analysis 
(Sutcliffe, 1957) applied to these data. 
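
The additive structure of the partition in Table 2 can be illustrated with a short sketch. This is not a transcription of Sutcliffe's (1957) formulas; it simply collapses the four motivation-by-estimate cells in three orthogonal ways, which reproduces the tabled components here because each cell contains nine subjects. The frequencies for the Low n Ach-High Test Anxiety group with high estimates are taken as 5 and 4, an inference from the Table 1 marginals for that group (10 high, 8 low in persistence); all names in the code are hypothetical.

```python
def chi_square(table):
    """Pearson chi-square for an r x c table of counts (no continuity correction)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    total = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            total += (observed - expected) ** 2 / expected
    return total


def pool(*rows):
    """Add several (high persistence, low persistence) rows cell by cell."""
    return [sum(cells) for cells in zip(*rows)]


# Each row is (high persistence, low persistence).
hl_high_est = [6, 3]   # High n Ach-Low Test Anxiety, high estimate
hl_low_est = [0, 9]    # High n Ach-Low Test Anxiety, low estimate
lh_high_est = [5, 4]   # Low n Ach-High Test Anxiety, high estimate (inferred, see lead-in)
lh_low_est = [5, 4]    # Low n Ach-High Test Anxiety, low estimate

total = chi_square([hl_high_est, hl_low_est, lh_high_est, lh_low_est])            # df = 3
motivation = chi_square([pool(hl_high_est, hl_low_est), pool(lh_high_est, lh_low_est)])
estimate = chi_square([pool(hl_high_est, lh_high_est), pool(hl_low_est, lh_low_est)])
interaction = chi_square([pool(hl_high_est, lh_low_est), pool(hl_low_est, lh_high_est)])

print(round(motivation, 2), round(estimate, 2), round(interaction, 2), round(total, 2))
```

With these frequencies the three single-df components (1.80, 4.05, 4.05) sum to the total of 9.90 on 3 df. With unequal cell sizes this simple collapsing would not in general be additive.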

Results in Table 2 support Hypothesis 2. 
The partition of χ² indicates that the two mo- 
tivation groups (High-Lows versus Low-Highs) 
differ significantly (χ² = 4.05, df = 1, p < .05) 
with respect to the relationship of persistence 
at Item 1 to initial estimates of probability of 
success. In Table 2 the positive relationship be- 
tween persistence and probability estimates is 
highly significant (p < .005, Fisher exact test, 
one-tailed) among subjects classified as high in 
n Ach and low in Test Anxiety. In contrast, 
Table 2 shows that there is no evidence for a 
relationship between persistence and probability 
estimates among subjects classified as low in 
n Ach and high in Test Anxiety. These data are 
consistent with Hypothesis 2 in implying that, 
when the initial achievement task is presented as 
very difficult (P_s < .50), persistence should be 
positively related to initial expectation of success 
among subjects in whom M_S > M_AF, whereas 
there should be no relationship between per- 
sistence and initial expectation of success among 
subjects in whom M_AF > M_S. 
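
The one-tailed Fisher exact probability cited for the High n Ach-Low Test Anxiety group can be sketched from the hypergeometric distribution. The code below is a minimal illustration under the usual fixed-margins assumption; the function name is hypothetical, and the table layout (rows for high and low estimates, columns for high and low persistence) follows Table 2.

```python
from math import comb


def fisher_exact_one_tailed(table):
    """Probability of a top-left count at least as large as observed, margins fixed."""
    (a, b), (c, d) = table
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2
    largest = min(row1, col1)
    favourable = sum(
        comb(row1, k) * comb(row2, col1 - k) for k in range(a, largest + 1)
    )
    return favourable / comb(n, col1)


# High n Ach-Low Test Anxiety subjects:
# rows = high / low initial estimate, columns = high / low persistence.
high_low_group = [[6, 3], [0, 9]]
p_fisher = fisher_exact_one_tailed(high_low_group)
print(round(p_fisher, 4))
```

Because the observed table is already the most extreme one possible for these margins, the probability reduces to a single term, 84/18564, which is below .005.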

Finally, Table 3 presents data concerning the 
relationship of persistence to initial estimates of 
probability of success for all 60 subjects tested 
in the present study. 

Table 3 shows that there is a statistically 
significant tendency (χ² = 8.08, df = 1, p < .01) 
for subjects with high initial probability esti- 
mates to persist longer at Item 1 than subjects 
with low initial probability estimates, and con- 
versely. No such relationship was found in the 
earlier study (Feather, 1961). 

In summary, the results obtained from an 
analysis of persistence trials for Item 1 support 
Hypothesis 2 but do not provide definite sup- 
port for Hypothesis 1. 
Analysis of Supplementary Data 


The following results obtained from analysis 
of responses to the postperformance question- 
naire are of interest in their bearing upon as- 
sumptions involved in the theoretical derivation 
of the hypotheses and in providing suggestions 
for future research. 

Probability estimates. Means of the prob- 
ability estimates obtained from the 60 subjects, 
both prior to performance at Item 1 and from 
the postperformance questionnaire for the middle 
and terminal stages of performance at Item 1, 
show an initial increase followed by a decrease. 
These mean probability estimates for the three 
different stages of performance at Item 1 are 
as follows: Prior to performance, M = 20.37, 
SD = 16.63; middle stage of performance, M 
= 28.00, SD = 21.20; terminal stage of perform- 
ance, M = 19.95, SD = 21.90. These means 
differ significantly (F = 7.29, df = 2/118, p < 
.005) and the trend is independent of n Ach and 
Test Anxiety. Apparently subjects become more 
“optimistic” about their chances of success once 
they have worked at Item 1 for a number of 
trials. After a certain number of failures, how- 
ever, expectations of success begin to decline. 
This tendency for probability estimates to rise 
initially did not appear in the earlier study 
(Feather, 1960, 1961), where estimates decreased 
from prior to performance through the middle 
to the terminal stage of performance. It is 
interesting to note that, as in the previous study, 
probability estimates tend to exceed the 5% 
reported norm. This general tendency to over- 
estimate low probabilities has been found in 
previous research (cf. Cohen, 1960; Feather, 
1963; Preston & Baratta, 1948). 

Preference for Item 1 versus Item 2. Data 
from the postperformance questionnaire indicate 
that subjects who showed high persistence at 


TABLE 4 

NUMBER OF SUBJECTS CHOOSING ITEM 1 OR ITEM 2 
IN THE POSTPERFORMANCE QUESTIONNAIRE 
IN RELATION TO DEGREE OF PERSISTENCE 
AT ITEM 1 
(N = 60) 

                              Persistence trials 
Choice in questionnaire       High(a)    Low(b) 
Item 1 (5% difficulty)          17          9 
Item 2 (50% difficulty)         12         22 

Note. χ² = 5.34, df = 1, p < .05, two-tailed test. 
a Above median. 
b Below median. 
Item 1 tend to select Item 1 when asked which 
of Item 1 (5% difficulty) or Item 2 (50% dif- 
ficulty) they would have chosen to work at if 
they had been given the choice at the beginning 
of the test session. In contrast, subjects who 
showed low persistence at Item 1 tend to prefer 
Item 2. The positive relationship between per- 
sistence at Item 1 and choice of Item 1 in the 
postperformance questionnaire is evident in 
Table 4, and is statistically significant (χ² = 5.34, 
df = 1, p < .05). 


DISCUSSION 


The above results, taken in conjunction with 
those of the earlier study, provide strong sup- 
port for the type of theoretical analysis of the 
persistence situation employed in both investiga- 
tions in the derivation of hypotheses. The in- 
teresting difference between the results of the 
two investigations is that, whereas in the earlier 
study no general relationship was found be- 
tween persistence at Item 1 and probability esti- 
mates given by subjects prior to performance at 
the item, the present investigation reveals a 
general tendency for subjects to persist longer 
at Item 1 when their estimated probabilities of 
success are high rather than low (Table 3). The 
results show that this positive relationship be- 
tween persistence and probability estimates is 
very apparent for subjects classified as high in 
n Ach and low in Test Anxiety, but does not 
occur for subjects classified as low in n Ach 
and high in Test Anxiety (Table 2). 

One would expect a general trend of this nature 
if, among subjects tested, a high proportion were 
those in whom the motive to achieve success is 
stronger than the motive to avoid failure (M_S 
> M_AF). There are, in fact, good reasons for 
expecting a predominance of subjects at uni- 
versity level in whom M_S > M_AF, and a greater 
predominance of such persons in Australian as 
opposed to American universities. The proportion 
of high school students proceeding to university 
is lower in Australia than in the United States. 
Selection for university undergraduate work 
tends to be more stringent, perhaps too stringent. 
It is reasonable to suppose that many students 
who perform poorly at examinations do so partly 
because they become highly anxious under test 
conditions. These students are selected out and 
do not proceed to university (cf. Atkinson & 
Litwin, 1960, p. 61). Such considerations lead 
one to expect that Australian freshmen may have 
lower Test Anxiety scores than American fresh- 
men. A comparison of Test Anxiety scores from 
the earlier study (American students) with 
scores from the present study (Australian stu- 
dents) supports this prediction. The respective 
mean Test Anxiety scores are as follows: Earlier 
study, M = 97.92, SD = 22.64; present study, 
M = 83.97, SD = 19.04. The difference between 
these means is highly significant (F = 15.23, 
df = 1/147, p < .001). When this result is taken 
in conjunction with the relatively high ratings 
of achievement concern and the relatively low 
ratings of anxiety about failure at Item 1 given 
by subjects in the postperformance question- 
naire, the supposition is strengthened that a large 
proportion of subjects in the present study are 
subjects in whom M_S > M_AF. This may account 
for the failure to obtain statistically significant 
support for Hypothesis 1 in the present study 
(Table 1), when differences in persistence be- 
tween subjects from the contrasting motivation 
groups are examined. The group of subjects 
classified as low in n Ach and high in Test 
Anxiety may include some in whom the motive 
to avoid failure is not stronger than the motive 
to achieve success but rather the reverse. 
Future research is needed in test development 
to improve the basis for selecting subjects in 
whom M_S > M_AF, as contrasted to those in whom 
M_AF > M_S. If the Test Anxiety scale is to be 
used again in relation to the study of persistence, 
it appears that research with Australian students 
will need to study more extreme groups on the 
Test Anxiety continuum. 

The results of the present study indicate that 
estimates of chances of success at Item 1 tend 
to rise and then fall with repeated failure at 
the task. This trend did not occur in the earlier 
study, where these estimates tended to decrease 
with successive failure experiences. The initial 
rise in probability estimates obtained in the 
present study may relate to the earlier finding 
(Feather, 1960) that subjects are less confident 
about their probability estimates prior to per- 
formance of a task when the task is presented as 
very difficult than as easy. Estimates of prob- 
ability of success given by a subject after the 
fictitious norm has been reported but before he 
has seen the content of an item may change once 
the task is available to him, i.e., the structure of 
the task should influence his expectation of 
success (cf. Feather, 1963). In the present study 
this influence seems to have produced an initial 
"optimism" among subjects. As subjects be- 
come more familiar with the task we would 
expect them to become more confident about 
their estimates of probability. Results from the 
present study do indicate that subjects tend to 
be more confident about their probability esti- 
mates for the middle and terminal stages of 
performance than about the estimates they gave 
prior to performance at Item 1. The respective 
mean confidence ratings are as follows: Prior to 
performance, M = 2.22, SD = .69; middle stage 
of performance, M = 2.78, SD = .79; terminal 
stage of performance, M = 2.85, SD = .96. The 
difference between these means is highly sig- 
nificant (F = 13.80, df = 2/118, p < .001). 
Clearly, future research needs to consider the 
different factors which may influence change in 
expectation of success with repeated failure at 
a task. As indicated previously (Feather, 1961, 
1962), specification of the rate and manner of 
change in expectation of success with repeated 
failure is essential if predictions about per- 
sistence are to be made in the present theoretical 
context. 

Finally, it is interesting to note that when 
subjects were asked in the postperformance 
questionnaire which of Item 1 (5%) or Item 2 
(50%) they would have chosen if they had been 
given the choice at the beginning of the test 
session, subjects who showed high persistence at 
Item 1 tended to choose Item 1 while subjects 
who showed low persistence at Item 1 tended 
to choose Item 2 (Table 4). It is possible that 
subjects in stating a preference for Item 1 were 
attempting to justify their extreme persistence 
at Item 1 to the experimenter. Alternatively, one 
can argue that total motivation to perform Item 
1 was not entirely reduced for highly persistent 
subjects and remained at a sufficient level to 
influence their later preference. Certainly, highly 
persistent subjects tend to give higher prob- 
ability estimates for the terminal stage of per- 
formance than do less persistent subjects (cor- 
relation of persistence trials with probability 
estimates for the terminal stage of performance 
is r = .53, df = 58, p < .001). Such an explana- 
tion is similar to the Lewinian concept of per- 
sisting tension following an incompleted task. In 
a sense, Item 1 has not been completed; the 
subject has not succeeded because the task is 
insoluble. Total motivation to perform Item 2 
has, however, been dissipated since the problem 
has been solved and expectation of success is 
now certain. Thus, as in the studies by 
Ovsiankina and Lissner (Lewin, 1935), we would 
expect the nondissipated total motivation to 
perform Item 1 to persist and later to influence 
a preference for the item. Future research might 
well be directed towards an analysis of the 
Lewinian literature on incompleted tasks with 
a view to discovering whether this area of in- 
vestigation which has been based on a "tension" 
model can be conceptualized using the “motive- 
expectancy-value” framework adopted in the 
present study. 
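
The significance level attached to the correlation between persistence trials and terminal probability estimates (r = .53, df = 58) can be checked with the standard conversion of r to t. The sketch below assumes the conventional test of a Pearson correlation against zero; the function name is hypothetical.

```python
import math


def t_from_r(r, df):
    """t statistic for testing a Pearson correlation against zero, with df = n - 2."""
    return r * math.sqrt(df / (1.0 - r * r))


t_persistence = t_from_r(0.53, 58)
print(round(t_persistence, 2))
```

The resulting t of about 4.76 lies well beyond the value of roughly 3.5 needed for the two-tailed .001 level with about 60 df, in line with the reported p < .001.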
BIRTH ORDER AND AN EXPERIMENTAL STUDY OF EMPATHY¹ 


EZRA STOTLAND AND JAMES A. WALSH 
University of Washington 


104 Ss in all, divided into groups of 6, had the opportunity of empathizing 
with the assumed anxiety of 1 of their number with whom they had inter- 
acted. The critical S was placed in a test situation and in ½ of the cases 
performed poorly and in the other ½ very well. The palmar sweating of the 
other Ss was measured during the test performance and their reports on their 
own feelings of anxiety were obtained. 


One problem of the present study was to 
demonstrate empathy experimentally, i.e., an 
individual’s experiencing the same emotion be- 
cause he perceives another to experience this 
emotion. The particular emotion studied was 
anxiety. Another problem of the study was 
whether an individual’s tendency to empathize 
was related to his birth order. 

The present approach to empathy is an exten- 
sion of previous work (Stotland & Cottrell, 1962) 
in which it was demonstrated that individuals 
tend to acquire attributes which they perceive 
in others with whom they have previously inter- 
acted. In the Stotland and Cottrell study, which 
was not directly concerned with empathy, the 
attribute acquired was level of performance on 
a task requiring a person to count the repetitions 
of a given syllable in a passage from the Hindi 
language. In the present study, the attribute that 
the subjects might acquire was the level of 
anxiety they perceived in a person with whom 
they have previously interacted. The first hy- 
pothesis was that, if individuals have interacted 
with another person, they will experience more 
anxiety when they perceive this person to be in 
a high anxiety provoking situation than when 
they perceive him to be in a low anxiety provok- 
ing situation. Stotland and Cottrell also found 
that later born children tended more to acquire 
the attribute of another with whom they had 
previously interacted than did first and only 
children. In the present study, it was hy- 
pothesized that the difference predicted in the 
first hypothesis above would obtain more for 
later borns than for first and only borns. 


METHOD 
Subjects 


One hundred and four students in introductory 
and intermediate courses in psychology at the Uni- 
versity of Washington were the subjects. They were 


1 The work on this study was financed by a grant 
(M-2423) from National Institutes of Health, 
United States Public Health Service. 


recruited as participants in three one-hour sessions 
of research for a total of $3.75. During one session 
they participated in the present laboratory study. 
They were combined into six-person, same-sex, 
groups on the basis of having the same hour free. 
An attempt was made to take one subject from each 
of the six classes from which they were recruited in 
order to minimize the possibility of the subjects 
being acquainted with each other prior to the 
experimental session. 


Procedure 


When a group of six subjects arrived at the 
experimental room they were seated at a round 
table in the middle of which was a Lazy Susan. 
The table was divided into six compartments by 
separators. The subjects were told that they were 
first to form words from packs of syllables each 
subject found in his compartment. The words 
formed fell into one of six categories of nouns. 
The subjects’ second job was to collect words of 
one category. When they formed words in their own 
category, they kept them. When they formed a 
word in some other person’s category, they sent it 
to him. Subjects were instructed to keep track on 
their record sheet of the number of words they sent 
to each other category and the number of words 
they received from each category. The only inter- 
action the subjects had with each other was the 
sending and receiving of words. It is obvious that 
the distribution of words could be arranged so 
that varying patterns or amounts of word inter- 
change would result. In this study the materials were 
arranged so that five of the subjects exchanged 
words only with the sixth. 

The purpose of the first part of the experiment 
was simply to have five of the six subjects interact 
with the sixth one. In the next part of the study, 
they were to perceive that the interacting subject 
did poorly or well at a task, and therefore to 
perceive him as experiencing more or less anxiety. At 
the same time, their level of anxiety was measured, 
both by measuring the amount of palmar sweating 
and by the subjects’ own reports. 

Palmar sweating was measured by a procedure 
originated by Silverman and Powell (1944) and 
discussed by Mowrer (1953). It involves a simple 
chemical process requiring the subject to keep one 
finger pressed on a sheet of paper for 3 minutes. 
Since the subjects had saturated their finger tips 
with ferric chloride, their perspiration dissolved the 
ferric chloride, which was absorbed by the paper. 
Since the paper had been treated with tannic acid, 
a fingerprint was formed in the paper. The darkness 
of the print is the index of amount of perspiration. 
This measure was taken twice during the experiment, 
and the change in perspiration was the measure of 
anxiety. At each measure, a different finger was 
used because of the residue of ferric chloride left 
after each administration. 

After the word assembly task had been com- 
pleted, the experimenter told the subjects that one 
of them would be selected by chance to sit at a 
tape recorder placed on a table a few feet from the 
word assembly task. The experimenter said that the 
person selected by chance was the subject who 
collected names of animals. After the subject was 
seated at the recorder, earphones were fitted to his 
head. Then the experimenter handed out to each 
person a booklet (to be described below), and the 
simple apparatus needed for the measure of perspira- 
tion. The experimenter then said: 


Now let me make this very clear. The person 
seated here will listen to a tape and take a test. 
No one else will take this test either in front 
of the group or in private. This person is the 
only one who will take it. Nor will the rest of 
you take any test remotely like it. 


The reason for this statement to the subjects was 
to eliminate the possibility that any anxiety in the 
subjects observing the test would result from an 
anticipation of failing at the same test. The experi- 
menter then went on to explain the mechanics of 
taking the palmar sweat measure. He said, 


These instructions concern the preparation for 
our chemical measure of your reaction to one 
another. Exactly what is happening will be un- 
clear now, but I will explain it to you more fully 
later. 


While they were pressing their fingers on the 
paper, the subject at the tape recorder (hereafter 
called the demonstrator) was given the following 
instructions as the subjects listened and observed: 


The test that you are about to take is called 
the Watson-Glaser test of critical thinking. It is 
a test of how logically you can think when you 
are under pressure, that is, if you have only a 
little time to figure something out. In other words, 
it is a test of how fast you can think on your 
feet. This test has been found to be a very good 
predictor of how well a person is going to do 
in various professions and lines of business. You 
can see that the ability to think on your feet is 
a very important one in a great variety of voca- 
tions. This test is particularly valuable because 
many people do not have a clear or accurate 
idea of how well they can think on their feet. 

I will give you your instructions orally, but you 
will actually take this test by listening to the 
question through earphones. You can see how 
listening to the questions is like a situation in 
which you have to think on your feet. Similarly, 
you are to give your answers orally. There is 
nothing for you to write down. 


The test was described as a series of syllogisms, 
with the subjects deciding whether each was log- 
ically sound or unsound. The subject was given 7 
seconds in which to respond to each syllogism. The 
purpose of giving the test was to give the demon- 
strator a personally important experience of success 
or failure. Since the items could be heard only 
through the earphones, the observing subjects could 
not know directly how well or poorly the demon- 
strator was doing and had to rely on the experi- 
menter's evaluation, 

When the instructions were completed, the ex- 
perimenter told the subjects to remove their fingers 
from the paper but not to be surprised if they 
left a fingerprint. He then instructed them to keep 
a record of the demonstrator’s responses on the 
first sheet of the booklets given them. They 
recorded the demonstrator's responses by circling 
either an S or a U after each of his responses. Then 
they were asked to record any unusual expressions 
by the demonstrator. These instructions were given 
to insure the attention of the subjects. The experi- 
menter then turned on the tape recorder, which 
played 17 items of the test into the demonstrator's 
earphone. The experimenter pretended to record the 
oral responses of the demonstrator. 

When the test was over, the experimenter in- 
structed all the subjects to repeat the palmar sweat 
measure, this time using their second fingers. During 
the 3 minutes in which the subjects pressed their 
fingers on the scale, the experimenter went back 
over the responses of the demonstrator one at a 
time and stated whether each of the responses was 
right or wrong. He made these statements loud 
enough so that all the subjects could hear. In the 
Good condition, he indicated that the demonstrator 
was right on 13 of the 17 items. Then he said: 


Let me see. You got ... 13 out of 17 right. 
That's extremely good. Let's see how you did 
with respect to other college students. Let me 
see ... you would be in the upper 25%. Seventy- 
five percent of college students would be below 
you—not as good as you in their ability to think 
fast under pressure; that is, to think fast and 
logically on their feet in diffücult situations. 


In the Poor condition, the experimenter said that 
13 of the 17 were wrong and then he said: 


Let me see. You got ... 13 out of 17 wrong. 
That is not good. Let's see how you did with 
respect to other college students. Let me see ... 
you would be in the lower 25%. Seventy-five per- 
cent of college students would be above you— 
better than you in their ability to think fast 
under pressure; that is, to think fast and logically 
on their feet in difficult situations. 
These evaluations of the demonstrator's perform- 
ance were timed to last 3 minutes, at the end of 
which the experimenter told the subjects to remove 
their fingers from the paper and to answer the 
questions in the inside pages of the booklets they 
had received. The first three items concerned the 
subjects’ perception of the anxiety of the demon- 
strator and were answered on eight-point scales. 
The first item was rating of the calmness or nervous- 
ness of the demonstrator while taking the test; the 
second, the same rating while he heard how he did 
on the test; the third, the same rating “now.” The 
responses to these three items were added to form 
an index of perceived anxiety of the demonstrator. 
The fourth item was a rating on an eight-point 
scale of how interested the subjects were while 
the demonstrator took the test. 
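
The index of perceived anxiety described above is a simple sum of three ratings. A minimal sketch follows; it assumes that each eight-point scale is scored from 1 to 8 with higher values meaning greater nervousness, a scoring convention the text does not state. The function name is hypothetical.

```python
def perceived_anxiety_index(during_test, during_evaluation, now):
    """Sum three eight-point ratings of the demonstrator's nervousness.

    Assumes each rating is an integer from 1 to 8 (an assumed scoring convention).
    """
    ratings = (during_test, during_evaluation, now)
    for rating in ratings:
        if not 1 <= rating <= 8:
            raise ValueError("each rating must lie between 1 and 8")
    return sum(ratings)


# One hypothetical observer's three ratings of the demonstrator.
example_index = perceived_anxiety_index(4, 5, 3)
print(example_index)
```

Under this assumed scoring the index can range from 3 to 24.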

The fifth through eighth items (answered on 
eight-point scales) concerned the subjects’ reports 
of their own nervousness. The items were: 


How calm or tense did you feel when you 
found out how the person [i.e., the demonstrator] 
had done? 

How hard or how easy are you finding it to 
concentrate on these questions? 

How dry or how wet do you think that your 
mouth is now? 

How calm or tense do you feel right now? 


The next two items asked the subjects to rate how 
well they thought they would do on the test, first 
on an evaluation scale, second in comparison 
to other college students. The next item concerned 
their desire to meet the demonstrator, and the 
following item asked about their interest in meeting the other 
subjects in the study. 

When the subjects were through, the experimenter 
explained to the demonstrator that his reported 
performance was fictitious, and at a later session the 
study was explained and all the subjects’ questions 
answered. 


RESULTS 


Before testing the hypothesis, it is necessary 
to demonstrate that the subjects perceived the 
poor demonstrator as being more anxious than 
the good one. Table 1 shows that the subjects in 
the Poor condition rated the demonstrator sig- 
nificantly more anxious than in the Good 
condition. 

The first hypothesis leads to the prediction that 
subjects in the Poor demonstrator condition will 
manifest more anxiety than subjects in the Good 
condition. Two measures of subject anxiety were 
obtained, their changes in palmar sweating and 
their reports of their own anxiety. Table 1 shows 
that there was no significant difference between 
conditions on palmar sweating, the means being 
almost identical. However, the subjects did 
report that they were significantly more anxious 
when they had heard the poor evaluation of the 
demonstrator than the good. Also, they show a 
slight tendency to report a parallel difference 
in anxiety while filling out the questionnaire 
(Row 4). They do not show any difference on 


TABLE 1 

MEAN SCORES ON CHANGE IN PALMAR SWEATING AND POST-EXPERIMENTAL 
RATINGS FOR EACH CONDITION 

                                                   Evaluation of demonstrator 
                                                   Good         Poor 
                                                   (N = 49)     (N = 55)       t 
 1. Perceived anxiety of demonstrator               5.40      [illegible]   [illegible] 
 2. Change in palmar sweating(a)                   96.40        95.20 
 3. Self-report of anxiety when demonstrator's 
    performance was evaluated                       1.93         3.36        4.931*** 
 4. Self-report of anxiety at moment                2.56      [illegible]   [illegible] 
 5. Self-report of dryness of mouth                 3.21         3.20 
 6. Self-report of difficulty of concentrating      3.81         3.85 
 7. Evaluation of own ability on test               2.12         2.50        2.000* 
 8. Comparison of own ability to others on test     2.90         3.11 
 9. Interest while demonstrator taking test         4.31         3.73 
10. Attraction to demonstrator                      4.30         5.00 
11. Attraction to group                             4.25         4.42 

* p < .05, one-tailed. 
** p < .03, one-tailed. 
*** p < .001. 
a Change in sweating. [illegible] change toward less palmar sweating. This 
score was obtained by first taking the reading on each of the two prints 
[illegible]. If the second print was darker, the difference in the two 
[remainder of footnote missing] 
TABLE 2 

MEAN SCORES FOR FIRST AND ONLY CHILDREN AND LATER BORN CHILDREN ON CHANGES IN PALMAR 
SWEATING AND POST-EXPERIMENTAL RATINGS FOR EACH CONDITION 

                                                  Later borns      First and only borns 
Demonstrator evaluation                           Good    Poor     Good    Poor         t 
                                                   a       b        c       d 
 1. Perceived anxiety of demonstrator(a)           4.3    10.4      6.1    10.2    a vs. b: 5.125*** 
                                                                                   c vs. d: 7.922*** 
                                                                                   a vs. c: 2.500*** 
 2. Changes in palmar sweating(b)                 92.8   100.2     98.8    91.1 
 3. Self-report of anxiety when demonstrator’s 
    performance was evaluated(a)                  1.94    3.55     1.93    3.15 
 4. Self-report of anxiety at moment(a)           2.53    3.17     2.59    2.62 
 5. Self-report of dryness of mouth(a)            3.26    3.62     3.17    2.92 
 6. Self-report of difficulty in concentrating(a) 3.79    3.93     3.83    3.77 
 7. Evaluation of own ability on test(c)          2.10    2.72     2.20    2.23 
 8. Comparison of own ability to others(c)        2.81    3.41     2.91    2.73 
 9. Interest while demonstrator taking test       4.41    3.21     4.14    4.35    a vs. b: 2.000* 
                                                                                   b vs. d: 2.037** 
10. Attraction to demonstrator(a)                 4.32    5.93     4.31    4.00 
11. Attraction to group                           4.37    4.21     4.17    4.65 
    N                                              18      29       29      26 

a The higher the score, the greater the attribute rated. 
b See Table 1. 
c The lower the number, the better the self-evaluation. 
* p < .06, two-tailed. 
** p < .05, two-tailed. 
*** p < .001. 


dryness of mouth or difficulty of concentrating. 
The first hypothesis is supported only by the 
subjects’ report of anxiety when actually hear- 
ing the demonstrator being instructed. 

Aside from the phenomenon of empathy, the 
data also provide additional support for the 
results of the study by Stotland and Cottrell 
(1962) in which it was found that after individ- 
uals had interacted with another individual, they 
would predict their own performances on a task 
consistently with his performance while they 
would not do so if they had not interacted. 
Table 1 also shows that subjects in the Poor 
demonstrator condition evaluated their own abil- 
ities on the test significantly lower than in the 
Good. However, the difference on comparison of 
self-ability with other college students was not 
significant. 

None of the differences between the conditions 
in interest in the task, in attraction to the 
demonstrator, or to the group was significant. 
This lack of difference suggests that the experi- 
ment did not influence attraction to any part of 
the situation. 

The second hypothesis leads to the prediction 
that later borns more than first and only borns 
will show the difference predicted between the 
conditions. In Table 2 it can be seen first that 
both later borns and first and only borns perceive 
the demonstrator as being more anxious in the 
Poor condition than in the Good; in fact, the 
mean scores are almost identical for the two 
groups in the Poor condition. However, the first 
and only borns tend to perceive the demonstrator 
as more anxious in the Good condition than do 
the later borns. The later borns, perhaps, per- 
ceive some positive affect resulting from the 
demonstrator’s success on the task. 

On palmar sweating, there is no significant 
difference between conditions for either group. 
However, the direction of differences is con- 
sistent with the hypothesis. The later borns show 
more decrease in palmar sweating in the Good 
condition than in the Poor, as predicted, while 
the first and only borns show an opposite trend. 
(A Walker-Lev, 1953, test of the significance of 
the difference between these differences was not 
significant.) 

On none of the four items (Rows 3-6) de- 
signed to measure self-reported anxiety was there 
a significant difference among the groups. There 
is, however, a tendency suggested by nonsignifi- 
cant trends in a number of items that the later 
borns in the Poor condition reacted differently 
from any of the other groups. It has already 
been noted that they show less reduction of 
palmar sweating. They also report more anxiety 
when perceiving the demonstrator being evalu- 
ated, and while taking the questionnaire. They 
report more dryness of the mouth and difficulty in 
concentrating on the questionnaire. They made 
the poorest self-evaluations of own ability on 
the test and on comparison with other college 
students. This difference is consistent with the 
findings of the Stotland and Cottrell (1962) 
study with respect to birth order. The later 
borns in the Poor condition show more attrac- 
tion to the demonstrator than any of the other 
subjects in either condition. These subjects also 
report significantly less interest in the demon- 
stration task in the Poor condition than in the 
Good, and significantly less than the first and 
only borns in the Poor condition. This configura- 
tion of trends tends to suggest that, when the 
later borns perceive that another person with 
whom they have interacted has failed on a test, 
they experience more anxiety, feel they too 
would do poorly, and are attracted to this person. 
Yet, the process of testing is rejected by them. 


DISCUSSION 


Although the general hypothesis was supported 
only by one of the items on self-report of 
anxiety and not by the physiological measure, 
the study does demonstrate the possibility of 
taking as intangible a phenomenon as empathy 
and translating it into experimentally useful 
terms. The failure of the physiological measure 
to show a difference parallel to the difference 
in self-reports may be the result of a variety 
of factors: individual differences in the form of 
manifestation of anxiety (Lacey, 1950); birth 
order differences in opposite directions which 
cancel each other out; the subject’s adaptation to 
the experimental situation, etc. 

The difference in reactions between the later 
borns and first and only borns is consistent with 
other findings on birth order. Stotland and Dunn 
(1963) have also found that later borns em- 
pathize more than first and only borns, using 
indicators of empathy almost identical to those 
used here. Stotland and Cottrell (1962) found 
that later borns evaluate themselves more con- 
sistently with the performance level of others. 
Stotland and Dunn (1962) also found that the 
later borns identify more with other people. 
These trends are consistent with the present 
findings, that later borns appear to empathize 
more, although not significantly so; and evaluate 
their abilities more in line with those of others. 
The present study also suggests that the later 
borns sympathize more with others in the 
sense that they are attracted to another person 
who has suffered anxiety. Furthermore, they ap- 
pear to tend to place at a distance the experience 
leading to this other person's anxiety (i.e., they 
report less interest in retrospect in the test), as 
if they want to help the other suppress the 
anxiety provoking situation. As suggested by the 
Stotland and Cottrell and both Stotland and 
Dunn studies, the later borns may react as if 
they were still in a family of peers, which was 
their initial experience in life. Others may still 
be perceived as “part of the same family” and 
thus sharing a common fate. 

On the other hand, Schachter (1959) reports 
that, when anxious, first and only borns turn 
to others. In neither the present study nor in 
any of those cited in the previous paragraph 
were the subjects themselves subjected to any 
anxiety. Thus, the reaction of turning to others 
may be a socially oriented defense against 
anxiety, while identification, empathizing, and 
sympathy may be the typical reaction of later 
borns when they are not personally threatened. 


PERSISTENCE OF MALADAPTIVE RESPONSES IN SCHIZOPHRENIA 1 

EVELYN CRUMPTON 

Veterans Administration Neuropsychiatric Hospital, Los Angeles 


30 schizophrenic patients and 30 psychiatric aides were required in 4 experi- 
mental tasks to change their responses to meet changing conditions. It was 
hypothesized that (a) schizophrenic patients are more likely than normals 
to continue a response after it becomes ineffective; (b) the persistence of 
the maladaptive response is a function of the severity of schizophrenia. As 
predicted, the patients persisted longer than the aides in the first learned but 
later maladaptive responses, and there was some evidence that degree of 
resistance to change was related to the severity of schizophrenia in the patient. 


Two hypotheses are proposed: schizophrenic 
patients are more likely than normals to continue 
a response after it becomes ineffective; and 
the persistence of the maladaptive response is a 
function of the severity of schizophrenia. 

To test these hypotheses, 30 schizophrenic 
patients and 30 psychiatric aides were subjected 
to four experimental conditions, each of which 
involved shifting from a first learned but later 
unsuccessful response to a new response. It was 
predicted that the patients would continue to 
make the first learned response longer than the 
aides, and also that the more severely schizo- 
phrenic patients would persist in the ineffective 
response longer than the less severely schizo- 
phrenic patients. 

METHOD 
Subjects 

Two psychiatrists in charge of an acute intensive 
treatment ward of the Brentwood Veterans Ad- 
ministration Neuropsychiatric Hospital independently 
rank ordered 70 schizophrenic patients with respect 
to the undefined concept of “severity of schizo- 
phrenia.” Each psychiatrist first made his rankings 
in sets of 10 patients each and then merged the 
sets, placing the entire 70 patients in rank order 
by the rank-comparison rating method of Bittner 
and Rundquist (1950). The 30 patients on which 
the two psychiatrists had the closest agreement 
(rank correlation coefficient of .88) were selected 
as subjects, and the two rankings were averaged. 
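
The agreement figure and the averaging step described above can be illustrated with a minimal sketch. It assumes untied ranks and uses the textbook Spearman formula; the six-patient rankings, the convention that rank 1 means most severe, and the function names are hypothetical and are not taken from the study. 

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank correlation for two untied rankings of the same items."""
    n = len(rank_a)
    if n != len(rank_b) or n < 2:
        raise ValueError("rankings must have equal length of at least 2")
    # Sum of squared rank differences, then the standard untied-ranks formula.
    d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


def average_ranks(rank_a, rank_b):
    """Average the two judges' ranks for each patient."""
    return [(a + b) / 2 for a, b in zip(rank_a, rank_b)]


# Hypothetical rankings of six patients (assumed: 1 = most severe).
judge_1 = [1, 2, 3, 4, 5, 6]
judge_2 = [2, 1, 3, 4, 6, 5]

rho = spearman_rho(judge_1, judge_2)
combined = average_ranks(judge_1, judge_2)
print(round(rho, 3), combined)
```

With tied ranks the simple formula no longer holds exactly, so a tie-corrected computation would be needed in that case. 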


1 From the Veterans Administration Neuropsy- 
chiatric Hospital, Los Angeles, and the Neuro- 
psychiatric Institute, University of California, Los 
Angeles, Medical Center. 

Statistical analysis of data was performed in part 
by the Statistical Unit of the Neuropsychiatric Hos- 
pital Psychology Service. The author expresses ap- 
preciation to David Baxter and Edward Smith for 
their assistance in data collection. A shorter version 
of the paper was read at the California State 
Psychological Association, San Francisco, December 
1961. 


These 30 patients were then asked to give the 
meaning of 22 proverbs selected from Gorham (1956). 
Two judges independently scored each response 2, 1, 
or 0 according to appropriateness of response, and 
the patients were put in rank order on the basis of 
their overall proverb scores. The two judges agreed 
with a rank correlation coefficient of .93. The two 
rankings were merged for a single rank order of sever- 
ity. Scoring the proverbs qualitatively (as overly ab- 
stract, abstract, overly concrete, concrete, etc.) did 
not yield acceptable reliability. 

As a rough check on whether the psychiatric 
judgments and proverbs were actually getting at 
something besides severity of the overall emotional 
disturbance, the Shipley-Hartford Institute of Living 
Scale 2 was administered to the patients in groups 
of five. The CQ, which is usually interpreted as 
impairment due to either functional or organic 
disturbance, was not significantly correlated with 
either method of ranking severity of schizophrenia, 
but the IQ was correlated with the proverbs rank- 
ing (rs = .72, p < .001). 

Thus the group of 30 patients was selected and 
rank ordered for severity of schizophrenia by psy- 
chiatric judgment and also rank ordered according 
to their total proverb scores. The two methods of 
ranking severity of schizophrenia were not signifi- 
cantly correlated (rs = .15). The patients could all 
be described as chronic schizophrenics, but the level 
of overt symptoms varied from almost in remission 
to almost completely out of contact. Table 1 shows 
the composition of the sample in more detail. 

A group of 30 aides was obtained by taking the 
entire day shift of two wards who could be spared 
from their duties. They were comparable to the 
patients in age, but they had a somewhat higher 
educational level. 


2 The scale consists of a multiple-choice vocab- 
ulary test and a fill-in test of abstract thinking 
involving logical progressions. Scores on the tests 
are converted to estimates of vocabulary age and 
abstraction age. An estimate of intelligence, IQ, is 
derived from the combination of VA and AA, and 
an estimate of intellectual impairment, CQ, from 
the difference between VA and AA. The scale is 
described by Grayson (1951). 


TABLE 1 
COMPOSITION OF THE SAMPLE 

                                 Frequency    M       Range                t 
Diagnosis 
  Schizophrenic reaction             30 
    Chronic undifferentiated         16 
    Paranoid                          9 
    Catatonic                         3 
  Without psychiatric disorder       30 
Age                                                                       0.47 
  Patients                                    35.6    25-47 
  Aides                                       32.1    22-45 
Education                                                                 4.98* 
  Patients                                    10.7    4-16 
  Aides                                       12.7    8-17 
Hospitalizations (patients)                   2.4a    1-8 
Length of present hospitalization 
  (patients)                                  3.8a    3 months-15 years 

a Median. 
* p < .01. 

Procedure 


Two experimenters collected the data, one ad- 
ministering the ball game and car game, the other 
the proverbs and mazes. The order of appointments 
with the two experimenters was counterbalanced, 
but in each case the ball game was administered 
before the car game, and the easier maze before 
the more difficult maze. 


RESULTS 
Ball Game 


The ball game (Friedman, 1958) consisted of 
a box shaped like an L with four rectangular 
windows cut in the far wall of the bottom of 
the L. The subject rolled a golf practice ball 
down the alley formed by the side of the L 
toward a specified window. The first two windows 
could be reached by a straight line approach 
without going over the foul line, but the third 
window required a banking shot. The fourth 
window could not be made, but it served as an 
additional cue pointing to the need for banking. 
All of the subjects played pool and were familiar 
with the concept of banking. 

The subject was given as many tries as neces- 
sary for a successful shot at the first window and 
then at the second window, and was rewarded 
with a piece of hard candy when he succeeded. 
Most subjects got the ball through the window 
on the first or second shot. The subject was 
then told to aim at the third window, and a count 
was made of the number of trials before he 
gave up the straight line approach. It was not 
necessary to get the ball through the window; a 
trial was called successful if there was any 
attempt at either a banking shot or putting 
enough "English" on the ball to make it curve 
around the protruding edge of the box. 

As Table 2 shows, the aides took about 3 
trials before realizing that the old way no longer 
worked. The patients tried to make the old 
way work regardless of the changed conditions, 
averaging about 14 trials before shifting. Several 
patients even started the ball halfway up the 
alley and insisted they had not crossed the foul 
line. Since the distributions could not be shown 
to meet the assumptions of the t test, a chi 
square median test was also calculated; and the 
results for each task were essentially the same. 
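
A chi square median test of the kind mentioned above can be sketched as follows. This is a minimal illustration, not the author's computation: it classifies each score as above or not above the combined median, omits any continuity correction, and uses hypothetical trials-to-shift scores and a hypothetical function name. 

```python
from statistics import median


def median_test(group_a, group_b):
    """Chi-square median test on a 2 x 2 table (no continuity correction)."""
    grand_median = median(list(group_a) + list(group_b))
    # Count scores above vs. at-or-below the combined median in each group.
    a_above = sum(x > grand_median for x in group_a)
    b_above = sum(x > grand_median for x in group_b)
    a_below = len(group_a) - a_above
    b_below = len(group_b) - b_above
    table = [[a_above, a_below], [b_above, b_below]]

    n = len(group_a) + len(group_b)
    row_totals = [a_above + a_below, b_above + b_below]
    col_totals = [a_above + b_above, a_below + b_below]
    if 0 in row_totals or 0 in col_totals:
        raise ValueError("an empty margin makes the expected counts undefined")

    chi_square = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            chi_square += (table[i][j] - expected) ** 2 / expected
    return table, chi_square


# Hypothetical trials-to-shift scores; not the study's data.
patients = [14, 9, 20, 3, 17, 12]
aides = [2, 3, 1, 4, 2, 5]

table, chi_square = median_test(patients, aides)
print(table, round(chi_square, 2))
```

The resulting statistic is referred to the chi square distribution with 1 degree of freedom; with frequencies as small as these, an exact test would ordinarily be preferred. 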

Within the group of schizophrenics, the ball 
game score was significantly correlated with the 
total proverbs score (rs = .42, p < .05), but 
not with the psychiatric ranking of severity. 


Car Game 


The second task involved driving a toy car 
by remote control over a consistently rewarded 
route; later the reward was given only for a 
detour route, and the previously rewarded route 
resulted in punishment. The car was operated 
by a magnet controlled by a stick and wheel 
something like a control column for aircraft. The 
subject was told merely to drive from Miami 
to Omaha, through a territory which allowed two 
choice points. On the first trial the subject was 
rewarded with candy and "that's right" at each 
of the choice points and at the end, no matter 
which route he took. On successive trials he was 
rewarded for taking exactly the same route, and 
punished by withdrawal of candy and "that's 
wrong" for taking a different route, until he 
had completed three consecutive trips using his 
first chosen route. On the fourth trip over the 
same route, he was told "that's wrong" and 
candy was taken away for taking the same route 
that had been rewarded before. If he then took 
a different route, he was rewarded. A count was 
made of the number of trials to reach the 
criterion of taking the different route twice in 
a row. 


TABLE 2 
PERSISTENCE OF MALADAPTIVE RESPONSES 

                 Patient     Aide                  Relationship to severity 
                 X trials    X trials      t       Proverbs    Psychiatric 
Ball game          13.6        2.6       10.6***     .42*         .20 
Car game           17.2        3.7       13.5***     .33          .59 
Direct maze         7.1        1.4        3.2        7.5**       1.20 

Note. Results for the insight maze are discussed in the text. 
* p < .05. 
** p < .01. 
*** p < .001. 


In the car game the aides took about 4 trials 
to reach the criterion, and the patients took 17 
trials, a highly significant difference (see Table 
2). The car game scores were significantly cor- 
related with both measures of severity (psy- 
chiatric ranking, rs = .33, p < .05; proverbs 
score, rs = .59, p < .01). 
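
The car game score is a count of trials to a criterion, which can be made concrete with a short sketch. It rests on assumptions the report does not state: that any route other than the first chosen route counts as "the different route," and that the count runs through the second of the two consecutive criterion trials. The route labels and function name are hypothetical. 

```python
def trials_to_criterion(routes, first_route, run_length=2):
    """Return the 1-based trial on which the criterion is met, or None.

    The criterion (an assumption of this sketch) is `run_length` consecutive
    trials on any route other than `first_route`.
    """
    consecutive = 0
    for trial, route in enumerate(routes, start=1):
        if route != first_route:
            consecutive += 1
            if consecutive == run_length:
                return trial
        else:
            # A return to the old route resets the run.
            consecutive = 0
    return None


# Hypothetical routes taken after the old route stopped being rewarded.
post_shift_routes = ["A", "A", "B", "A", "B", "B"]
score = trials_to_criterion(post_shift_routes, first_route="A")
print(score)
```

Under these assumptions a subject who shifts at once and stays shifted scores 2, the minimum possible. 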


Mazes 


The last two tasks were concealed maze pat- 
terns rewarded only by knowledge of correct or 
incorrect response; the first one involved a shift 
from a straight line to a detour approach, and 
the second one provided opportunity for insight 
into which pathways were blocked by a barrier 
placed across the previously rewarded path. 

The two maze problems were set up on a fully 
adjustable electrical pegboard maze. The top of 
the box showed merely a pegboard square with 
144 holes showing metal rivets beneath. The sub- 
ject ran the maze by putting a metal stylus 
through the hole he chose. Some of the rivets, 
which could be placed in any position, were long 
enough to make contact with a metal plate 
underneath, which caused a bulb to light up at 
one corner of the box. After each trial, he was 
told to turn around while the experimenter 
might or might not change things. Thus he was 
given a clue that the maze might be changed. 
After the subject had run the maze successfully, 
the maze pattern was changed so that the sub- 
ject had to turn his back on the goalbox in order 
to take a correct route. 

Again, as Table 2 shows, the schizophrenics 
took considerably more trials than the aides be- 
fore shifting from the first learned response. The 
patients had a mean of about 7, the aides less 
than 2. 

Similarly to the ball game, the direct maze 
persistence scores were not significantly related 
to psychiatric ranking of severity, but they were 
related to the proverbs score (median test χ² 
= 7.49, p < .01). (Here the chi square median 
test was used in preference to rank correlation 
because of the large number of tied scores.) 

The second maze pattern was designed after 
the Tolman-Honzik (1930) maze for demonstrat- 
ing "insight" in rats. It consisted of three paths, 
a direct path, a short path, and a long path. The 
subject was given the opportunity to familiarize 
himself with all three paths, and then required 
to find the correct path, which at first could be 
either the direct or the short path. This time 
the light was turned on by a correct response. 
After the subject had negotiated the path suc- 
cessfully, a barrier was placed across the direct 
path such that it blocked both the direct and 
the short paths; only the long path remained as 
a successful route. The subject continued to run 
the maze until he settled on the long path, or 
until the pattern of his response became obvious. 
This maze turned out to be more difficult for 
these subjects than the Tolman-Honzik maze 
was for rats. Some of the subjects, both schizo- 
phrenics and aides, never solved it, and very few 
showed evidence for insight by going immedi- 
ately to the unblocked long path. 

The responses were classified as insight, testing 
insight, trial and error, and haphazard. The aides 
and patients differed in the types of response 
favored (χ² = 16.27, df = 3, p < .001); more 
patients than aides used a haphazard approach 
(χ² = 9.68, df = 2, p < .01), and more aides 
than patients used a testing insight approach 
(χ² = 9.69, df = 2, p < .01) (the subject made 
the correct response but was not sure so had to 
test out by later trying a wrong path). In each 
group the modal response was the trial and 
error approach. 

It was not possible to test the relationship 
between the insight maze scores and the measure- 
ments of severity, because of the uneven dis- 
tribution of responses in the schizophrenic group. 
Inspection of the data suggested that larger 
frequencies in the cells probably would not have 
yielded anything beyond slight trends toward 
confirmation of the hypothesis. 


DISCUSSION 


The results of the four tasks, taken as a whole, 
indicated confirmation of the first hypothesis 
and partial confirmation of the second. Schizo- 
phrenics were shown to be different from these 
normals in clinging to maladaptive responses 
that once were successful, and under certain cir- 
cumstances their persistence was a function of 
the severity of schizophrenia, especially as re- 
flected in their interpretation of proverbs. On 
each task patients took more trials than did 
aides to shift from the first learned but now 
maladaptive response to the new response. On 
the ball game, car game, and simpler maze, 
difficulty in shifting was related to total proverb 
score. Only in the car game was persistence related to 
psychiatric judgment of severity. 

Psychiatric aides were used as a normal con- 
trol group from practical rather than scientific 
considerations. Certainly aides differ from schizo- 
phrenics in many ways other than the symptoms 
of schizophrenia. The patients were institutional- 
ized and tranquilized, and probably had in their 
own minds much less freedom of choice about 
participation in the experiment. The contamina- 
tions of hospitalization and medication could 
have been avoided by using as a nonschizophrenic 
control some other group of patients, such as 
personality disorders or neurotics. This type of 
control was not used because of our hunch 
that persons with any sort of neuropsychiatric 
disorder have trouble changing their behavior to 
meet changing conditions. 

The patients and aides were known to differ 
in educational level. The relationship of educa- 
tional level to the tasks was considered for 
patients and aides separately, since the distribu- 
tions were known to differ, and the tables for 
Fisher’s exact test were used to evaluate the 
small frequency contingency tables. Not one of 
the four tasks or the two measures of severity 
showed any significant association with educa- 
tional level. 
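
The author consulted published tables for Fisher's exact test; the same probability can be computed directly, as in the sketch below. It assumes a 2 x 2 table and the common two-sided convention of summing every table that is no more probable than the observed one. The cell counts and function name are hypothetical, and how the educational level and task scores were dichotomized is not given in the report. 

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p for the 2 x 2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2

    def prob(x):
        # Hypergeometric probability of x in the upper-left cell,
        # with all margins held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    # Sum the tables that are as probable as, or less probable than, the
    # observed one (small tolerance guards against rounding error).
    return sum(
        p
        for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Hypothetical counts: rows = lower/higher education,
# columns = slow/fast shift on one task.
p_value = fisher_exact_two_sided(3, 0, 0, 3)
print(round(p_value, 3))
```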

The hypothesis relating abnormally persistent 
behavior to severity of schizophrenia received 
some support, which itself suggests that the dif- 
ferences found between the schizophrenics and 
aides were actually related to the psychiatric 
disorder. The fact that the two severity meas- 
ures were not significantly correlated indicates 
that they were getting at different things, even 
though both might have touched on some aspect 
of severity of schizophrenia. Judging from the 
results, the proverbs test did a better job than 
the psychiatric ranking. The relationship between 
IQ and proverb scores (rs = -.72, p < .01) 
leads us to question whether persistence was 
really related to IQ rather than severity of 
schizophrenia. But IQ was not significantly re- 
lated to any of the scores on the four tasks (ball 
game rs = .03, car game rs = -.09; direct maze 
χ² = 2.12, df = 2, p < .20; insight maze χ² = .38, 
df = 3, p < .10). 


Only the car game scores were correlated with 
psychiatric ranking of severity, although the 
correlation coefficients for the other tests did not 
miss statistical significance by too much. The car 
game differed from the other tasks in two main 
ways: the initial response was left up to the 
subject, and the game was experienced as frus- 
trating. In the ball game, the way the box was 
built determined what response was appropriate; 
and in the two mazes, again the reality of the 
situation called for a certain type of response. 
(We know or at least think that schizophrenics 
differ from aides in their perception of or at- 
tention to reality.) In the car game, however, the 
patient was free to respond in his own way, to 
take whichever route he chose, even though it 
might be the longest and the most difficult way 
to get from Miami to Omaha; and this way was 
established as his own most adaptive response to 
the problem. Then his own preferred route was 
made wrong, and the new reality of the situation 
demanded that he learn a new way. Most of 
our subjects tried hard to drive properly even 
though they were reassured that it was not 
necessary. They kept to the right. They stopped 
at stop signs. They tried hard to keep the car 
facing forward even though the car often flipped 
around sideways or even backtracked. Learning 
to drive the car was something like learning 
mirror drawing. It took good motor coordination 
and a fairly high degree of ability to concentrate 
on the movements without undue motor tension. 
Any jerkiness would send the car off course 
or even completely turn it around. But it was 
fun to drive the car. Thus the car game was 
apparently experienced by most subjects as 
both challenging and highly frustrating. We are 
reminded of Maier’s rats who developed fixated 
responses in a frustrating situation. We think 
also of the notion that the schizophrenic’s long 
and tortuous route in approaching a human rela- 
tionship might in his earliest frustrating and 
challenging relationships have been the most 
adaptive response he possibly could have made 
and one he would be reluctant to give up. It 
seems reasonable then that the car game was 
better designed than the other tasks to bring out 
the hypothesized relationship between severity 
and the persistence of maladaptive responses. 


Experimental Ss were verbally conditioned to use more future tense responses 
in making up sentences. Transfer effects were assessed using a 2nd sentence 
completion task containing certain stimulus cues present on the original task. 
Transfer of learning was demonstrated on the basis of a grossly defined, 2-step 
gradient of similarity of stimulus cues contained on the learning and transfer 
tasks. The variables of S's awareness and S's sex did not lead to statistically 
significant between-group differences. The response form (written vs. oral) also 
failed to demonstrate a significant effect. 


Numerous experiments have been conducted 
in which generalized verbal reinforcers were 
used to manipulate operant verbal behavior. In 
many of these verbal conditioning experiments, 
a particular verbal response class such as plural 
nouns was subjected to experimental manipula- 
tion (e.g., Greenspoon, 1955; Salzinger & Pisoni, 
1958). In such experiments the increase in rela- 
tive frequency of responses of a particular verbal 
response class depended on generalization of 
verbal reinforcement effects throughout the re- 
sponse class. Until recently, however, few in- 
vestigators in the area of operant verbal con- 
ditioning have been concerned with the problem 
of transfer or generalization of learning effects 
from one stimulus situation to another. 

Interest in this area was apparently initiated 
by Sarason (1956). Since Sarason's experiment 
was reported, investigations have been conducted 
by Timmons (1959); Rogers (1960); Rickard, 
Dignam, and Horner (1960); Tobias (1960); 
Rosenberg (1961); Simkins (1961); Timmons, 
Noblin, Adams, and Butler (1961); and Singer 
(1961). In five of these nine experiments results 
were considered statistically significant with re- 
gard to generalization of the effects of verbal 
conditioning. Changing experimenters had no 
effect upon degree of generalization in the studies 
by Tobias and Timmons et al. Absence of the 
experimenter, on the other hand, led to dif- 
ferential results in the experiment conducted by 
Singer. Except with regard to the experimenter 
as a stimulus cue variable, in none of the experi- 
ments reported was there an attempt to assess 
differential degrees of transfer as a function of 
similarity of cues in the learning and the transfer 
task situations. An attempt to investigate the 
variable of task cue similarity was the major 
objective of the present study. Secondary ob- 
jectives were to assess the effects of sex and 
degree of awareness upon both learning and 
generalization of verbal behavior. 


1 This study is based upon a dissertation sub- 
mitted in partial fulfillment of the requirements for 
the degree of Doctor of Philosophy to the graduate 
school of Florida State University. The author is 
indebted to Joel Greenspoon for his advice and 
guidance. 


METHOD 
Subjects 


Ninety-seven junior college students were used as 
subjects. Four control groups each contained 10 
subjects. Two experimental groups each included 10 
subjects who met a specific learning criterion. There 
was an attempt to balance the proportion of male 
and female subjects among the groups; otherwise, 
assignment of subjects to the various groups was 
on a random basis. 


Materials 


For the original learning task, three tense forms 
of each of 40 simple verbs were typed on 40 3 x 5 
inch white index cards. In addition to three tense 
forms of the same verb on each card, three pro- 
nouns, “He,” “She,” “It,” were typed below. The 
order of verb tense and pronouns was random. The 
task, as presented to the subjects, was to make up 
sentences by selecting one of the three pronouns to 
start the sentence and one of the three verb tenses 
as the second part of the sentence. The mode of 
verbal reinforcement was the experimenter’s saying 
either “good,” “fine,” or “uh huh” in random 
sequence. 

The transfer task consisted of a sentence com- 
pletion task of 42 items. Twenty-one of the items 
consisted of either He, She, or It beginnings which 
were interspersed among 21 verbless beginning words 
such as “Strange things,” “People,” etc. The 42 
beginning words or items on the transfer task were 
contained on one sheet of paper and in the same 
order for each subject. 

Thus, the two tasks were similar in that they 
both required subjects to make up sentences from 
various beginning words. They differed in that the 
original task required the subject to choose a 
pronoun and a verb from those contained on a card, 
while on the transfer task no verb was provided. 
In addition, while the beginning words, He, She, It, 
on the transfer task were the same pronouns as 
those used on the original task situation, the other 
beginning words such as People, Strange things, 
etc., were different. Hereafter, the original sentence 
completion task will be referred to as Task A, and 
the transfer task as Task B. 


TABLE 1 
EXPERIMENTAL PARADIGM 

Group   Original task procedure              Transfer task procedure 
E1      Task A: subjects give sentences      Task B: subjects give sentences 
        orally; future tense selections      orally; no reinforcement 
        reinforced 
E2      Same as E1                           Task B: subjects write sentences; 
                                             no reinforcement 
C1      Task A: subjects give sentences      Task B: subjects give sentences 
        orally; no reinforcement             orally; no reinforcement 
C2      Same as C1                           Task B: subjects write sentences; 
                                             no reinforcement 
C3                                           Task B: subjects give sentences 
                                             orally; no reinforcement 
C4                                           Task B: subjects write sentences; 
                                             no reinforcement 


Procedure 


As indicated by Table 1, a summary of the 
experimental design, subjects were divided into four 
control groups (C1, C2, C3, C4) and two experimental 
groups (E1, E2). Subjects in E1 were instructed to 
complete sentences by saying them aloud on both 
Task A and Task B. They were verbally reinforced 
for selection of a future tense verb in making up 
sentences on Task A. They were then instructed to 
make up additional sentences from beginning words 
on Task B. No reinforcement was given to the 
subjects, regardless of group, on Task B. The 
procedure for subjects in Group E2 was the same 
as for subjects in E1 except that on Task B they 
were instructed to write out the sentences rather 
than say them aloud. Subjects in Groups C1 and C2 
were instructed to do both Task A and Task B 
but did not receive verbal reinforcement, regardless 
of verb tense usage. Subjects in Groups C3 and C4 
did only Task B and received no verbal reinforce- 
ment, regardless of tense usage. The only difference 
between these latter groups was that subjects in 
Group C3 completed the sentences by saying them 
aloud, while subjects in Group C4 wrote out their 
sentence completions. Control Groups C3 and C4 
were used specifically to assess for a possible de- 
veloped response class set independent of verbal 
reinforcement (i.e., in Groups C1 and C2). 

After completion of both Task A and Task B, 
subjects in Groups E1 and E2 were interviewed 
with regard to awareness of reinforcement and 
reinforcement contingency. An interview question- 
naire similar to that devised by Levin (1961) was 
used for this purpose. 


RESULTS 


The frequency of the use of future tense by 
subjects in making up sentences constituted the 
data for statistical evaluation on both Task A 
(learning task) and Task B (transfer task). The 
data of all subjects were retained in evaluating 
for between-group differences with regard to 
original learning. A preimposed learning criterion 
for experimental subjects was used with two 
objectives: to be able to assess for transfer 
effects of subjects meeting the learning criterion 
if the total sample of experimental subjects 
failed to demonstrate learning; and to be able 
to assess for between-group differences in terms 
of transfer behavior which might not be demon- 
strated by use of the total experimental subject 
sample. As the results below will indicate, use of 
this learning criterion did not prove necessary. 


Group means of future tense frequency by
subjects in the combined experimental (E1, E2)
and combined control (C1, C2) groups are summarized
in Table 2 in blocks of 10 trials. A
trend analysis (Edwards, 1960) of the data 
illustrated in Table 2 indicated statistically sig- 
nificant between-group differences with respect 
to trials, treatment, and treatment-trial inter- 
action at the .01 level. Thus, there was sub- 
stantial evidence that treatment was effective in 
producing conditioning of operant verbal be- 
havior. 

Differences with respect to the variables of 
sex and degree of the subject's awareness were 
not statistically significant (p > .10).


TABLE 2


MEAN NUMBER OF FUTURE TENSE RESPONSES
FOR EXPERIMENTAL AND CONTROL GROUPS
(E1, E2, C1, C2) IN BLOCKS OF TEN
TRIALS AND ALL TRIALS
ON TASK A


                       Trials
Group      1-10   11-20   21-30   31-40   All trials
E1, E2     2.98    3.95    3.65    4.68      15.26
C1, C2     2.80    2.85    2.10    2.55      10.30
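
As an arithmetic check on Table 2, the short Python sketch below sums the four block means for each combined group and compares the result with the reported figure for all trials. It also lists the experimental-minus-control difference in each block of ten trials. The function and variable names are illustrative assumptions and are not part of the original analysis; the numbers are those printed in Table 2.

```python
# Block means from Table 2 (Task A): trials 1-10, 11-20, 21-30, 31-40.
block_means = {
    "E1, E2": [2.98, 3.95, 3.65, 4.68],
    "C1, C2": [2.80, 2.85, 2.10, 2.55],
}
# "All trials" column as reported in Table 2.
reported_totals = {"E1, E2": 15.26, "C1, C2": 10.30}


def total_over_blocks(means):
    """Sum the block means, rounded to two decimals as in the table."""
    return round(sum(means), 2)


def block_differences(experimental, control):
    """Experimental minus control mean for each block of ten trials."""
    return [round(e - c, 2) for e, c in zip(experimental, control)]


if __name__ == "__main__":
    for group, means in block_means.items():
        print(group, total_over_blocks(means), reported_totals[group])
    print(block_differences(block_means["E1, E2"], block_means["C1, C2"]))
```

The block sums agree with the reported totals for both groups, and the block-by-block differences grow across trials, which is the pattern a treatment-trial interaction would describe.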




The manner in which the subjects performed
the transfer task (i.e., written versus oral)
did not affect operant rates of future tense responses
to a statistically significant extent. Also,
the between-group differences of control groups
were not statistically significant with respect to
the possibility of a developed response set (C1,
C2 versus C3, C4). Thus, the transfer data of
the control groups (C1, C2, C3, C4) were combined
for purposes of making comparisons with
the combined experimental groups (E1, E2).
Table 3 contains the comparative results on the
transfer task with respect to these combined
groups. The data are broken down with respect
to the type of beginning on the transfer task in
order to illustrate the gross generalization 
gradient as a function of task-cue similarity. 

Because the transfer data were not homogeneous
with respect to within-group variance, square
root transformations were performed. These transformations
reduced the heterogeneity of within-group
variance so that it was no longer statistically
significant. The transformed transfer data of the
combined control versus the combined experimental
groups were analyzed by a series of
t tests with respect to transfer to all beginnings,
transfer as elicited by personal pronoun beginnings,
and transfer to other types of beginnings.
Statistically significant between-group differences
in future tense responses to all beginnings (p
< .01), personal pronoun type beginnings (p
< .01), and other type beginnings (p < .05)
were demonstrated. These between-group differences
were all in favor of the experimental
groups, and the null hypothesis is rejected. (Incidentally,
use of a nonparametric statistic and
a parametric statistic using raw data led to
highly similar levels of significance.)
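
The two steps described above, a square root transformation followed by a t test on the transformed scores, can be sketched in Python as follows. This is only an illustration under stated assumptions: the counts are hypothetical (the raw data are not reported), the helper names are invented for the example, and a pooled-variance Student's t for independent samples is assumed because the text does not say which form of the t test was used.

```python
import math
from statistics import mean, variance


def sqrt_transform(scores):
    """Square-root transform of frequency counts to stabilize variance."""
    return [math.sqrt(x) for x in scores]


def pooled_t(sample_a, sample_b):
    """Student's t for two independent samples with pooled variance.

    Returns (t, degrees_of_freedom).
    """
    n_a, n_b = len(sample_a), len(sample_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(sample_a)
                  + (n_b - 1) * variance(sample_b)) / df
    std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(sample_a) - mean(sample_b)) / std_err, df


# Hypothetical future tense counts, NOT data from the study.
experimental = [9, 4, 16, 1, 9, 4]
control = [1, 4, 1, 0, 4, 1]

if __name__ == "__main__":
    t, df = pooled_t(sqrt_transform(experimental), sqrt_transform(control))
    print(f"t = {t:.2f}, df = {df}")
```

The resulting t would then be referred to a t table at the stated degrees of freedom; running the same function on the untransformed counts gives the raw-data comparison mentioned in the parenthetical remark.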

On the basis of a gross two-point gradient of
task-cue similarity, it was expected that personal
pronoun type beginnings would elicit more
transfer of behavior than would other type
beginnings with respect to Group E. This experimental
hypothesis was supported. Using a difference
test (Lacey, 1953), a Student's t of
3.96 (p < .05) was obtained. The possibility
that this difference might be attributed to differential
initial operant rates of future tense
responses by control subjects to the two types
of beginnings on the transfer task was rejected;
i.e., the control group (C) mean operant rates
of future tense responses with respect to the
two types of beginnings on the transfer task
were identical.


TABLE 3


MEANS AND STANDARD DEVIATIONS OF FUTURE
TENSE RESPONSE OF EXPERIMENTAL AND
CONTROL GROUP SUBJECTS WITH RESPECT
TO THE TYPE OF BEGINNING
ON THE TRANSFER TASK


          All beginnings   Pronoun beginnings   Other beginnings
Group       M      SD         M       SD           M      SD
E          6.11   4.23       3.93    3.47         2.18   1.49
C          3.05   1.79       1.52    1.41         1.52   0.99
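
The two-point gradient can be read directly off the Table 3 means. The Python sketch below computes the pronoun-minus-other difference for each combined group and checks that the two partial means add up to the mean for all beginnings. The names are illustrative assumptions, the control mean for all beginnings is taken as 3.05, and the tolerance allows for rounding in the printed table.

```python
# Means from Table 3 (transfer task) for the combined groups.
table3_means = {
    "E": {"all": 6.11, "pronoun": 3.93, "other": 2.18},
    "C": {"all": 3.05, "pronoun": 1.52, "other": 1.52},
}


def gradient(group):
    """Pronoun-beginnings mean minus other-beginnings mean for one group."""
    m = table3_means[group]
    return round(m["pronoun"] - m["other"], 2)


def parts_match_total(group, tolerance=0.015):
    """True if the pronoun and other means sum to the all-beginnings mean,
    within a tolerance that allows for rounding in the printed table."""
    m = table3_means[group]
    return abs(m["pronoun"] + m["other"] - m["all"]) <= tolerance


if __name__ == "__main__":
    for group in table3_means:
        print(group, gradient(group), parts_match_total(group))
```

On these figures the experimental group shows a difference of 1.75 responses in favor of pronoun beginnings, while the control group shows none, which is why differential operant rates were ruled out as an explanation.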

With respect to the transfer task data, the 
variables of sex of the subject and degree of 
awareness did not lead to statistically significant 
between-group differences (p > .10). Although
the transfer task group differences were evaluated 
with respect to the awareness of subjects of the 
experimenter's behavior during learning, it should 
be pointed out that not a single subject was 
able to express correctly the relationship of Task 
A conditions to Task B. Thus, no subject was 
aware that transfer of modified verbal behavior 
was being investigated. 


DISCUSSION


The present experiment represented an attempt 
to investigate the occurrence of differential de- 
grees of transfer of operant verbal behavior on 
the basis of a two-step, grossly defined gradient 
of task-cue similarity. Positive, differential re- 
sults were obtained in the expected direction; 
i.e., the type of beginning items on the transfer
task which were most similar to stimulus cues 
contained on the learning task elicited markedly 
more transfer behavior than did dissimilar begin- 
ning items. The problem remains, however, as 
to how to improve the selection and definition 
of a stimulus gradient in order to assess transfer 
of verbal behavior as a function of similarity 
more adequately. Further investigations of the 
variable of similarity would require some work- 
able definition of the stimulus similarity con- 
tinuum. Whether attempts to define this con- 
tinuum would be more useful in terms of phys- 
ical attributes of the stimulus situation, task-cue 
similarity, similarity of task-cue meanings or 
associations, or phenomenological definitions of 
similarity remains for future investigation. 

The lack of significant between-group differences
of the experimental groups (E1 versus E2)
as a function of a change from oral to written 
verbal behavior is in accord with the related 
results by Timmons (1959). This finding sup- 
ports the contention that learning effects may 
be readily generalized from one response sys- 
tem to another (i.e., oral responses to written 
responses). However, to argue from this finding
that changes in verbal behavior would generalize 
to other diverse forms of motor behavior would 
be considered premature. 

Although degree of awareness as defined by 
Levin (1961) had no significant effect, the re- 
sults were in the same direction as those obtained 
by Levin with respect to the learning data. However,
Levin's argument that the lack of a significant
effect between his completely unaware
subjects and completely aware subjects was due 
to the fact that his questionnaire was inadequate 
for eliciting awareness is considered untenable. 
A reading of this questionnaire convinces the 
present author, at least, that it could hardly 
be more suggestible to subjects in regard to the 
experimenter's behavior during learning. As in- 
dicated previously, none of the subjects in the 
present experiment expressed awareness as to 
the relationship between the original learning 
task and the transfer task. Yet, transfer effects 
of verbal conditioning were demonstrated. 

In the present experiment, the major reason
for evaluating for sex differences in use of future 
tense responses was for control purposes. The 
results indicated that initial operant behavior,
learning behavior, and transfer behavior were
not influenced by the subject's sex. These
negative results are in accord with the findings 
of Rosenberg (1961) who varied sex of the 
experimenters rather than the subjects; they 
are in disagreement with the findings of other 
investigators (e.g., Bender, McConnell, &
Sjoholm, 1957; Simkins, 1961) who have noted
significant effects based on the variable of sex,
either with respect to the experimenters or the
subjects. Thus, this question of sex differences
remains open with seemingly contradictory evidence.
However, as Rosenberg points out, it is
probable that these contradictions are based on
differences in experimental technique and subsequent
differences with respect to interactions
of the variable of sex with other experimental
variables.


ERRATUM


On page 178 of the article “The Relationship between Agreeing Response Set and
Social Desirability,” by Leonard Solomon and Edward Klein (J. abnorm. soc.
Psychol., 1963, 66, 176-179), the word “squared” was omitted in the first line of
Column 2 and should read:

sum of whose squared loadings was 1.00 or more were... .


